Feeds

How WinXP can make non-MS files invisible

Put that paranoia away - it's a notabug

  • alert
  • submit to reddit

Combat fraud and increase customer satisfaction

Updated again: Windows XP's search system includes a bizarre feature that appears to exclude files with non-Microsoft file extensions, under some conditions. It is however so odd that it's surely got to be a bug, rather than monkey business. But you could go as far as saying it's one of those MS things that inconvenience other companies if they don't do things the new way we're doing them in Redmond.

But in this case, it's largely just a minor inconvenience, albeit one that can easily baffle users, and did this one.

Here's how you can verify it. Go to an XP directory where you know you've got files with both Microsoft and non-Microsoft extensions. Search for *.doc, or another Microsoft extension of your choice and will show up. Search for a non-Microsoft extension in the same way, and it'll show up too. Obviously.

Now, search by the extension and also for a string that you know is going to be in the file. For documents, "the" or "and" would be a pretty good bet, for C++ files (*.cpp) you're inevitably going to get a "for". You still find the file with the Microsoft extension, but magically, it can't find the one with the non-Microsoft extension.

Thanks to the reader who pointed us at this one. He checked with *.java and *.wpd, and we've just checked it with *.ddf (Musicmatch) and *.js (Javascript). But does it apply to every non-MS extension, and if so, why?

Later

We've a lot of mail coming in about this one. Changing associations and extensions seems not to change the result. Change a .txt file to a .cpp while maintaining the Notepad association doesn't make it findable. Associating a .isu file with Notepad does not make it findable, changing its extension to .txt still does not make it findable.

And it doesn't seem to be anything to do with the capabilities of the indexing service, because that is switched off on the machine we're trying it on.

Or is it? In Windows 2000 the search defaulted to treating everything as .txt, so it'd crunch through everything it didn't understand. WinXP (our thanks to Alex Fein and the reader who pointed us at his explanation) doesn't default to .txt, and ignores everything it doesn't have a filter for. This, clearly, is nothing to do with whether you've got the index service switched on or off, because on the machine we're using here the service has never been switched on in the first place.

So that thought was a red herring, and with hindsight a dumb thing to think in the first place. XP has its own batch of Microsoft filters that install with the software, and Office XP comes with a few more. If search finds a file extension it doesn't have a filter for, then it skips the file. Developers who need the search system to be able to search inside their files therefore now have to produce filters for them, or to register them with one of the Microsoft filters.

This doesn't as far as we can see, explain how you can change the extension of a file with no filter for it to .txt, change its association to Notepad for good measure, and still not be able to find it. So there's surely something more in there, but it does seem to be related to filters.

Ah, but we seem to be there now. Thanks to Ami for telling us: "The filters are probably invoked based on file-association, but ignore files that either don't have the correct extension or don't match the format (most file formats have a 'magic' number at the beginning that confirms the format is what the user says it is)." Gotcha, we think.

Why did Microsoft make the switch? Skipping huge MP3s that don't need to be searched in would speed things up considerably, and if you were using the index system, keep the index size down. Alex Feinman has a fuller explanation here, plus a routine that will change the behaviour if you want to do so. ®

3 Big data security analytics techniques

More from The Register

next story
Ubuntu 14.04 LTS: Great changes, but sssh don't mention the...
Why HELLO Amazon! You weren't here last time
This time it's 'Personal': new Office 365 sub covers just two devices
Redmond also brings Office into Google's back yard
Next Windows obsolescence panic is 450 days from … NOW!
The clock is ticking louder for Windows Server 2003 R2 users
Half of Twitter's 'active users' are SILENT STALKERS
Nearly 50% have NEVER tweeted a word
OpenBSD founder wants to bin buggy OpenSSL library, launches fork
One Heartbleed vuln was too many for Theo de Raadt
Got Windows 8.1 Update yet? Get ready for YET ANOTHER ONE – rumor
Leaker claims big release due this fall as Microsoft herds us into the CLOUD
Batten down the hatches, Ubuntu 14.04 LTS due in TWO DAYS
Admins dab straining server brows in advance of Trusty Tahr's long-term support landing
Red Hat to ship RHEL 7 release candidate with a taste of container tech
Grab 'near-final' version of next Enterprise Linux next week
Apple inaugurates free OS X beta program for world+dog
Prerelease software now open to anyone, not just developers – as long as you keep quiet
prev story

Whitepapers

Mobile application security study
Download this report to see the alarming realities regarding the sheer number of applications vulnerable to attack, as well as the most common and easily addressable vulnerability errors.
3 Big data security analytics techniques
Applying these Big Data security analytics techniques can help you make your business safer by detecting attacks early, before significant damage is done.
The benefits of software based PBX
Why you should break free from your proprietary PBX and how to leverage your existing server hardware.
Securing web applications made simple and scalable
In this whitepaper learn how automated security testing can provide a simple and scalable way to protect your web applications.
Combat fraud and increase customer satisfaction
Based on their experience using HP ArcSight Enterprise Security Manager for IT security operations, Finansbank moved to HP ArcSight ESM for fraud management.