
Linguists use sounds to bypass Skype crypto

And you thought grammar was useless…


Decryption is difficult and computationally expensive. So what if, instead of decrypting the content of a message, you found a correlation between the encrypted data and its meaning – without having to crack the code itself?

Such an approach has been demonstrated by a group of University of North Carolina linguists working with computer scientists on encrypted Skype calls. Although the researchers managed only a partial recovery of conversations, an encryption scheme that leaks even some of the data it’s meant to protect is no longer secure.

It works like this: spoken English has a set of known – and quite settled – rules for its phonetic grammar.

For non-linguists, this means the order in which we can and cannot put different sounds together. The “ds” sound, or phoneme, at the end of “sounds” is fairly common at the end of English words, but doesn’t occur at the beginning.

Systems like speech-to-text converters use these rules to break strings of sounds into individual words; they match sounds against a dictionary of legal phoneme combinations and map these into words. What the researchers discovered is that encryption leaves a pattern that can be subjected to this kind of analysis – without decrypting the data.
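As a rough illustration of the dictionary-matching step, here is a minimal sketch that splits a run of phonemes into words. The tiny lexicon, the ARPAbet-style phoneme symbols and the function name `segment` are all assumptions made up for this example; real speech-to-text systems use far larger dictionaries and probabilistic models.

```python
# Hypothetical pronunciation dictionary: phoneme tuples -> words.
# The symbols and entries are illustrative assumptions, not real data.
LEXICON = {
    ("h", "ae", "n"): "han",
    ("s", "ow", "l", "ow"): "solo",
    ("s", "aw", "n", "d", "z"): "sounds",
}

def segment(phonemes, lexicon=LEXICON):
    """Split a phoneme sequence into dictionary words.

    Returns a list of words, or None if no full segmentation exists.
    """
    n = len(phonemes)
    best = [None] * (n + 1)   # best[i] = words covering phonemes[:i]
    best[0] = []
    for end in range(1, n + 1):
        for start in range(end):
            if best[start] is None:
                continue
            word = lexicon.get(tuple(phonemes[start:end]))
            if word is not None:
                best[end] = best[start] + [word]
                break
    return best[n]

words = segment(["h", "ae", "n", "s", "ow", "l", "ow"])
print(words)
```

A sequence that cannot be covered by dictionary entries yields `None`, which is how a string of "illegal" sounds gets rejected.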

When you encode spoken English for VoIP using (in the case of Skype) CELP (code-excited linear prediction), you will end up with patterns in the data that match the patterns in the sounds. In particular, those patterns end up being reflected in the size of the data frame: the more complex the sound that’s being encoded, the larger the frame, resulting in a correlation between frame size and the original sounds spoken.

When the data created by CELP is encrypted, it retains the original frame size – and that means that even encrypted Skype data will retain the correlation between the size of the data frame and the original phonemes.
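To see why the frame size survives encryption, consider a minimal sketch with a toy length-preserving cipher. The SHA-256-based keystream below is an assumption chosen for illustration only; it is not Skype's cipher, and the frame sizes are invented. The point is simply that any scheme that encrypts each frame byte for byte hands an eavesdropper the same sequence of lengths the codec produced.

```python
import hashlib

def keystream(key: bytes, nonce: int, length: int) -> bytes:
    """Toy keystream: SHA-256 in counter mode (illustration only)."""
    out = bytearray()
    counter = 0
    while len(out) < length:
        block = hashlib.sha256(
            key + nonce.to_bytes(8, "big") + counter.to_bytes(8, "big")
        ).digest()
        out.extend(block)
        counter += 1
    return bytes(out[:length])

def encrypt_frame(key: bytes, nonce: int, frame: bytes) -> bytes:
    """XOR the frame with the keystream; output length equals input length."""
    ks = keystream(key, nonce, len(frame))
    return bytes(a ^ b for a, b in zip(frame, ks))

# Hypothetical variable-bit-rate frames; the sizes are made up.
frames = [bytes([i]) * size for i, size in enumerate([12, 33, 33, 20, 41])]
key = b"demo-key"
encrypted = [encrypt_frame(key, n, f) for n, f in enumerate(frames)]

plain_sizes = [len(f) for f in frames]
cipher_sizes = [len(c) for c in encrypted]
print(plain_sizes, cipher_sizes)
```

The contents of each frame are scrambled, but the two size lists are identical, and the size list is all this kind of analysis needs.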

The technique gets another helping hand: at least some of the time, boundaries between sounds correspond to sudden changes in frame size, hinting at the difference between “Han Solo” and “Hans Solo”.
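A minimal sketch of that idea: treat any jump in frame size larger than some threshold as a candidate boundary between sounds. The threshold, the sample sizes and the function names are assumptions for illustration; how the researchers actually located boundaries is not described here.

```python
def find_boundaries(sizes, threshold):
    """Indices i where frame i differs from frame i-1 by more than threshold bytes."""
    return [
        i for i in range(1, len(sizes))
        if abs(sizes[i] - sizes[i - 1]) > threshold
    ]

def split_segments(sizes, threshold):
    """Cut the sequence of frame sizes at every detected boundary."""
    segments = []
    start = 0
    for boundary in find_boundaries(sizes, threshold):
        segments.append(sizes[start:boundary])
        start = boundary
    segments.append(sizes[start:])
    return segments

# Invented frame sizes (bytes) as an eavesdropper might observe them.
observed = [20, 21, 20, 38, 39, 37, 12, 13]
print(find_boundaries(observed, threshold=8))
print(split_segments(observed, threshold=8))
```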

The researchers mapped the size of encrypted data frames in the Skype stream back to likely patterns of phonemes, and used that mapping – which they called “Phonetic Reconstruction” – to reconstruct the call, without decrypting the data.
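The mapping step can be pictured with a deliberately crude sketch: label each run of frames with whichever sound class has the closest typical frame size. The `PROFILE` table, its class names and its numbers are invented for this example, and a nearest-mean lookup stands in for whatever statistical models the researchers actually used.

```python
# Hypothetical "typical frame size" per sound class, in bytes.
# These values are made up for illustration, not measured data.
PROFILE = {"vowel": 40.0, "fricative": 28.0, "stop": 15.0}

def guess_class(segment_sizes, profile=PROFILE):
    """Pick the class whose typical size is closest to the segment's mean size."""
    mean = sum(segment_sizes) / len(segment_sizes)
    return min(profile, key=lambda label: abs(profile[label] - mean))

def reconstruct(segments, profile=PROFILE):
    """Label every segment of observed frame sizes with a guessed sound class."""
    return [guess_class(seg, profile) for seg in segments]

# Invented segments of encrypted frame sizes.
segments = [[27, 29, 28], [38, 39, 37], [12, 13]]
print(reconstruct(segments))
```

Nothing here touches the key or the plaintext; the guesses come entirely from the lengths of the encrypted frames.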

So how well does it work? Not so well that we should all abandon Skype tomorrow. However, the researchers noted that if an encryption scheme is to be considered secure, “no reconstruction, even a partial one, should be possible; indeed, any cryptographic system that leaked as much information as shown here would immediately be deemed insecure.”

Bigger phoneme-word dictionaries (covering more dialects and languages) and faster processing would improve the accuracy of this kind of analysis. ®
