Roses are #f00, violets are #00f. This witty code is a boffinry breakthrough

'I like my relationships like I like my kernel source... open'

What do you call a computer program that uses big data to write jokes? Basic, judging by the list of groan-worthy gags generated by this new wisecracking software.

Eggheads at the University of Edinburgh have developed code dedicated to spitting out quips along the lines of: "I like my men like I like my monoxide - odourless" and "I like my women like I like my gas - natural".

The system was tested on a group of volunteers who claimed the witty algorithms made them chuckle a few times, although not as much as similar, human-penned jokes chosen from Twitter.

It uses 2,000,000 noun-adjective pairs to draw up jokes "with an element of surprise", something the creators claim is key to good comedy. The one-liners were produced by searching for connections between word pairings using Google n-gram data and WordNet's part-of-speech tags.
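For the curious, the template-filling idea can be sketched in a few lines of Python. This is a toy illustration only, not the Edinburgh team's model: the co-occurrence counts are invented, the names `PAIR_COUNTS`, `shared_adjectives` and `make_joke` are hypothetical, and multiplying the two counts together is an assumed stand-in for whatever scoring the real system applies to its n-gram data.

```python
# Hypothetical noun-adjective co-occurrence counts, invented for illustration.
# The real system reportedly drew on about 2,000,000 such pairs.
PAIR_COUNTS = {
    ("coffee", "cold"): 40,
    ("coffee", "hot"): 120,
    ("war", "cold"): 90,
    ("war", "long"): 30,
    ("gas", "natural"): 80,
}


def shared_adjectives(x, y, counts=PAIR_COUNTS):
    """Return adjectives seen with both nouns, scored by the product of counts.

    The product is an assumed scoring rule, chosen only to keep the sketch short.
    """
    adjs_x = {adj: c for (noun, adj), c in counts.items() if noun == x}
    adjs_y = {adj: c for (noun, adj), c in counts.items() if noun == y}
    return {adj: adjs_x[adj] * adjs_y[adj] for adj in adjs_x.keys() & adjs_y.keys()}


def make_joke(x, y, counts=PAIR_COUNTS):
    """Fill the "I like my X like I like my Y... Z" template, or return None."""
    scores = shared_adjectives(x, y, counts)
    if not scores:
        return None  # the two nouns share no adjective, so there is no punchline
    z = max(scores, key=scores.get)
    return f"I like my {x} like I like my {y}... {z}"


print(make_joke("coffee", "war"))
```

With the made-up counts above, "cold" is the only adjective attached to both "coffee" and "war", so it becomes the punchline. A pairing with no adjective in common, such as "coffee" and "gas", yields no joke at all.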

Other jokes calculated by the software include:

  • I like my relationships like I like my source code... open
  • I like my boys like I like my disk sectors... bad
  • I like my coffee like I like my war... cold

As snigger-worthy as they are, there's some way to go until computers are as funny as their human masters, said David Matthews of the university's School of Informatics, who wrote the program with Sasa Petrovic.

"Computers have an advantage over people in that they can process masses of information, so we fed computers a wealth of material from which they extracted creative and unusual word combinations to fit our joke template," he said.

"The holy grail for machine-generated comedy would be to include cultural references, but these are very hard to capture."

Speaking of cultural references, the academic paper describing the software is also amusing, in a nerdy way. Here's one extract as an example:

In the automatic evaluation we measure the effect of the different factors in the model … It is a local approximation to log-likelihood, and we therefore dub it LOcal Log-likelihood, or LOL-likelihood for short. Our second metric computes the rank of the human-generated jokes in the distribution of all possible jokes sorted decreasingly by their LOL-likelihood.

This Rank OF Likelihood (ROFL) is computed relative to the number of all possible jokes, and like LOL-likelihood is averaged over all the jokes in our development data.

For measuring LOL-likelihood and ROFL we use a set of 48 jokes randomly sampled from Twitter that fit the "I like my X like I like my Y, Z" pattern.
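Going only by the extract above, the ROFL metric might be computed along the following lines. Treat this as one plausible reading rather than the paper's code: the function name `rofl`, the joke labels and the scores are all hypothetical, and dividing each rank by the total number of jokes is an assumption about what "relative to the number of all possible jokes" means.

```python
def rofl(all_scores, human_jokes):
    """Average relative rank of human-written jokes among all candidate jokes.

    all_scores maps every candidate joke to its score (higher is better);
    human_jokes lists the human-written jokes, each of which must be a key
    of all_scores. Lower results mean the model ranks human jokes nearer the top.
    """
    # Sort candidates by decreasing score; rank 1 is the highest-scoring joke.
    ranked = sorted(all_scores, key=all_scores.get, reverse=True)
    position = {joke: i + 1 for i, joke in enumerate(ranked)}
    total = len(ranked)
    # Assumed normalisation: each rank is divided by the number of candidates.
    return sum(position[j] / total for j in human_jokes) / len(human_jokes)


# Made-up scores for four candidate jokes, two of them human-written.
scores = {"joke_a": -1.0, "joke_b": -2.0, "joke_c": -3.0, "joke_d": -4.0}
print(rofl(scores, ["joke_a", "joke_b"]))
```

In this toy run the two human jokes sit at ranks 1 and 2 out of 4, so the averaged relative rank comes out at 0.375.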

The e-comedian project [paper PDF] will be presented at the Association for Computational Linguistics annual meeting in Sofia, Bulgaria, next week. ®

(An SQL statement walks into a bar, wanders over to two tables and asks: “May I join you?” ...we're here all week. For some commentards, months.)
