The Register® — Biting the hand that feeds IT

Roses are #f00, violets are #00f. This witty code is a boffinry breakthrough

'I like my relationships like I like my kernel source... open'

What do you call a computer program that uses big data to write jokes? Basic, judging by the list of groan-worthy gags generated by this new wisecracking software.

Eggheads at the University of Edinburgh have developed code dedicated to spitting out quips along the lines of: "I like my men like I like my monoxide - odourless" and "I like my women like I like my gas - natural".

The system was tested on a group of volunteers who claimed the witty algorithms made them chuckle a few times, although not as much as similar, human-penned jokes chosen from Twitter.

It uses 2,000,000 noun-adjective pairs of words to draw up jokes "with an element of surprise", something the creators claim is key to good comedy. The one-liners were produced by searching for connections between pairings of words using Google n-gram data and WordNet's part-of-speech tags.
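For the curious, the template-filling idea can be sketched in a few lines of Python. This is a toy illustration only, not the researchers' model: the counts in `PAIR_COUNTS` are made up, the function names are hypothetical, and scoring a joke by the product of two counts is an assumption chosen for simplicity. It finds adjectives attested with both nouns and drops them into the "I like my X like I like my Y" pattern.

```python
from collections import defaultdict

# Hypothetical toy counts of (noun, adjective) pairs. The numbers are invented
# for illustration; the real system reportedly draws on about two million pairs.
PAIR_COUNTS = {
    ("coffee", "cold"): 40,
    ("coffee", "hot"): 90,
    ("war", "cold"): 70,
    ("war", "civil"): 120,
    ("relationships", "open"): 15,
    ("source code", "open"): 60,
}


def shared_adjectives(x, y, counts=PAIR_COUNTS):
    """Return adjectives seen with both nouns, strongest combined link first."""
    by_noun = defaultdict(dict)
    for (noun, adj), n in counts.items():
        by_noun[noun][adj] = n
    common = set(by_noun[x]) & set(by_noun[y])
    # Score by the product of the two counts: an illustrative assumption,
    # not the scoring used in the paper.
    return sorted(common, key=lambda a: by_noun[x][a] * by_noun[y][a], reverse=True)


def make_jokes(x, y, counts=PAIR_COUNTS):
    """Fill the joke template with every adjective the two nouns share."""
    return [f"I like my {x} like I like my {y}... {z}"
            for z in shared_adjectives(x, y, counts)]


if __name__ == "__main__":
    for joke in make_jokes("coffee", "war"):
        print(joke)
```

With the toy table above, "coffee" and "war" share only the adjective "cold", so a single joke comes out. Noun pairs with no adjective in common produce an empty list.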

Other jokes calculated by the software include:

  • I like my relationships like I like my source code... open
  • I like my boys like I like my disk sectors... bad
  • I like my coffee like I like my war... cold

As snigger-worthy as they are, there's some way to go until computers are as funny as their human masters, said David Matthews of the university's School of Informatics, who wrote the program with Sasa Petrovic.

"Computers have an advantage over people in that they can process masses of information, so we fed computers a wealth of material from which they extracted creative and unusual word combinations to fit our joke template," he said.

"The holy grail for machine-generated comedy would be to include cultural references, but these are very hard to capture."

Speaking of cultural references, the academic paper describing the software is also amusing, in a nerdy way. Here's one extract as an example:

In the automatic evaluation we measure the effect of the different factors in the model … It is a local approximation to log-likelihood, and we therefore dub it LOcal Log-likelihood, or LOL-likelihood for short. Our second metric computes the rank of the human-generated jokes in the distribution of all possible jokes sorted decreasingly by their LOL-likelihood.

This Rank OF Likelihood (ROFL) is computed relative to the number of all possible jokes, and like LOL-likelihood is averaged over all the jokes in our development data.

For measuring LOL-likelihood and ROFL we use a set of 48 jokes randomly sampled from Twitter that fit the "I like my X like I like my Y, Z" pattern.
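The ranking metric in that extract can be illustrated with a small Python sketch. It is one plausible reading of the quoted description, not the paper's exact definition: candidates are sorted by decreasing score, the position of each human-written joke is divided by the number of candidates, and the results are averaged. The function names and the scores below are hypothetical.

```python
def relative_rank(target, scores):
    """Position of `target` when candidates are sorted by decreasing score,
    divided by the total number of candidates (lower means ranked higher)."""
    ordered = sorted(scores, key=scores.get, reverse=True)
    return (ordered.index(target) + 1) / len(ordered)


def mean_relative_rank(targets, scores):
    """Average the relative rank over a set of reference jokes."""
    return sum(relative_rank(t, scores) for t in targets) / len(targets)


if __name__ == "__main__":
    # Made-up log-likelihood-style scores for four candidate jokes.
    toy_scores = {"joke_a": -1.0, "joke_b": -2.0, "joke_c": -3.0, "joke_d": -4.0}
    print(mean_relative_rank(["joke_a", "joke_c"], toy_scores))
```

Under this reading, a model that places the human-written jokes near the top of its ranking gets a value close to zero.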

The e-comedian project [paper PDF] will be presented at the Association for Computational Linguistics annual meeting in Sofia, Bulgaria, next week. ®

(An SQL statement walks into a bar, wanders over to two tables and asks: “May I join you?” ...we're here all week. For some commentards, months.)
