EU moves to establish gibberish as lingua franca
Brussels docs to teach translation software
The European Commission is seeking to make us all speak Brusselsese by donating about a million of its translated sentences to translation software developers.
The commission described the donation of its "collection of about one million sentences and their high quality translations in 22 of the 23 official EU languages" as a step further in its "efforts to foster multilingualism as a key part of European unity in diversity".
At first glance, the offer would appear to be a boon to anyone developing a translation engine. The EU has 23 official languages, and its translation services span 253 possible language pairs, producing about 1.5 million translated pages a year. This should provide plenty of high-quality material for engines to get their teeth into.
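For anyone wondering where 253 comes from, it appears to be the number of ways of picking two languages out of 23 when the order does not matter. A minimal sketch of that arithmetic follows; the variable names are ours, and counting each translation direction separately is our own assumption, not a figure the commission quotes.

```python
from math import comb

# Number of official EU languages, as stated in the article.
official_languages = 23

# Unordered pairs: English-French and French-English count once.
unordered_pairs = comb(official_languages, 2)

# Assumption for illustration: if each translation direction were
# counted separately, the figure would double.
directed_pairs = official_languages * (official_languages - 1)

print(unordered_pairs)
print(directed_pairs)
```

The first figure matches the 253 quoted above, which suggests the count treats each pair of languages as a single combination regardless of direction.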
On the other hand, we are talking about Brussels here, where everything reads like gibberish and gobbledegook, even when it's written in your native tongue to begin with.
For starters, just try getting your tongue round the intro to the EU's Language Translation program's copyright notice, which warns us: "Where prior permission must be obtained for the reproduction or use of textual and multimedia information (sound, images, software, etc.), such permission shall cancel the above-mentioned general permission and shall clearly indicate any restrictions on use."
OK, we admit it, that's clear enough, if you're a lawyer. But it's still good practice for slogging your way through the Lisbon Treaty. Just check out this section on data protection: "The European Parliament and the Council, acting in accordance with the ordinary legislative procedure, shall lay down the rules relating to the protection of individuals with regard to the processing of personal data by Union institutions, bodies, offices and agencies, and by the Member States when carrying out activities which fall within the scope of Union law, and the rules relating to the free movement of such data. Compliance with these rules shall be subject to the control of independent authorities."
Shockingly, the only language excluded from the program is Irish, leaving the 90,000 residents of Ireland's Gaeltacht out in the cold, linguistically speaking. ®
@Michael H.F. Wilkinson
Occitan in the South East would be a bit of a turn-up for the books. I would expect to find it a long way west of there, west of the places where Provençal is still alive (or was last time I was there, a few decades ago), albeit not very alive.
It would be nice to have translations of EU documents into all those minority languages from France, and of course into Gaoluinn and Gaeilge and Gaidhlig and Gaelk and Welsh, also Cornish if there's even one person who speaks it well enough to translate this sort of stuff. Maybe I could get a job as a translator and give up this IT nonsense.
Well in my case (I'm the one that defended the author... to a point) it's because I foolishly registered with my own name, thinking that was the done thing here.
Now I see the error of my ways, as these pages are filled with comments from pseudonymous accounts like "Hate2Register".
I can't see any account settings, so... aw heck -- why not go AC?
Re: Nice try
"Stop Press: EU WRITES LEGAL DOCUMENTS IN LEGAL LANGUAGE!"
Well, if they spent some time simplifying the original (*) text, they'd save a bundle on translation costs and save years of subsequent wrangling over treaties that meant something slightly different to every country.
IT angle: it's a classic case of automating a flawed process.