Founding offer · lifetime membership for a single £24, exclusive to our first members · closes 20 June Claim your place →
Global Research Partnerships £24 Lifetime Log inCreate free account

Funded Projects › H2020

EMBEDDIA · Cross-Lingual Embeddings for Less-Represented Languages in European News Media

H2020Status: CLOSED1 January 201931 March 2022EU funding €2,998,850Call H2020-ICT-2018-20

Access to the internet is no longer a luxury---it is a basic component of everyday life and civic engagement, but one in which language continues to be a challenge for fair and equitable access. As Europe becomes more multicultural, and personal and professional mobility between cultures rapidly increases, access to fundamental resources such as local news and government services is limited by the great diversity of the EU's 37 languages. The internet mostly developed in English, and without clear planning for how language issues might form barriers to access and engagement, nor how multilingualism might be supported. In the EU, websites and online services for citizens have developed national local language resources, and often only provide a second language (usually English) when absolutely needed; but the great proliferation of web content, multiple and fast-changing content streams, and an expanding user interest base make this approach untenable. And while advanced natural language research and resources exist for a few dominant languages (English, French, German), many of Europe's smaller language communities---and the news media industry that serves them---lack appropriate tools for multilingual internet development. For the EU to realise a truly equitable, open, multilingual future internet, new tools allowing high quality transformations (not translations) between languages are urgently needed. The EMBEDDIA project seeks to address these challenges by leveraging innovations in the use of cross-lingual embeddings coupled with deep neural networks to allow existing monolingual resources to be used across languages, leveraging their high speed of operation for near real-time applications, without the need for large computational resources. Across three years, the project's six academic and four industry partners will develop novel solutions including for under-represented languages, and test them in real-world news and media production contexts.

Consortium · 11 organisations

coordinator

INSTITUT JOZEF STEFAN

SI · €560,060

participant

OY SUOMEN TIETOTOIMISTO - FINSKA NOTISBYRAN AB

FI · €111,738

participant

HELSINGIN YLIOPISTO

FI · €448,125

participant

STYRIA MEDIJSKI SERVISI DOO ZA TRGOVINU I USLUGE

HR · €11,014

participant

TEXTA OU

EE · €306,250

participant

AS EKSPRESS MEEDIA

EE · €113,438

participant

LA ROCHELLE UNIVERSITE

FR · €372,500

participant

TRIKODER DRUSTVO S OGRANICENOM ODGOVORNOSCU ZA RAZVOJ INTERNET SUSTAVAI OBLIKOVANJE

HR · €125,177

participant

QUEEN MARY UNIVERSITY OF LONDON

UK · €451,800

participant

THE UNIVERSITY OF EDINBURGH

UK · €175,000

participant

UNIVERZA V LJUBLJANI

SI · €323,750

Research fields

View the official record on CORDIS →

← Find collaborators and more funded projects

Source: CORDIS, Publications Office of the European Union. Global Research Partnerships surfaces open EU research data to help you find collaborators; we are not affiliated with the European Union.