4th IODC

Blog IODC 2016
Madrid. October 6-7, 2016


Open data and language processing technologies

September 1, 2016 by Juan Llorens

Juan Llorens is a telecommunications engineer and civil servant currently working at the Secretary of State for Telecommunications and Information Society, Spain

The growth of Internet and ICTs generates such an overwhelming amount of text in electronic format that is already beyond human limits, so that automatic utilization of these textual resources is becoming an urgent matter.

Language Technologies are a set of diverse technologies that set the path to a deeper automatic understanding of human language. These comprehend Natural Language Processing (NLP) as well as Machine Translation. These are the technologies that allow automatic use of that amount of textual data.

Consequently Language Technologies generate an emerging, innovative and cross-cutting industrial sector.

Organizations accumulate large amounts of text in electronic format which could be fuel for language technologies industry.

These texts are valuable in two ways:

  • Its direct value as raw material to produce relevant information by means of Language Technologies.
  • But not less relevant, it is also very useful to create and train Language Technology itself (A good example is the translation memory of European Commission Directorate-General for Translation, which the most downloaded dataset at European Union Open Data Portal).

We could think even further, since combining Open Data with Language Technologies may enable a new knowledge revolution, a new global Enlightenment.

But to achieve its potential benefits, its specific societal, economic, legal and technical challenges must be faced.

To bring attention to the potential benefits of the conjunction of Open Data and Language Processing Technologies; and to address its specific societal, economic, legal and technical challenges, two events will take place in the context of the International Open Data Conference IODC 2016, that will be held in Madrid (Spain) in October 2016.

The first is a Workshop on October, the 5th (15:30-19:30), where relevant experts in different sides of this polyhedral issue will have time to share and discuss among them and with the audience their different but revealing views and experiences on the matter in a collective effort to shed new and enriched light on it.

You can find more information on this Workshop here: http://opendatacon.org/agenda/pre-events/open-linguistic-data/

The second is an Impact session on October, the 6th (17:00-17:45), where relevant experts will share with us well informed reflections on its challenges and opportunities from different angles, illustrated with use cases where they have been directly involved.

You can find more information on this Impact session here: http://opendatacon.org/agenda/

Attendance is free.


Cover photo by Fabien Barral

Use of cookies

This site uses cookies in order to improve your user experience. By continuing to use the site, you are agreeing to the use of cookies and accepting our cookies policy. .