Wit3: Web inventory of transcribed and translated talks

M Cettolo, C Girardi, M Federico - Proceedings of the Conference of …, 2012 - cris.fbk.eu
M Cettolo, C Girardi, M Federico
Proceedings of the Conference of European Association for Machine Translation …, 2012cris.fbk.eu
We describe here a Web inventory named WIT3 that offers access to a collection of
transcribed and translated talks. The core of WIT3 is the TED Talks corpus, that basically
redistributes the original content published by the TED Conference website (http://www. ted.
com). Since 2007, the TED Conference, based in California, has been posting all video
recordings of its talks together with subtitles in English and their translations in more than 80
languages. Aside from its cultural and social relevance, this content, which is published …
Abstract
We describe here a Web inventory named WIT3 that offers access to a collection of transcribed and translated talks. The core of WIT3 is the TED Talks corpus, that basically redistributes the original content published by the TED Conference website (http://www. ted. com). Since 2007, the TED Conference, based in California, has been posting all video recordings of its talks together with subtitles in English and their translations in more than 80 languages. Aside from its cultural and social relevance, this content, which is published under the Creative Commons BY-NC-ND license, also represents a precious language resource for the machine translation research community, thanks to its size, variety of topics, and covered languages. This effort repurposes the original content in a way which is more convenient for machine translation researchers.
cris.fbk.eu
Showing the best result for this search. See all results