The University of Dublin | Trinity College -- Ollscoil Átha Cliath | Coláiste na Tríonóide
Trinity's Access to Research Archive
Home :: Log In :: Submit :: Alerts ::

TARA >
School of Computer Science and Statistics >
Computer Science >
Computer Science (Scholarly Publications) >

Please use this identifier to cite or link to this item: http://hdl.handle.net/2262/62488

Title: Linking localisation and language resources
Other Titles: Linked Data in Linguistics
Author: LEWIS, DAVID
JONES, DOMINIC
LAWLESS, SEAMUS
Sponsor: 
Name Grant Number
07/CE/I1142

Author's Homepage: http://people.tcd.ie/delewis
http://people.tcd.ie/jonesdo
http://people.tcd.ie/selawles
Keywords: Computer Science and Engineering
Industrial localisation
Issue Date: 2012
Publisher: Springer-Verlag LNCS
Citation: David Lewis, Alexander O'Connor, Sebastien Molines, Leroy Finn, Dominic Jones, Stephen Curran and Séamus Lawless, Linking localisation and language resources, Linked Data in Linguistics, Frankfurt/Main, Germany, March 7 9, 2012, Chiarcos, Christian; Nordhoff, Sebastian; Hellmann, Sebastian, Springer-Verlag LNCS, 2012
Abstract: Industrial localisation is changing from the periodic translation of large bodies of content to a long-tail of small, heterogeneous translations processed in an agile and demand-driven manner. Software localisation and crowd-source translation already practice continuous fine-grained distribution of translation work. This requires close integration and round-trip interoperabil- ity between content creation and localisation processes, while at the same time recording the provenance of translated content to maximise it reuse in future translation tasks, and, increasingly, in training Statistical Machine Translation (SMT) engines. This work adopts a Linked Data approach to integrating the content translation roundtrip process with the logging of process quality assur- ance provenance. This integration supports a pull-based interoperability model that supports continuous synchronising of content and process meta-data be- tween the generating organisation and any number of language service provid- ers or translators. We present a platform architecture for sharing, searching and interlinking of Linked Localisation and Language Data (termed L3Data) on the web. This is accomplished using a semantic schema for L3Data that is compati- ble with existing localisation data exchange standards and can be used to sup- port the round-trip sharing of language resources. The paper describes our ap- proach to development of L3Data schema and data management processes, web-based tools and data sharing infrastructure that use it. An initial proof of concept prototype is presented which implements a web application that seg- ments and machine translates content for crowd-sourced post-editing and rating.
Description: IN_PRESS
Frankfurt/Main, Germany
URI: http://hdl.handle.net/2262/62488
Related links: http://www.scss.tcd.ie/seamus.lawless/papers/LDL2012.pdf
http://www.springer.com/computer/ai/book/978-3-642-28248-5
Appears in Collections:Computer Science (Scholarly Publications)

Files in This Item:

File Description SizeFormat
Linking Localisation and Language Resources.pdfPublished (author's copy) peer-reviewed372.26 kBAdobe PDFView/Open


This item is protected by original copyright


Please note: There is a known bug in some browsers that causes an error when a user tries to view large pdf file within the browser window. If you receive the message "The file is damaged and could not be repaired", please try one of the solutions linked below based on the browser you are using.

Items in TARA are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Valid XHTML 1.0! DSpace Software Copyright © 2002-2010  Duraspace - Feedback