Using Comparable Corpora for Under-Resourced Areas of Machine Translation

This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Συγγραφή απο Οργανισμό/Αρχή: SpringerLink (Online service)
Άλλοι συγγραφείς: Skadiņa, Inguna (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt), Gaizauskas, Robert (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt), Babych, Bogdan (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt), Ljubešić, Nikola (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt), Tufiş, Dan (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt), Vasiļjevs, Andrejs (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt)
Μορφή: Ηλεκτρονική πηγή Ηλ. βιβλίο
Γλώσσα:English
Έκδοση: Cham : Springer International Publishing : Imprint: Springer, 2019.
Έκδοση:1st ed. 2019.
Σειρά:Theory and Applications of Natural Language Processing,
Θέματα:
Διαθέσιμο Online:Full Text via HEAL-Link
Περιγραφή
Περίληψη:This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating comparability and extracting parallel data that can be used for the machine translation task. It is divided into several sections, each covering a specific task such as building, processing, and using comparable corpora, focusing particularly on under-resourced language pairs and domains. The book is intended for anyone interested in data-driven machine translation for under-resourced languages and domains, especially for developers of machine translation systems, computational linguists and language workers. It offers a valuable resource for specialists and students in natural language processing, machine translation, corpus linguistics and computer-assisted translation, and promotes the broader use of comparable corpora in natural language processing and computational linguistics.
Φυσική περιγραφή:VI, 323 p. 63 illus., 39 illus. in color. online resource.
ISBN:9783319990040
ISSN:2192-032X
DOI:10.1007/978-3-319-99004-0