Using Comparable Corpora for Under-Resourced Areas of Machine Translation

This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating...

Πλήρης περιγραφή

Λεπτομέρειες βιβλιογραφικής εγγραφής
Συγγραφή απο Οργανισμό/Αρχή: SpringerLink (Online service)
Άλλοι συγγραφείς: Skadiņa, Inguna (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt), Gaizauskas, Robert (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt), Babych, Bogdan (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt), Ljubešić, Nikola (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt), Tufiş, Dan (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt), Vasiļjevs, Andrejs (Επιμελητής έκδοσης, http://id.loc.gov/vocabulary/relators/edt)
Μορφή: Ηλεκτρονική πηγή Ηλ. βιβλίο
Γλώσσα:English
Έκδοση: Cham : Springer International Publishing : Imprint: Springer, 2019.
Έκδοση:1st ed. 2019.
Σειρά:Theory and Applications of Natural Language Processing,
Θέματα:
Διαθέσιμο Online:Full Text via HEAL-Link
LEADER 05282nam a2200553 4500
001 978-3-319-99004-0
003 DE-He213
005 20190619133644.0
007 cr nn 008mamaa
008 190206s2019 gw | s |||| 0|eng d
020 |a 9783319990040  |9 978-3-319-99004-0 
024 7 |a 10.1007/978-3-319-99004-0  |2 doi 
040 |d GrThAP 
050 4 |a QA76.9.N38 
072 7 |a UYQL  |2 bicssc 
072 7 |a COM073000  |2 bisacsh 
072 7 |a UYQL  |2 thema 
082 0 4 |a 006.35  |2 23 
245 1 0 |a Using Comparable Corpora for Under-Resourced Areas of Machine Translation  |h [electronic resource] /  |c edited by Inguna Skadiņa, Robert Gaizauskas, Bogdan Babych, Nikola Ljubešić, Dan Tufiş, Andrejs Vasiļjevs. 
250 |a 1st ed. 2019. 
264 1 |a Cham :  |b Springer International Publishing :  |b Imprint: Springer,  |c 2019. 
300 |a VI, 323 p. 63 illus., 39 illus. in color.  |b online resource. 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file  |b PDF  |2 rda 
490 1 |a Theory and Applications of Natural Language Processing,  |x 2192-032X 
505 0 |a Introduction -- Cross-language comparability and its Applications for MT (Bogdan Babych, Fangzhong Su, Anthony Hartley, Ahmet Aker, Monica Lestari Paramita, Paul Clough, Robert Gaizauskas) -- Collecting comparable corpora (Monica Lestari Paramita, Ahmet Aker, Paul Clough, Robert Gaizauskas, Nikos Glaros, Nikos Mastropavlos, Olga Yannoutsou, Radu Ion, Dan Ștefănescu, Alexandru Ceauşu, Dan Tufiș and Judita Preiss) -- Extracting data from comparable corpora (Mārcis Pinnis, Nikola Ljubešić, Dan Ştefănescu, Inguna Skadiņa, Marko Tadić, Tatjana Gornostaja, Špela Vintar, Darja Fišer) -- Mapping and aligning units from comparable corpora (Ahmet Aker, Alexandru Ceaușu, Yang Feng, Robert Gaizauskas, Sabine Hunsicker, Radu Ion, Elena Irimia, Dan Ștefănescu, Dan Tufiș) -- Training, enhancing, evaluating and using MT-Systems with comparable data (Bogdan Babych, Yu Chen, Andreas Eisele, Sabine Hunsicker, Mārcis Pinnis, Inguna Skadiņa, Raivis Skadiņš, Gregor Thurmair, Andrejs Vasiļjevs, Mateja Verlic, Xiaojun Zhang) -- New areas of application of comparable corpora (Reinhard Rapp, Vivian Xu, Michael Zock, Serge Sharoff, Richard Forsyth, Bogdan Babych, Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi) -- Appendices (Ahmet Aker, Radu Ion, Nikos Mastropavlos, Monica Paramita, Mārcis Pinnis, Dan Ştefănescu, Fangzhong Su, Gregor Thurmair,Elena Irimia, Nikola Ljubešić, Evangelos Kanoulas, Judita Preiss, Rob Gaizauskas, Paul Clough, Emma Barker, Nikos Glaros, Tiberiu Boroș, Inguna Skadiņa, Andrejs Vasiļjevs). 
520 |a This book provides an overview of how comparable corpora can be used to overcome the lack of parallel resources when building machine translation systems for under-resourced languages and domains. It presents a wealth of methods and open tools for building comparable corpora from the Web, evaluating comparability and extracting parallel data that can be used for the machine translation task. It is divided into several sections, each covering a specific task such as building, processing, and using comparable corpora, focusing particularly on under-resourced language pairs and domains. The book is intended for anyone interested in data-driven machine translation for under-resourced languages and domains, especially for developers of machine translation systems, computational linguists and language workers. It offers a valuable resource for specialists and students in natural language processing, machine translation, corpus linguistics and computer-assisted translation, and promotes the broader use of comparable corpora in natural language processing and computational linguistics. 
650 0 |a Natural language processing (Computer science). 
650 0 |a Computational linguistics. 
650 0 |a Data mining. 
650 1 4 |a Natural Language Processing (NLP).  |0 http://scigraph.springernature.com/things/product-market-codes/I21040 
650 2 4 |a Computational Linguistics.  |0 http://scigraph.springernature.com/things/product-market-codes/N22000 
650 2 4 |a Data Mining and Knowledge Discovery.  |0 http://scigraph.springernature.com/things/product-market-codes/I18030 
700 1 |a Skadiņa, Inguna.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Gaizauskas, Robert.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Babych, Bogdan.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Ljubešić, Nikola.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Tufiş, Dan.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
700 1 |a Vasiļjevs, Andrejs.  |e editor.  |4 edt  |4 http://id.loc.gov/vocabulary/relators/edt 
710 2 |a SpringerLink (Online service) 
773 0 |t Springer eBooks 
776 0 8 |i Printed edition:  |z 9783319990033 
776 0 8 |i Printed edition:  |z 9783319990057 
830 0 |a Theory and Applications of Natural Language Processing,  |x 2192-032X 
856 4 0 |u https://doi.org/10.1007/978-3-319-99004-0  |z Full Text via HEAL-Link 
912 |a ZDB-2-SCS 
950 |a Computer Science (Springer-11645)