76_[9783110305258 - Approaches] Towards.pdf

In this paper, we discuss advantages of clustering approaches to automated language classification, describe distance measures used for this purpose, and present results of several proof-of-concept experiments. We advocate the use of probability based distances – those that take into account the distribution of relevant features across the language sample in question.

Bibliographic record details
Language: English
Published: De Gruyter 2019
id oapen-20.500.12657-23712
record_format dspace
spelling oapen-20.500.12657-237122024-03-22T19:23:05Z Chapter Towards automated language classification Buch, Armin Erschler, David Jäger, Gerhard Lupas, Andrei Saxena, Anju Borin, Lars linguistic differences thema EDItEUR::C Language and Linguistics::CF Linguistics In this paper, we discuss advantages of clustering approaches to automated language classification, describe distance measures used for this purpose, and present results of several proof-of-concept experiments. We advocate the use of probability based distances – those that take into account the distribution of relevant features across the language sample in question 2019-11-19 23:55 2020-01-07 16:47:06 2020-04-01T09:26:38Z 2020-04-01T09:26:38Z 2013 chapter 1006432 9783110488081 http://library.oapen.org/handle/20.500.12657/23712 eng application/pdf n/a 76_[9783110305258 - Approaches] Towards.pdf De Gruyter Approaches to Measuring Linguistic Differences 10.1515/9783110305258.303 10.1515/9783110305258.303 2b386f62-fc18-4108-bcf1-ade3ed4cf2f3 d344d431-123c-48b3-94be-c8d10c495b20 7292b17b-f01a-4016-94d3-d7fb5ef9fb79 9783110488081 European Research Council (ERC) Berlin/Boston 324246 FP7 Ideas: European Research Council FP7-IDEAS-ERC - Specific Programme: "Ideas" Implementing the Seventh Framework Programme of the European Community for Research, Technological Development and Demonstration Activities (2007 to 2013) open access
institution OAPEN
collection DSpace
language English
description In this paper, we discuss advantages of clustering approaches to automated language classification, describe distance measures used for this purpose, and present results of several proof-of-concept experiments. We advocate the use of probability based distances – those that take into account the distribution of relevant features across the language sample in question
title 76_[9783110305258 - Approaches] Towards.pdf
spellingShingle 76_[9783110305258 - Approaches] Towards.pdf
title_short 76_[9783110305258 - Approaches] Towards.pdf
title_full 76_[9783110305258 - Approaches] Towards.pdf
title_fullStr 76_[9783110305258 - Approaches] Towards.pdf
title_full_unstemmed 76_[9783110305258 - Approaches] Towards.pdf
title_sort 76_[9783110305258 - approaches] towards.pdf
publisher De Gruyter
publishDate 2019
_version_ 1799945242710900736