Unsupervised learning of agglutinated morphology using nested Pitman-Yor process based morpheme induction algorithm

Author
Kumar, A.; Padro, L.; Oliver, A.
Type of activity
Presentation of work at congresses
Name of edition
19th International Conference on Asian Language Processing
Date of publication
2015
Presentation's date
2015-10
Book of congress proceedings
IALP 2015: 19th International Conference on Asian Language Processing: Suzhou, China: October 24-25, 2015: proceedings book
First page
45
Last page
48
DOI
https://doi.org/10.1109/IALP.2015.7451528
Repository
http://hdl.handle.net/2117/83378
URL
http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=7451528
Abstract
In this paper we describe a method to morphologically segment highly agglutinating and inflectional languages from the Dravidian family. We use the nested Pitman-Yor process to segment long agglutinated words into their basic components, and use a corpus-based morpheme induction algorithm to perform morpheme segmentation. We test our method on two languages, Malayalam and Kannada, and compare the results with Morfessor-baseline. © 2015 IEEE.
Citation
Kumar, A., Padro, L., Oliver, A. Unsupervised learning of agglutinated morphology using nested Pitman-Yor process based morpheme induction algorithm. In: International Conference on Asian Language Processing. "IALP 2015: 19th International Conference on Asian Language Processing: Suzhou, China: October 24-25, 2015: proceedings book". Suzhou: 2015.
Keywords
Agglutinated morphology, Algorithms, Corpus-based, Indian languages, Induction algorithms, Malayalams, Natural language processing systems, Process-based
Group of research
GPLN - Natural Language Processing Group
IDEAI-UPC - Intelligent Data Science and Artificial Intelligence Research Center
TALP - Centre for Language and Speech Technologies and Applications

Participants

  • Kumar, Arun (author and speaker)
  • Padró Cirera, Lluís (author and speaker)
  • Oliver González, Antoni (author and speaker)
