State-of-the-Art Language Technologies in Western Armenian


The Armenian Communities Department of the Calouste Gulbenkian Foundation announced that the Western Armenian Universal Dependencies Treebank was released on May 15 and is now available on the Universal Dependencies Consortium webpage.

A treebank is essential for the vitality of any language. It is a linguistic tool that analyzes and describes the structure of the language, identifying its different components in a form that computer programs can work with.

Treebanks revolutionized computational linguistics in the early 1990s, after the expansion of machine learning methods and artificial neural networks in the field of Natural Language Processing.

The Western Armenian Treebank currently consists of 1780 sentences, containing 7.5 million words, compiled from 110 works by more than 50 authors from 1895 to 2010, in many genres such as fiction, personal and official correspondence, travelogues, political and literary speeches, memoirs and travel notes.

The Western Armenian Treebank, and the Natural Language Processing solutions developed on its basis, are decisive in bringing state-of-the-art language technologies to Armenian, ensuring the vitality of the language in the modern digital era.

To read the full story, check out the link below to the article on Asbarez.

