DEVELOPING NLP TOOL FOR LINGUISTIC ANALYSIS OF UZBEK LANGUAGES
Keywords:
NLP, computer linguistics, tokenization
Abstract
Automatic processing of unstructured natural-language text is one of the pressing problems in the computer analysis and synthesis of texts. Within it, text normalization can be singled out as a separate task, usually comprising tokenization, stemming, and lemmatization. Existing stemming algorithms are mostly focused on synthetic languages, in which word forms are built predominantly with morphemes. Uzbek is an example of an agglutinative language, characterized by polysemous affixes and function morphemes. Although Uzbek differs from English in many respects, it can nevertheless be processed successfully by stemming algorithms.
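To make the normalization steps named above concrete, the following is a minimal sketch of tokenization followed by suffix-stripping stemming for an agglutinative language. It is not the authors' algorithm: the function names `tokenize` and `stem`, the small suffix list (plural -lar, a few case and possessive endings), and the minimum stem length are all assumptions chosen for illustration, and a real Uzbek stemmer would need a far larger affix inventory and ordering rules.

```python
import re

# Illustrative subset of Uzbek suffixes (an assumption, not a complete list),
# ordered longest first so that longer endings are tried before shorter ones.
SUFFIXES = ("ingiz", "imiz", "ning", "lar", "dan", "ni", "ga", "da")

# Hypothetical guard against stripping a word down to nothing.
MIN_STEM = 2

# A token is a run of letters, optionally joined by apostrophe-like marks
# as used in spellings such as o'qituvchi.
TOKEN_RE = re.compile(r"[^\W\d_]+(?:['ʻʼ’][^\W\d_]+)*")


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return TOKEN_RE.findall(text.lower())


def stem(word):
    """Repeatedly strip one known suffix until none applies."""
    stripped = True
    while stripped:
        stripped = False
        for suffix in SUFFIXES:
            if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM:
                word = word[: -len(suffix)]
                stripped = True
                break
    return word


if __name__ == "__main__":
    tokens = tokenize("Kitoblarimizdan, maktablarda!")
    print([stem(t) for t in tokens])  # ['kitob', 'maktab']
```

Because agglutinative word forms stack several suffixes in sequence (kitob + lar + imiz + dan), the sketch strips one suffix per pass and loops until nothing more matches. A purely list-based approach like this will over-stem words whose stems happen to end in a listed suffix, which is one reason a dictionary or lemmatization step is usually added.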
License
Copyright (c) 2025 Ass. professor Marhamat Haydarova Yunusovna, Assistant Guzal Shikhnazarova Alisherovna

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
