Empirical Bayes estimators of structural distribution of words in Lithuanian texts
Articles
Karolina Piaseckienė
Šiauliai University, Lithuania
Marijus Radavičius
Vilnius University
Published 2014-10-30
https://doi.org/10.15388/NA.2014.4.7
PDF

Keywords

structural distribution
Zipf–Mandelbrot law
empirical Bayes
Poisson mixture
sparse data

How to Cite

Piaseckienė K. and Radavičius M. (2014) “Empirical Bayes estimators of structural distribution of words in Lithuanian texts”, Nonlinear Analysis: Modelling and Control, 19(4), pp. 611-625. doi: 10.15388/NA.2014.4.7.

Abstract

Lithuanian language has great inflexion, free word order and other features which distinguish it from other languages. This raises a problem of testing for the Lithuanian language validity of findings established for other languages. In the paper, an empirical study of a collection of Lithuanian texts is performed. It is supposed that authors of texts are basic elements of the population under study and its heterogeneity stems out of the heterogeneity of preferences and choices of the authors. An attempt to estimate structural distributions of words in a collection of texts of different authors is made by making use of a simple statistical model and empirical Bayes approach.

PDF
Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Please read the Copyright Notice in Journal Policy