Empirical Bayes estimators of structural distribution of words in Lithuanian texts

Karolina Piaseckienė; Marijus Radavičius

doi:10.15388/NA.2014.4.7

Articles

Karolina Piaseckienė

Šiauliai University, Lithuania

Marijus Radavičius

Vilnius University

Published 2014-10-30

https://doi.org/10.15388/NA.2014.4.7

PDF

Keywords

structural distribution
Zipf–Mandelbrot law
empirical Bayes
Poisson mixture
sparse data

How to Cite

Piaseckienė, K. and Radavičius, M. (2014) “Empirical Bayes estimators of structural distribution of words in Lithuanian texts”, Nonlinear Analysis: Modelling and Control, 19(4), pp. 611–625. doi:10.15388/NA.2014.4.7.

Download Citation

Abstract

Lithuanian language has great inflexion, free word order and other features which distinguish it from other languages. This raises a problem of testing for the Lithuanian language validity of findings established for other languages. In the paper, an empirical study of a collection of Lithuanian texts is performed. It is supposed that authors of texts are basic elements of the population under study and its heterogeneity stems out of the heterogeneity of preferences and choices of the authors. An attempt to estimate structural distributions of words in a collection of texts of different authors is made by making use of a simple statistical model and empirical Bayes approach.

PDF

References

Downloads

Download data is not yet available.

Most read articles by the same author(s)

Marijus Radavičius, Pavel Samusenko, Goodness-of-fit tests for sparse nominal data based on grouping , Nonlinear Analysis: Modelling and Control: Vol. 17 No. 4 (2012): Nonlinear Analysis: Modelling and Control