Wiktionary:Frequency lists/Lithuanian
Jump to navigation
Jump to search
Lithuanian Frequency Lists
[edit]- 885K most common Lithuanian words occurring in mixed web sources (2020-21)
- Wordlist of Lemmas from the Joint Corpus of Lithuanian
- Full list of Lithuanian terms by frequency, based on 2018 data from www.opensubtitles.org
External Links
[edit]- Word frequency lists for Lithuanian and other languages from 10K up to 1M, available for download as part of the Leipzig Corpora Collection (CC BY-4.0)
- 50K and larger word lists based on www.opensubtitles.org for Lithuanian and other languages (CC BY-SA-4.0)
- Dictionary of the Written Lithuanian Language based on Frequency (Dažninis rašytinės lietuvių kalbos žodynas)
Further reading
[edit]- Kasparaitis, Pijus & Anbinderis, Tomas. "Lietuvių kalbos homografų vienareikšminimas remiantis leksemų ir morfologinių pažymų vartosenos dažniais [Disambiguation of Lithuanian Homographs Based on the Frequencies of Lexemes and Morphological Tags]". Kalbų Studijos, no. 14, Kauno Technologijos Universitetas, pp. 25-31, 2009. (in Lithuanian)
- Paulikas, Sarunas & Navakauskas, Dalius. (2006). "Discrimination of Homographs Distorted by a Lengthy Impulsive Noise." Informatica, Lith. Acad. Sci.. 17. 297-304. 10.15388/Informatica.2006.139.
- Rimkutė, Erika. "Homoformos dabartinės lietuvių kalbos tekstyne [Homoforms in the corpus of the Lithuanian language]." Lituanistica, No. 2, pp. 86-101, 2002.