User:AKA MBG/Statistics:POS
The parsed database name: enwikt20140908_parsed[1]
This page outlines:
- Number of meanings.
- Number of empty definitions for each language.
- Number of entries for each part of speech (POS).
See about Part of Speech (POS) headers:
Meanings
Number of words (with meanings) with unknown POS: 40828
Total number of words (unique noun, verb, etc.), including those with empty definitions: 1915645
Number of empty definitions: 109418
Number of words (unique noun, verb, etc.) with nonempty definitions: 1806227
Number of records in the table lang_pos: 1915646
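The summary figures above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python; the variable names are illustrative and not part of the parsed database. It only confirms that the nonempty and empty counts add up to the stated total, and it shows that the lang_pos record count differs from that total by one, a difference this page does not explain.

```python
# Figures copied from the summary above; variable names are illustrative.
total_with_empty = 1915645    # all unique noun, verb, etc. (incl. empty definitions)
empty_definitions = 109418    # words with empty definitions
nonempty_definitions = 1806227
lang_pos_records = 1915646    # records in the table lang_pos

# The nonempty and empty counts should add up to the overall total.
assert nonempty_definitions + empty_definitions == total_with_empty

# Difference between the lang_pos record count and the total of words.
extra_records = lang_pos_records - total_with_empty

# Share of words whose definitions are empty.
empty_share = empty_definitions / total_with_empty

print(extra_records)
print(f"{empty_share:.2%}")
```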
Number of words by number of meanings / definitions
Table description:
- column 0 - number of words with empty definitions (total and for each language)
- column 1 - number of monosemous words (total and for each language)
- column 2 - number of words with two meanings, etc.
- last column ("Total") - total number of words for this language.
Only words with up to 9 meanings (columns 0 to 9) are presented in the table.
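The column layout described above can be illustrated by parsing rows of the table. This is a minimal sketch, assuming each row is a pipe-delimited line of the form `code | language | counts for 0..9 meanings | Total |`; the helper `parse_row` is hypothetical and not part of the tool that generated this page. The two sample rows are copied from the table, and the check only shows that, for these rows, the "Total" column equals the sum of the ten displayed columns.

```python
def parse_row(line):
    """Split one pipe-delimited table row into (code, language, counts, total)."""
    cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
    code, language = cells[0], cells[1]
    numbers = [int(cell) for cell in cells[2:]]
    # The last number is the "Total" column; the rest are columns 0..9.
    return code, language, numbers[:-1], numbers[-1]


# Two rows copied from the table on this page.
rows = [
    "mwl | Mirandese | 1 | 253 | 15 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 270 |",
    "en | English | 15067 | 349547 | 41557 | 11912 | 5058 | 2543 | 1488 | 911 | 615 | 420 | 429118 |",
]

for row in rows:
    code, language, counts, total = parse_row(row)
    empty = counts[0]            # column 0: words with empty definitions
    monosemous = counts[1]       # column 1: words with exactly one meaning
    polysemous = sum(counts[2:]) # columns 2..9: words with two or more meanings
    consistent = sum(counts) == total
    print(code, language, empty, monosemous, polysemous, consistent)
```

The same function can be applied to any other row of the table to spot rows whose "Total" does not match the displayed columns.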
Code | Language | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | Total |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Total | (all languages) | 109418 | 1477775 | 213489 | 61135 | 22147 | 13074 | 7201 | 2323 | 4621 | 886 | 1912069 |
pms | Piedmontese | 0 | 31 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
mwl | Mirandese | 1 | 253 | 15 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 270 |
mg | Malagasy | 35 | 3301 | 23 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 3363 |
wrh | Wiradjuri | 6 | 143 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 159 |
chc | Catawba | 1 | 16 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 18 |
rej | Rejang | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
tvl | Tuvaluan | 0 | 27 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 30 |
ktn | Karitiâna | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
see | Seneca | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
rtm | Rotuman | 0 | 14 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 15 |
chn | Chinook Jargon | 0 | 65 | 12 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 80 |
sl | Slovene | 21 | 3235 | 298 | 65 | 22 | 12 | 5 | 0 | 0 | 5 | 3663 |
osa | Osage | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ems | Alutiiq | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
btk | Batak | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
cuk | Kuna | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
pdc | Pennsylvania German | 1 | 21 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
crp-rsn | Russenorsk | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
pa | Punjabi | 0 | 174 | 32 | 8 | 1 | 5 | 0 | 0 | 0 | 0 | 220 |
mnc | Manchu | 0 | 40 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 42 |
xdc | Dacian | 0 | 40 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 40 |
rap | Rapa Nui | 1 | 291 | 16 | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 313 |
axm | Middle Armenian | 6 | 129 | 17 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 160 |
gul | Gullah | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
ext | Extremaduran | 0 | 86 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 90 |
luo | Dholuo | 0 | 36 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 37 |
nys | Nyunga | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
dak | Dakota | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
kayah | Kayah | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
frk | Frankish | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
syc | Syriac | 181 | 1391 | 619 | 381 | 200 | 123 | 88 | 39 | 26 | 25 | 3073 |
pag | Pangasinan | 0 | 11 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
agj | Argobba | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
gsw | Swiss German | 4 | 140 | 14 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 160 |
oj | Ojibwe | 0 | 202 | 41 | 16 | 7 | 4 | 1 | 0 | 0 | 0 | 271 |
uz | Uzbek | 1 | 301 | 43 | 12 | 1 | 1 | 0 | 0 | 0 | 0 | 359 |
for | Fore | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
hak | Hakka | 0 | 52 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
xtg | Gaulish | 0 | 25 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 |
kjb | Q'anjob'al | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ave | Avestan | 0 | 19 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 21 |
nij | Ngaju | 0 | 8 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 10 |
wbw | Woi | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
ntj | Ngaanyatjarra | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
si | Sinhala | 1 | 148 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 150 |
ase | American Sign Language | 0 | 299 | 47 | 11 | 3 | 1 | 0 | 0 | 0 | 0 | 361 |
are | Arrernte | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
mrv | Mangareva | 0 | 7 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
lij | Ligurian | 0 | 147 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 150 |
cop | Coptic | 0 | 15 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 18 |
ada | Adangme | 0 | 21 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
pit | Pitta-Pitta | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
cui | Cuiba | 0 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
wo | Wolof | 0 | 27 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
akk | Akkadian | 0 | 144 | 23 | 8 | 5 | 2 | 1 | 0 | 1 | 0 | 184 |
kn | Kannada | 0 | 393 | 35 | 3 | 0 | 1 | 0 | 1 | 0 | 0 | 433 |
xvn | Vandalic | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
gn | Guaraní | 0 | 63 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 66 |
tsn | Tswana | 1 | 76 | 15 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 95 |
cjs | Shor | 0 | 38 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
yux | Southern Yukaghir | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
alt | Altai | 0 | 64 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 64 |
pjt | Pitjantjatjara | 0 | 153 | 33 | 11 | 2 | 0 | 0 | 0 | 0 | 0 | 199 |
xdm | Edomite | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
ccc | Chamicuro | 0 | 441 | 9 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 452 |
bej | Beja | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
ary | Moroccan Arabic | 0 | 35 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
nog | Nogai | 0 | 22 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
nzi | Nzema | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
gmh | Middle High German | 1 | 69 | 6 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 79 |
asm | Assamese | 0 | 37 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
gil | Gilbertese | 0 | 19 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 22 |
mic | Mi'kmaq | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
sc | Sardinian | 0 | 135 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 140 |
cow | Cowlitz | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
rup | Aromanian | 22 | 976 | 180 | 61 | 11 | 3 | 1 | 0 | 0 | 0 | 1254 |
cdo | Min Dong | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
crp-gep | Greenlandic Eskimo Pidgin | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
jam | Jamaican Creole | 0 | 31 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
pox | Polabian | 0 | 157 | 6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 164 |
pam | Kapampangan | 0 | 11 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 14 |
khw | Khowar | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
eu | Basque | 22 | 1214 | 68 | 13 | 2 | 2 | 0 | 0 | 0 | 0 | 1321 |
sv | Swedish | 4994 | 15283 | 1993 | 549 | 231 | 84 | 33 | 19 | 16 | 5 | 23207 |
tkd | Tukudede | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
sh | Serbo-Croatian | 118 | 39967 | 7189 | 2203 | 654 | 255 | 110 | 42 | 21 | 28 | 50587 |
se | Northern Sami | 1 | 773 | 37 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 817 |
kaa | Karakalpak | 0 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
ryu | Okinawan | 13 | 90 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 106 |
sco | Scots | 140 | 1770 | 232 | 65 | 27 | 12 | 5 | 3 | 0 | 0 | 2254 |
nah | Nahuatl | 36 | 1439 | 133 | 23 | 11 | 2 | 1 | 0 | 0 | 0 | 1645 |
nb | Bokmål | 2116 | 4208 | 523 | 133 | 45 | 13 | 3 | 1 | 1 | 0 | 7043 |
zen | Zenaga | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
aib | Äynu | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
sma | Southern Sami | 0 | 33 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
ang | Old English | 149 | 2943 | 732 | 276 | 100 | 42 | 4 | 2 | 0 | 1 | 4249 |
mia | Miami-Illinois | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
am | Amharic | 0 | 176 | 6 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 184 |
dif | Dieri | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
ln | Lingala | 0 | 63 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 66 |
ecr | Eteocretan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
eo | Esperanto | 1173 | 12029 | 807 | 99 | 13 | 0 | 0 | 0 | 0 | 0 | 14121 |
mzn | Mazandarani | 0 | 54 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 57 |
cpi | Chinese Pidgin English | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
roa-ptg | Galician-Portuguese | 23 | 334 | 29 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 395 |
nyn | Nyankole | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
li | Limburgish | 0 | 366 | 136 | 12 | 4 | 2 | 0 | 0 | 0 | 0 | 520 |
fi | Finnish | 4157 | 57816 | 5833 | 1357 | 423 | 174 | 85 | 41 | 22 | 15 | 69923 |
nxn | Ngawun | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
tpn | Tupinambá | 0 | 9 | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 12 |
ara | Arabic | 101 | 3185 | 704 | 322 | 175 | 105 | 63 | 34 | 43 | 17 | 4749 |
is | Icelandic | 81 | 9544 | 1475 | 559 | 168 | 80 | 24 | 13 | 1 | 0 | 11945 |
av | Avar | 0 | 141 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 149 |
chy | Cheyenne | 0 | 21 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
ho | Hiri Motu | 0 | 42 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 42 |
mdf | Moksha | 0 | 19 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
run | Rundi | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
yij | Yindjibarndi | 0 | 21 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 25 |
tati | Tati | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
pap | Papiamento | 0 | 120 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 125 |
yua | Yucatec Maya | 2 | 150 | 19 | 11 | 6 | 1 | 0 | 0 | 0 | 0 | 189 |
knb | Lubuagan Kalinga | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
tir | Tigrinya | 0 | 243 | 13 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 259 |
ff | Fula | 3 | 57 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 65 |
arl | Arabela | 0 | 36 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
apy | Apalaí | 0 | 20 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
awa | Awadhi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
tgt | Tagbanwa | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
gml | Middle Low German | 0 | 52 | 18 | 9 | 0 | 2 | 2 | 0 | 0 | 0 | 83 |
ess | Central Siberian Yupik | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
cab | Garifuna | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
iu | Inuktitut | 3 | 187 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 195 |
vep | Veps | 0 | 139 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 145 |
yan | Mayangna | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ina | Interlingua | 173 | 1017 | 69 | 16 | 4 | 0 | 0 | 1 | 0 | 0 | 1280 |
nl | Dutch | 3764 | 19511 | 3504 | 907 | 263 | 102 | 36 | 20 | 9 | 3 | 28119 |
ood | O'odham | 0 | 22 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
jao | Yanyuwa | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
lkt | Lakota | 1 | 42 | 6 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 52 |
xpr | Parthian | 0 | 24 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26 |
non | Old Norse | 14 | 1155 | 169 | 41 | 11 | 7 | 2 | 1 | 0 | 1 | 1401 |
smn | Inari Sami | 0 | 200 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 210 |
lo | Lao | 1 | 667 | 91 | 41 | 15 | 5 | 1 | 1 | 1 | 0 | 823 |
kxv | Kuvi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
te | Telugu | 19 | 6968 | 815 | 193 | 60 | 45 | 23 | 8 | 4 | 2 | 8137 |
mnw | Mon | 0 | 44 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
ja | Japanese | 1594 | 61244 | 7297 | 2089 | 829 | 346 | 189 | 122 | 71 | 48 | 73829 |
frm | Middle French | 120 | 2315 | 148 | 7 | 3 | 0 | 0 | 0 | 0 | 0 | 2593 |
ban | Balinese | 0 | 59 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 68 |
spx | South Picene | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
tk | Turkmen | 0 | 404 | 15 | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 424 |
mvi | Miyako | 4 | 33 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 37 |
jv | Javanese | 2 | 80 | 4 | 1 | 0 | 4 | 0 | 0 | 0 | 0 | 91 |
ki | Gikuyu | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
crp-tpr | Taimyr Pidgin Russian | 0 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
doz | Dorze | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
sgz | Sursurunga | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
or | Oriya | 0 | 57 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 57 |
tar | Tarahumara | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
css | Southern Ohlone | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
agx | Aghul | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
es | Spanish | 7795 | 31243 | 4806 | 1449 | 469 | 185 | 90 | 38 | 23 | 8 | 46106 |
sga | Old Irish | 268 | 1009 | 328 | 128 | 46 | 27 | 13 | 3 | 2 | 3 | 1827 |
vma | Martuthunira | 0 | 101 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 109 |
crh | Crimean Tatar | 0 | 1963 | 211 | 32 | 6 | 0 | 0 | 0 | 0 | 0 | 2212 |
kzg | Kikai | 4 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
ple | Palu'e | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
nap | Neapolitan | 8 | 511 | 106 | 23 | 7 | 1 | 0 | 1 | 0 | 0 | 657 |
nia | Nias | 0 | 28 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 |
mhk | Mungaka | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
kky | Guugu Yimithirr | 1 | 88 | 7 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 99 |
th | Thai | 8 | 2498 | 301 | 101 | 30 | 5 | 4 | 2 | 1 | 1 | 2951 |
ba | Bashkir | 7 | 749 | 205 | 74 | 26 | 14 | 9 | 4 | 1 | 0 | 1089 |
dsb | Lower Sorbian | 54 | 1209 | 242 | 149 | 53 | 1 | 1 | 3 | 1 | 0 | 1713 |
qu | Quechua | 6 | 546 | 90 | 30 | 11 | 2 | 0 | 0 | 0 | 0 | 685 |
xss | Assan | 0 | 17 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
kbc | Kadiwéu | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
kum | Kumyk | 0 | 117 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 122 |
bal | Balochi | 0 | 302 | 48 | 18 | 10 | 4 | 0 | 0 | 1 | 0 | 383 |
bjz | Baruga | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
aoz | Uab Meto | 0 | 18 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
umb | Umbundu | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
lb | Luxembourgish | 88 | 3688 | 541 | 137 | 33 | 15 | 2 | 0 | 1 | 0 | 4505 |
ny | Chewa | 0 | 12 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
lzh | Classical Chinese | 2 | 5 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
mwr | Marwari | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
ckt | Chukchi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
bug | Buginese | 0 | 36 | 7 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 45 |
fa | Persian | 65 | 5075 | 1191 | 476 | 207 | 80 | 37 | 18 | 12 | 6 | 7167 |
srq | Sirionó | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
kda | Worimi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
szl | Silesian | 0 | 44 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 48 |
yuf | Yavapai | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 2 |
dar | Dargwa | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
krl | Karelian | 0 | 418 | 22 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 442 |
ml | Malayalam | 3 | 143 | 6 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 153 |
rhg | Rohingya | 0 | 187 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 188 |
nbm | Ngbaka Ma'bo | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ymm | Maay | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
gdm | Laal | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
stp | Tepehuán | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
cs | Czech | 143 | 18976 | 1901 | 349 | 110 | 38 | 10 | 5 | 0 | 4 | 21536 |
tkl | Tokelauan | 0 | 9 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 10 |
pcm | Nigerian Pidgin | 4 | 61 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 69 |
nv | Navajo | 2 | 3299 | 402 | 106 | 25 | 4 | 1 | 0 | 1 | 0 | 3840 |
waq | Wagiman | 0 | 4 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 6 |
pal | Middle Persian | 1 | 109 | 30 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 150 |
mrc | Maricopa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
gwc | Kalami | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
mgm | Mambae | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
gv | Manx | 223 | 5103 | 718 | 243 | 123 | 45 | 43 | 13 | 5 | 7 | 6523 |
co | Corsican | 4 | 155 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 164 |
bzd | Bribri | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
kr | Kanuri | 0 | 11 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
zun | Zuni | 0 | 13 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
dlm | Dalmatian | 0 | 694 | 77 | 18 | 1 | 0 | 0 | 0 | 0 | 0 | 790 |
tix | Southern Tiwa | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
sah | Sakha | 0 | 226 | 26 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 261 |
yai | Yaghnobi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
brg | Baure | 0 | 19 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
ks | Kashmiri | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
xlu | Luwian | 0 | 46 | 4 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 54 |
zko | Kott | 1 | 126 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 134 |
wym | Vilamovian | 16 | 941 | 87 | 17 | 3 | 3 | 0 | 1 | 0 | 0 | 1068 |
juc | Jurchen | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
rof | Rombo | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
bar | Bavarian | 0 | 30 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 30 |
blt | Tai Dam | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
apm | Chiricahua | 1 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
ky | Kyrgyz | 0 | 287 | 15 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 304 |
bn | Bengali | 2 | 1703 | 120 | 55 | 11 | 5 | 0 | 2 | 0 | 0 | 1898 |
nan | Min Nan | 17 | 1064 | 63 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 1154 |
khb | Tai Lü | 0 | 32 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
wbb | Wabo | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
prg | Old Prussian | 1 | 223 | 20 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 246 |
peo | Old Persian | 8 | 95 | 13 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 121 |
aeb | Tunisian Arabic | 0 | 20 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 22 |
mk | Macedonian | 13 | 5287 | 616 | 85 | 25 | 5 | 2 | 1 | 0 | 0 | 6034 |
haw | Hawaiian | 6 | 924 | 335 | 101 | 16 | 5 | 2 | 1 | 1 | 0 | 1391 |
amk | Ambai | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
gut | Maléku | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
wrp | Waropen | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
esu | Central Alaskan Yup'ik | 0 | 55 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 61 |
fur | Friulian | 14 | 930 | 141 | 61 | 12 | 1 | 0 | 0 | 0 | 0 | 1159 |
aak | Ankave | 0 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
tcy | Tulu | 0 | 110 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 116 |
xho | Xhosa | 0 | 110 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 112 |
osx | Old Saxon | 84 | 1213 | 259 | 42 | 9 | 1 | 0 | 0 | 0 | 0 | 1608 |
mdr | Mandar | 0 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
mga | Middle Irish | 1 | 50 | 15 | 3 | 1 | 2 | 0 | 0 | 0 | 0 | 72 |
bsb | Brunei Bisaya | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
pad | Paumarí | 0 | 12 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
oc | Occitan | 26 | 1142 | 122 | 7 | 3 | 0 | 0 | 0 | 0 | 0 | 1300 |
roa-nor | Norman | 180 | 6494 | 288 | 35 | 4 | 0 | 0 | 0 | 0 | 0 | 7001 |
vls | Flemish | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
xvs | Vestinian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
kea | Kabuverdianu | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
uln | Unserdeutsch | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
itl | Itelmen | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
ajp | South Levantine Arabic | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
acv | Achumawi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
rw | Kinyarwanda | 0 | 26 | 6 | 0 | 2 | 1 | 0 | 0 | 0 | 0 | 35 |
jbo | Lojban | 361 | 2753 | 75 | 4 | 1 | 0 | 0 | 1 | 0 | 0 | 3195 |
kri | Krio | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
nds-nl | Dutch Low Saxon | 6 | 90 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 105 |
tum | Tumbuka | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
mas | Maasai | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
bcl | Bikol Central | 0 | 10 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
aty | Aneityum | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
wya | Wyandot | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
nn | Norwegian Nynorsk | 1192 | 5826 | 453 | 94 | 17 | 6 | 3 | 1 | 0 | 0 | 7592 |
ski | Sika | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
abq | Abaza | 0 | 30 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31 |
xcr | Carian | 0 | 49 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 50 |
ali | Amaimon | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ch | Chamorro | 0 | 88 | 7 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 98 |
ani | Andi | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
tzj | Tz'utujil | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
xvo | Volscian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
xil | Illyrian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ist | Istriot | 9 | 360 | 35 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 413 |
sms | Skolt Sami | 0 | 227 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 228 |
xav | Xavante | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
sw | Swahili | 51 | 2131 | 118 | 28 | 7 | 0 | 1 | 1 | 0 | 0 | 2337 |
awk | Awabakal | 0 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
nrn | Norn | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
ady | Adyghe | 2 | 2487 | 298 | 77 | 34 | 9 | 0 | 3 | 0 | 0 | 2910 |
sk | Slovak | 14 | 2025 | 245 | 49 | 14 | 3 | 3 | 1 | 0 | 0 | 2354 |
min | Minangkabau | 2 | 60 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 62 |
orv | Old East Slavic | 1 | 60 | 19 | 5 | 2 | 0 | 0 | 0 | 1 | 0 | 88 |
pwn | Paiwan | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
ale | Aleut | 0 | 68 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 68 |
kal | Greenlandic | 5 | 761 | 80 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 853 |
niv | Nivkh | 0 | 13 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
roo | Rotokas | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
kaz | Kazakh | 0 | 224 | 8 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 234 |
bg | Bulgarian | 344 | 5757 | 568 | 244 | 87 | 49 | 30 | 25 | 2 | 6 | 7112 |
Chumashan | Chumashan | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
lou | Louisiana Creole French | 0 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
cy | Welsh | 149 | 2552 | 278 | 70 | 16 | 5 | 4 | 0 | 1 | 1 | 3076 |
bou | Bondei | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
fy | West Frisian | 15 | 835 | 115 | 25 | 3 | 2 | 0 | 0 | 0 | 0 | 995 |
aiw | Aari | 0 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
abe | Abenaki | 1 | 92 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 97 |
tgl | Tagalog | 39 | 1306 | 129 | 32 | 8 | 1 | 1 | 0 | 0 | 0 | 1516 |
pis | Pijin | 3 | 54 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 63 |
akz | Alabama | 0 | 58 | 11 | 3 | 1 | 1 | 0 | 0 | 0 | 0 | 74 |
bi | Bislama | 3 | 36 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 42 |
kw | Cornish | 6 | 668 | 44 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 724 |
bku | Buhid | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
obm | Moabite | 0 | 18 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
kha | Khasi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ain | Ainu | 1 | 104 | 19 | 9 | 3 | 1 | 0 | 0 | 0 | 0 | 137 |
fon | Fon | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
egy | Egyptian | 23 | 715 | 91 | 25 | 9 | 1 | 3 | 2 | 0 | 1 | 870 |
aln | Gheg | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
duj | Dhuwal | 0 | 14 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
ksi | I'saka | 0 | 20 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 27 |
tsg | Tausug | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
cjm | Eastern Cham | 0 | 10 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
lbe | Lak | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
sas | Sasak | 2 | 23 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 26 |
ace | Acehnese | 0 | 62 | 6 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 70 |
km | Khmer | 6 | 737 | 122 | 60 | 38 | 21 | 13 | 7 | 1 | 1 | 1006 |
den | Slavey | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
xrn | Arin | 0 | 32 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34 |
arn | Mapudungun | 0 | 513 | 252 | 59 | 23 | 3 | 2 | 0 | 0 | 0 | 852 |
kj | Ovambo | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
adj | Adioukrou | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ka | Georgian | 109 | 12600 | 860 | 53 | 40 | 2 | 5 | 0 | 1 | 0 | 13670 |
gan | Gan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
bo | Tibetan | 3 | 421 | 70 | 18 | 5 | 1 | 2 | 1 | 0 | 0 | 521 |
ltg | Latgalian | 0 | 91 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 94 |
om | Oromo | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
mul | Translingual | 469 | 34951 | 7080 | 2755 | 975 | 366 | 176 | 82 | 31 | 12 | 46897 |
meu | Motu | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
mh | Marshallese | 1 | 221 | 14 | 8 | 5 | 1 | 0 | 0 | 0 | 1 | 251 |
ne | Nepali | 0 | 76 | 10 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 87 |
bdy | Bandjalang | 0 | 36 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 40 |
hai | Haida | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
rcf | Réunion Creole | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
mns | Mansi | 0 | 15 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 16 |
zai | Isthmus Zapotec | 1 | 21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
xcl | Classical Armenian | 528 | 2966 | 975 | 448 | 203 | 81 | 41 | 25 | 12 | 8 | 5287 |
kac | Jingpho | 0 | 37 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 37 |
bh | Bihari | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
evn | Evenki | 0 | 42 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 45 |
aus-bun | Bunurong | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ssb | Southern Sama | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
wbp | Warlpiri | 0 | 43 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 45 |
rom | Romani | 13 | 359 | 32 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 407 |
aka | Akan | 0 | 28 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 |
akm | Bo | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
udi | Udi | 0 | 39 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 44 |
cpe-spp | Samoan Plantation Pidgin | 0 | 22 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
ayl | Libyan Arabic | 0 | 125 | 36 | 2 | 1 | 1 | 0 | 0 | 0 | 0 | 165 |
cia | Cia-Cia | 0 | 57 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 57 |
ofs | Old Frisian | 12 | 221 | 4 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 239 |
lzz | Laz | 0 | 34 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
lad | Judaeo-Spanish | 23 | 1041 | 57 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 1125 |
sjk | Kemi Sami | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
xhu | Hurrian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
owl | Old Welsh | 0 | 10 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
enm | Middle English | 47 | 1289 | 86 | 24 | 10 | 6 | 4 | 1 | 0 | 0 | 1467 |
alq | Algonquin | 0 | 21 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
xal | Kalmyk | 0 | 16 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
sbf | Shabo | 0 | 68 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 69 |
xeb | Eblaite | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
zku | Kaurna | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
chk | Chuukese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
arg | Aragonese | 1 | 224 | 3 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 231 |
tet | Tetum | 0 | 96 | 7 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | 107 |
uby | Ubykh | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
mvr | Marau | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
kgg | Kusunda | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ast | Asturian | 784 | 4148 | 368 | 81 | 34 | 14 | 5 | 3 | 1 | 2 | 5440 |
mwf | Murrinh-Patha | 0 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
ur | Urdu | 2 | 1041 | 452 | 287 | 188 | 121 | 53 | 45 | 31 | 18 | 2238 |
pot | Potawatomi | 0 | 14 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
yii | Yidiny | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
cu | Old Church Slavonic | 2 | 1871 | 327 | 76 | 13 | 0 | 0 | 1 | 0 | 0 | 2290 |
scn | Sicilian | 53 | 834 | 145 | 41 | 11 | 2 | 0 | 0 | 0 | 1 | 1087 |
nxe | Nage | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
pro | Old Occitan | 16 | 232 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 273 |
aus-dar | Darkinjung | 0 | 88 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 89 |
vin | Vinza | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
hrx | Hunsrik | 0 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26 |
mus | Creek | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
csb | Cassubian | 0 | 419 | 35 | 17 | 3 | 1 | 0 | 0 | 0 | 0 | 475 |
cr | Cree | 0 | 38 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 40 |
xta | Alcozauca Mixtec | 0 | 24 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 26 |
ay | Aymara | 0 | 39 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 41 |
dtp | Dusun | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
wew | Weyewa | 0 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
miq | Miskito | 0 | 14 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
xpo | Pochutec | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
eml | Emiliano-Romagnolo | 0 | 66 | 4 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 76 |
dbj | Ida'an | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
mnk | Mandinka | 0 | 57 | 7 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 68 |
del | Delaware | 0 | 18 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
kut | Kutenai | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
mr | Marathi | 0 | 202 | 25 | 6 | 0 | 1 | 0 | 0 | 0 | 0 | 234 |
ndh | Ndali | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
tr | Turkish | 148 | 11400 | 1989 | 291 | 108 | 67 | 29 | 17 | 5 | 4 | 14058 |
ewe | Ewe | 0 | 466 | 58 | 16 | 9 | 1 | 1 | 0 | 0 | 0 | 551 |
huq | Tsat | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
mai | Maithili | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
et | Estonian | 119 | 3304 | 238 | 41 | 16 | 4 | 1 | 1 | 0 | 0 | 3724 |
dbl | Dyirbal | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
sd | Sindhi | 1 | 53 | 8 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 66 |
coo | Comox | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
gaa | Ga | 1 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 17 |
kem | Kemak | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
smj | Lule Sami | 0 | 36 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
lg | Luganda | 1 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
xas | Kamassian | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
bua | Buryat | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
iii | Nuosu | 0 | 51 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 53 |
yo | Yoruba | 0 | 123 | 4 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 129 |
xsr | Sherpa | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
kg | Kongo | 0 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
hu | Hungarian | 16 | 34309 | 1552 | 344 | 90 | 34 | 13 | 2 | 2 | 0 | 36362 |
gag | Gagauz | 0 | 59 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 64 |
gha | Ghadamès | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
aji | Ajië | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
he | Hebrew | 116 | 5385 | 889 | 259 | 89 | 43 | 13 | 9 | 4 | 0 | 6807 |
alr | Alyutor | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
gld | Nanai | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
wgy | Warrgamay | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
jiv | Shuar | 0 | 28 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31 |
cmn | Mandarin | 2231 | 32065 | 749 | 281 | 129 | 87 | 76 | 48 | 44 | 44 | 35754 |
uk | Ukrainian | 9 | 2003 | 269 | 92 | 38 | 12 | 8 | 0 | 1 | 1 | 2433 |
jct | Krymchak | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
chg | Chagatai | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
izh | Ingrian | 0 | 50 | 3 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 55 |
az | Azerbaijani | 24 | 995 | 65 | 11 | 8 | 2 | 0 | 0 | 1 | 0 | 1106 |
cst | Northern Ohlone | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ve | Venda | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
sa | Sanskrit | 14 | 981 | 454 | 309 | 186 | 133 | 119 | 75 | 65 | 29 | 2365 |
ctu | Chol | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
cv | Chuvash | 0 | 53 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 58 |
osp | Old Spanish | 1 | 40 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 45 |
brc | Berbice Creole Dutch | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
my | Burmese | 1 | 505 | 38 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 552 |
kpg | Kapingamarangi | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
ru | Russian | 505 | 17033 | 3713 | 1414 | 576 | 246 | 125 | 59 | 35 | 13 | 23719 |
wyb | Ngiyambaa | 0 | 37 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
agg | Angor | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
men | Mende | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
pua | Purepecha | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
gwe | Gweno | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
it | Italian | 13567 | 93559 | 13733 | 3485 | 932 | 264 | 101 | 40 | 16 | 9 | 125706 |
iba | Iban | 0 | 29 | 3 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 34 |
pon | Pohnpeian | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
la | Latin | 15878 | 21330 | 6778 | 2459 | 1010 | 376 | 168 | 84 | 31 | 13 | 48127 |
zh | Chinese | 1061 | 39922 | 4637 | 785 | 222 | 52 | 16 | 10 | 8 | 1 | 46714 |
na | Nauruan | 0 | 109 | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 112 |
wba | Warao | 0 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
shh | Shoshone | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
ro | Romanian | 417 | 8430 | 2401 | 690 | 242 | 33 | 16 | 7 | 0 | 0 | 12236 |
kpy | Koryak | 0 | 42 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 42 |
gnd | Zulgo-Gemzek | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
sov | Sonsorolese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ulk | Meriam | 0 | 41 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
gcf | Antillean Creole | 0 | 9 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
pi | Pali | 3 | 40 | 3 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 53 |
so | Somali | 0 | 91 | 6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 98 |
xpg | Phrygian | 0 | 17 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
mn | Mongolian | 3 | 873 | 78 | 25 | 12 | 8 | 8 | 2 | 2 | 1 | 1012 |
bm | Bambara | 1 | 36 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 43 |
sog | Sogdian | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
akl | Aklanon | 0 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
dgi | Northern Dagara | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
bhw | Biak | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
kdd | Yankunytjatjara | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
amu | Amuzgo | 0 | 27 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 28 |
id_ | Indonesian | 14 | 1660 | 191 | 58 | 12 | 9 | 7 | 1 | 1 | 1 | 1954 |
mt | Maltese | 33 | 1461 | 96 | 26 | 0 | 2 | 0 | 0 | 0 | 0 | 1618 |
vec | Venetian | 230 | 1713 | 506 | 82 | 7 | 4 | 0 | 0 | 0 | 0 | 2542 |
bft | Balti | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
omn | Minoan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
lld | Ladin | 219 | 1159 | 192 | 30 | 1 | 0 | 1 | 0 | 0 | 0 | 1602 |
zu | Zulu | 15 | 468 | 80 | 25 | 10 | 0 | 1 | 2 | 0 | 0 | 601 |
hsb | Upper Sorbian | 0 | 226 | 12 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 242 |
new | Newari | 0 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
hi | Hindi | 11 | 2233 | 628 | 330 | 207 | 117 | 67 | 38 | 24 | 18 | 3673 |
abm | Abanyom | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
wlm | Middle Welsh | 0 | 11 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
zkt | Khitan | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
kld | Gamilaraay | 2 | 124 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 141 |
rar | Rarotongan | 0 | 8 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
tiw | Tiwi | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
en | English | 15067 | 349547 | 41557 | 11912 | 5058 | 2543 | 1488 | 911 | 615 | 420 | 429118 |
zmg | Marti Ke | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
yue | Cantonese | 824 | 16943 | 83 | 25 | 7 | 0 | 2 | 1 | 0 | 1 | 17886 |
hnd | Hindko | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
tfn | Dena'ina | 0 | 21 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
sat | Santali | 0 | 78 | 7 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 90 |
hit | Hittite | 0 | 71 | 13 | 6 | 3 | 1 | 2 | 0 | 1 | 0 | 97 |
chm | Mari | 0 | 56 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 56 |
abk | Abkhaz | 0 | 134 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 138 |
sei | Seri | 0 | 55 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 61 |
dgr | Dogrib | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
bvb | Bube | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ps | Pashto | 2 | 561 | 68 | 19 | 4 | 0 | 0 | 0 | 0 | 0 | 654 |
bew | Betawi | 0 | 29 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 32 |
cho | Choctaw | 0 | 82 | 22 | 7 | 6 | 2 | 0 | 0 | 0 | 0 | 119 |
fud | Futunan | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
chl | Cahuilla | 1 | 434 | 26 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 464 |
kju | Kashaya | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
tgk | Tajik | 1 | 623 | 135 | 60 | 30 | 12 | 2 | 3 | 3 | 0 | 869 |
wiv | Vitu | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
liv | Livonian | 4 | 2426 | 215 | 59 | 11 | 0 | 1 | 0 | 1 | 0 | 2717 |
sem-amm | Ammonite | 0 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
kos | Kosraean | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
inh | Ingush | 0 | 16 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 17 |
st | Sotho | 0 | 32 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
pmt | Tuamotuan | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
ket | Ket | 0 | 8 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
ota | Ottoman Turkish | 0 | 401 | 172 | 84 | 35 | 25 | 7 | 6 | 5 | 0 | 735 |
zkz | Khazar | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
lmo | Lombard | 1 | 46 | 5 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
smo | Samoan | 4 | 250 | 31 | 1 | 0 | 2 | 0 | 0 | 0 | 0 | 288 |
bho | Bhojpuri | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
yrk | Nenets | 0 | 29 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 29 |
xlc | Lycian | 0 | 34 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34 |
brh | Brahui | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
ltc | Middle Chinese | 166 | 453 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 624 |
got | Gothic | 63 | 9298 | 71 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 9446 |
mth | Munggui | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
abs | Ambonese | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
ade | Adele | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ga | Irish | 776 | 10508 | 1677 | 600 | 247 | 109 | 56 | 25 | 16 | 5 | 14019 |
sux | Sumerian | 0 | 101 | 17 | 7 | 2 | 2 | 0 | 0 | 0 | 0 | 129 |
auj | Awjila | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
hif | Fiji Hindi | 0 | 114 | 3 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 119 |
alp | Alune | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
mni | Meitei | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
mps | Dadibi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
tyv | Tuvan | 0 | 152 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 159 |
jrb | Judeo-Arabic | 0 | 50 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 53 |
ruo | Istro-Romanian | 1 | 75 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 78 |
aus-wem | Wemba-Wemba | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
zza | Zazaki | 1 | 242 | 32 | 8 | 1 | 0 | 0 | 0 | 0 | 0 | 284 |
ty | Tahitian | 0 | 219 | 15 | 3 | 0 | 1 | 0 | 0 | 0 | 0 | 238 |
tzm | Central Morocco Tamazight | 0 | 113 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 119 |
crk | Plains Cree | 0 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
mjg | Monguor | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ksd | Tolai | 0 | 30 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 31 |
frr | North Frisian | 0 | 187 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 197 |
mer | Meru | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
vol | Volapük | 140 | 2477 | 135 | 36 | 16 | 4 | 2 | 1 | 0 | 0 | 2811 |
hy | Armenian | 110 | 8025 | 1796 | 433 | 147 | 93 | 38 | 12 | 8 | 3 | 10665 |
vro | Võro | 0 | 158 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 162 |
aus-gun | Gunai | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
arp | Arapaho | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
sqi | Albanian | 38 | 3879 | 666 | 179 | 63 | 23 | 8 | 2 | 0 | 0 | 4858 |
sjd | Kildin Sami | 0 | 32 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
hmn | Hmong | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
krc | Karachay-Balkar | 0 | 71 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 73 |
fit | Meänkieli | 0 | 13 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
nds | Low Saxon | 32 | 182 | 62 | 25 | 19 | 6 | 2 | 1 | 2 | 1 | 332 |
nha | Nhanda | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
ms | Malay | 90 | 2916 | 205 | 65 | 18 | 17 | 8 | 0 | 3 | 0 | 3322 |
ku | Kurdish | 11 | 1784 | 70 | 26 | 0 | 8 | 0 | 0 | 0 | 0 | 1899 |
xum | Umbrian | 0 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
de | German | 2365 | 40759 | 4445 | 1146 | 389 | 143 | 59 | 26 | 11 | 4 | 49347 |
wam | Massachusett | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ug | Uyghur | 0 | 351 | 32 | 11 | 2 | 1 | 0 | 0 | 0 | 0 | 397 |
guz | Gusii | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
wuu | Wu | 5 | 13 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 19 |
kam | Kamba | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
ie | Occidental | 0 | 92 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 93 |
ce | Chechen | 0 | 185 | 11 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 201 |
ta | Tamil | 0 | 466 | 40 | 20 | 4 | 1 | 4 | 0 | 2 | 0 | 537 |
kio | Kiowa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
vai | Vai | 5 | 308 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 315 |
osc | Oscan | 0 | 13 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
xmf | Mingrelian | 0 | 36 | 3 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 41 |
pih | Pitcairn-Norfolk | 0 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
khv | Khwarshi | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
za | Zhuang | 0 | 42 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
tpi | Tok Pisin | 11 | 719 | 73 | 25 | 7 | 0 | 3 | 0 | 0 | 0 | 838 |
su | Sundanese | 0 | 53 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 56 |
pgl | Primitive Irish | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
rme | Angloromani | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
gez | Ge'ez | 0 | 19 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
aqc | Archi | 0 | 14 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
yi | Yiddish | 72 | 1338 | 172 | 33 | 10 | 2 | 0 | 1 | 0 | 0 | 1628 |
oma | Omaha-Ponca | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
zmb | Zimba | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
cic | Chickasaw | 0 | 342 | 46 | 9 | 1 | 0 | 0 | 1 | 0 | 0 | 399 |
amn | Amanab | 0 | 54 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
gni | Gooniyandi | 0 | 56 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 57 |
xfa | Faliscan | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
twf | Taos | 1 | 489 | 23 | 6 | 2 | 2 | 0 | 0 | 0 | 0 | 523 |
aus-syd | Sydney | 2 | 23 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
str | Saanich | 0 | 200 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 210 |
kln | Kalenjin | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
din | Dinka | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
tpw | Old Tupi | 1 | 74 | 11 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 89 |
tay | Atayal | 0 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
pga | Juba Arabic | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ha | Hausa | 1 | 327 | 58 | 15 | 5 | 1 | 0 | 0 | 0 | 0 | 407 |
gmy | Mycenaean Greek | 0 | 101 | 16 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 120 |
be | Belarusian | 0 | 1176 | 67 | 6 | 3 | 0 | 0 | 0 | 0 | 0 | 1252 |
kjg | Khmu | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
afb | Gulf Arabic | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
udm | Udmurt | 0 | 21 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
tt | Tatar | 0 | 534 | 26 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 564 |
sva | Svan | 0 | 7 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
yur | Yurok | 0 | 280 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 285 |
gbb | Kaytetye | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
ksh | Kölsch | 0 | 58 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 62 |
num | Niuafo'ou | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
dv | Dhivehi | 0 | 19 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 23 |
rif | Tarifit | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
os | Ossetian | 0 | 233 | 31 | 11 | 1 | 0 | 1 | 0 | 0 | 0 | 277 |
tts | Isan | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
bpl | Broome Pearling Lugger Pidgin | 0 | 15 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
pnw | Panyjima | 0 | 14 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
shn | Shan | 0 | 44 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
wad | Wandamen | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
raj | Rajasthani | 0 | 122 | 25 | 9 | 7 | 1 | 0 | 0 | 0 | 0 | 164 |
mfe | Mauritian Creole | 8 | 117 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 134 |
mod | Mobilian | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
crg | Michif | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
xmk | Ancient Macedonian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
gu | Gujarati | 0 | 219 | 25 | 5 | 3 | 2 | 0 | 0 | 0 | 1 | 255 |
arc | Aramaic | 3 | 1423 | 374 | 91 | 28 | 8 | 1 | 0 | 0 | 0 | 1928 |
aus-wwg | Woiwurrung | 0 | 47 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 52 |
kjr | Kurudu | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
tna | Tacana | 0 | 12 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
xld | Lydian | 0 | 34 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
ca | Catalan | 2096 | 8988 | 919 | 263 | 79 | 32 | 7 | 1 | 2 | 3 | 12390 |
srn | Sranan Tongo | 0 | 279 | 29 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 310 |
moe | Innu-aimun | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
yut | Yopno | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
phn | Phoenician | 0 | 108 | 7 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 117 |
kab | Kabyle | 0 | 52 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 54 |
solresol | Solresol | 0 | 28 | 7 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 37 |
sn | Shona | 0 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 |
cax | Chiquitano | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
frp | Franco-Provençal | 1 | 46 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 47 |
dng | Dungan | 0 | 19 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
gvf | Golin | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
bpy | Bishnupriya Manipuri | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
pcd | Picard | 0 | 32 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 34 |
gd | Scottish Gaelic | 541 | 6415 | 1230 | 442 | 208 | 78 | 32 | 23 | 13 | 5 | 8987 |
myp | Pirahã | 0 | 10 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
wim | Wik-Mungknh | 0 | 24 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 24 |
stq | Saterland Frisian | 1 | 115 | 15 | 1 | 3 | 0 | 0 | 0 | 0 | 0 | 135 |
mha | Manda | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
sg | Sango | 0 | 25 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
har | Harari | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
myx | Masaba | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
otk | Old Turkic | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
tli | Tlingit | 0 | 34 | 6 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 41 |
grc | Ancient Greek | 107 | 3728 | 812 | 462 | 269 | 167 | 108 | 70 | 43 | 20 | 5786 |
kyi | Kiput | 0 | 14 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 |
vi | Vietnamese | 523 | 8884 | 408 | 73 | 25 | 10 | 2 | 0 | 0 | 1 | 9926 |
mi | Maori | 0 | 403 | 50 | 10 | 5 | 0 | 1 | 0 | 0 | 0 | 469 |
efi | Efik | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
ceb | Cebuano | 12 | 80 | 11 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 104 |
naq | Nama | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
and | Ansus | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
ilo | Ilokano | 0 | 40 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 43 |
sqt | Soqotri | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
zpq | Zoogocho Zapotec | 0 | 10 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 |
tso | Tsonga | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
to | Tongan | 1 | 107 | 12 | 2 | 0 | 1 | 0 | 0 | 0 | 0 | 123 |
niu | Niuean | 0 | 40 | 2 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 44 |
luy | Luhya | 0 | 36 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 36 |
aie | Amara | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
adt | Adnyamathanha | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
lv | Latvian | 802 | 55986 | 31508 | 5355 | 2139 | 4976 | 2974 | 74 | 3226 | 22 | 107062 |
fr | French | 7856 | 39919 | 6459 | 1550 | 512 | 251 | 110 | 45 | 27 | 13 | 56742 |
wa | Walloon | 9 | 315 | 24 | 4 | 4 | 2 | 1 | 1 | 0 | 0 | 360 |
tcs | Torres Strait Creole | 0 | 106 | 7 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 116 |
ake | Akawaio | 0 | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
gl | Galician | 900 | 4178 | 477 | 73 | 33 | 7 | 5 | 5 | 0 | 0 | 5678 |
loz | Lozi | 0 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
myh | Makah | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
lt | Lithuanian | 84 | 17575 | 2821 | 490 | 69 | 7 | 1 | 1 | 1 | 1 | 21050 |
fo | Faroese | 72 | 3684 | 617 | 172 | 68 | 28 | 13 | 2 | 2 | 1 | 4659 |
yap | Yapese | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
sjt | Ter Sami | 0 | 107 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 112 |
ett | Etruscan | 2 | 4 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
ik | Inupiaq | 0 | 19 | 5 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 25 |
chh | Chinook | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
da | Danish | 380 | 24779 | 1674 | 482 | 189 | 91 | 53 | 33 | 9 | 4 | 27694 |
caa | Ch'orti' | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 10 |
lre | Laurentian | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
bjn | Banjarese | 0 | 42 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 45 |
xve | Venetic | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
lun | Lunda | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
tmh | Tamashaq | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
ht | Haitian Creole | 2 | 742 | 31 | 6 | 2 | 1 | 1 | 0 | 0 | 0 | 785 |
fro | Old French | 540 | 3923 | 698 | 120 | 34 | 3 | 2 | 0 | 0 | 0 | 5320 |
lez | Lezgian | 0 | 49 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 50 |
win | Winnebago | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
alu | 'Are'are | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
com | Comanche | 0 | 60 | 5 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 67 |
kbd | Kabardian | 0 | 436 | 54 | 6 | 6 | 0 | 0 | 1 | 1 | 0 | 504 |
xto | Tocharian | 1 | 187 | 24 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 214 |
moh | Mohawk | 0 | 4 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
xpm | Pumpokol | 0 | 46 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 46 |
kv | Komi | 0 | 37 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 39 |
www | Wawa | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
tkr | Tsakhur | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
mad | Madurese | 0 | 15 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 15 |
ko | Korean | 1344 | 16106 | 1023 | 241 | 78 | 40 | 15 | 5 | 12 | 10 | 18874 |
rue | Rusyn | 0 | 36 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
lvk | Lavukaleve | 0 | 12 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
nov | Novial | 9 | 596 | 7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 612 |
kyh | Karok | 0 | 2 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 |
uga | Ugaritic | 0 | 325 | 72 | 16 | 3 | 1 | 1 | 0 | 0 | 0 | 418 |
quc | K'iche' | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
aar | Afar | 0 | 21 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 21 |
ach | Acholi | 0 | 18 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 19 |
vmb | Mbabaram | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
mpm | Yosondúa Mixtec | 0 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
saw | Sawi | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
aaa | Ghotuo | 1 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
lua | Luba-Kasai | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
pim | Powhatan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
xls | Lusitanian | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
mak | Makassarese | 0 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 11 |
adz | Adzera | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
nay | Ngarrindjeri | 0 | 142 | 7 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 152 |
xpu | Punic | 0 | 36 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 39 |
nod | Northern Thai | 0 | 37 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 38 |
rop | Kriol | 0 | 27 | 4 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 33 |
ish | Esan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
gay | Gayo | 0 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 |
io | Ido | 680 | 5005 | 218 | 42 | 12 | 2 | 0 | 0 | 0 | 0 | 5959 |
ppm | Papuma | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
war | Waray-Waray | 0 | 4 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 6 |
afr | Afrikaans | 120 | 1704 | 176 | 34 | 12 | 5 | 2 | 1 | 1 | 0 | 2055 |
myv | Erzya | 0 | 66 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 73 |
chp | Chipewyan | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
roa-tara | Tarantino | 13 | 490 | 29 | 3 | 0 | 0 | 1 | 0 | 0 | 0 | 536 |
apw | Western Apache | 0 | 22 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 22 |
chr | Cherokee | 0 | 331 | 21 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 353 |
dz | Dzongkha | 0 | 10 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 12 |
apc | North Levantine Arabic | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
aii | Assyrian Neo-Aramaic | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
ibo | Igbo | 0 | 31 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 35 |
mixe | Mixe | 0 | 5 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
pl | Polish | 636 | 15881 | 1442 | 640 | 685 | 23 | 10 | 2 | 8 | 2 | 19329 |
fkv | Kven | 0 | 20 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 20 |
bla | Blackfoot | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
el | Greek | 83 | 19167 | 4301 | 5088 | 895 | 104 | 102 | 10 | 4 | 1 | 29755 |
seu | Serui-Laut | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
br | Breton | 9 | 952 | 70 | 16 | 6 | 2 | 1 | 0 | 0 | 0 | 1056 |
tig | Tigre | 0 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 9 |
kca | Khanty | 0 | 41 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 41 |
fj | Fijian | 2 | 146 | 12 | 0 | 0 | 3 | 0 | 0 | 0 | 0 | 163 |
pt | Portuguese | 5596 | 23275 | 4550 | 1309 | 408 | 132 | 63 | 23 | 19 | 4 | 35379 |
hop | Hopi | 0 | 11 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 12 |
rut | Rutul | 0 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 8 |
vot | Votic | 0 | 220 | 5 | 2 | 0 | 0 | 1 | 0 | 0 | 0 | 228 |
roa-gal | Gallo | 0 | 152 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 152 |
pau | Palauan | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
zav | Yatzachi Zapotec | 0 | 16 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 16 |
mrh | Mara Chin | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
xbc | Bactrian | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
no | Norwegian | 48 | 4437 | 314 | 58 | 20 | 6 | 2 | 2 | 0 | 0 | 4887 |
kim | Tofa | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
arw | Arawak | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
sth | Shelta | 0 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 6 |
hil | Hiligaynon | 3 | 1357 | 137 | 25 | 3 | 0 | 0 | 0 | 0 | 0 | 1525 |
aus-gab | Gabi | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
goh | Old High German | 9 | 882 | 83 | 10 | 2 | 0 | 0 | 0 | 0 | 0 | 986 |
rm | Romansch | 24 | 2021 | 92 | 9 | 1 | 0 | 0 | 0 | 0 | 0 | 2147 |
vmw | Makhuwa | 0 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 |
arz | Egyptian Arabic | 1 | 288 | 25 | 8 | 3 | 1 | 0 | 0 | 0 | 0 | 326 |
lif | Limbu | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 |
nio | Nganasan | 0 | 10 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 18 |
dlg | Dolgan | 0 | 25 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 30 |
kjh | Khakas | 0 | 201 | 31 | 7 | 1 | 2 | 0 | 0 | 0 | 0 | 242 |
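In the rows above, the "Total" cell appears to equal the sum of the ten displayed columns (0 through 9 meanings). The sketch below checks that for rows copied from the table. The helper names `parse_row` and `total_matches` are hypothetical and are not part of the parser that produced this page.

```python
def parse_row(row: str):
    """Split a 'code | language | c0 | ... | c9 | total |' row into its parts."""
    cells = [c.strip() for c in row.strip().strip("|").split("|")]
    code, language = cells[0], cells[1]
    counts = [int(c) for c in cells[2:-1]]  # columns 0..9
    total = int(cells[-1])                  # last column ("Total")
    return code, language, counts, total


def total_matches(row: str) -> bool:
    """Return True when the displayed columns add up to the row's Total."""
    _, _, counts, total = parse_row(row)
    return sum(counts) == total


# Rows copied verbatim from the table above.
ITALIAN = "it | Italian | 13567 | 93559 | 13733 | 3485 | 932 | 264 | 101 | 40 | 16 | 9 | 125706 |"
LATIN = "la | Latin | 15878 | 21330 | 6778 | 2459 | 1010 | 376 | 168 | 84 | 31 | 13 | 48127 |"

for row in (ITALIAN, LATIN):
    code, language, counts, total = parse_row(row)
    print(code, language, sum(counts), total, total_matches(row))
```

A row that fails this check would suggest a transcription error rather than a property of the database, since no such mismatch was found in the rows tried here.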
Part of speech
Total (all entries)
Number of words and senses
Rows in the table: 52
Unique Strings | Total Word-Sense Pairs | POS | Templates | Max Senses | Entry | Short name |
---|---|---|---|---|---|---|
4211 | 6983 | initialism | initialism | 71 | AAA 71, CCA 40, AC 26 | |
41212 | 41542 | hanzi | hanzi | 10 | 即 10, 场 7, 托 7 | |
5048 | 6807 | suffix | suffix | 34 | -at 34, -ат 34, -inho 15 | |
9306 | 9580 | hanja | hanja | 17 | 体 17, 仚 8, 望 8 | |
2854 | 4245 | conjunction | conjunction | 212 | πρίν 212, ἐπεί 144, ὥστε 53 | |
31 | 31 | interfix | interfix | 1 | -a- 1, -e- 1, -e- 1 | |
4224 | 5259 | letter | letter | 26 | ⠱ 26, ⠣ 24, ⠪ 24 | |
13 | 14 | expression | expression | 2 | 実は 2, aŭ 1, どういたしまして 1 | |
867585 | 1116909 | noun | noun | 130 | heaven 130, head 94, mark 78 | |
161 | 187 | affix | affix | 6 | たく 6, 牙 3, 梨 3 | |
5 | 5 | measure word | measure word | 1 | 遍 1, 杯 1, 盆 1 | |
60 | 63 | infix | infix | 2 | -h- 2, -ma- 2, -n- 2 | |
8 | 8 | lujvo | lujvo | 1 | banskepre 1, bavlamdei 1, prulamdei 1 | |
31 | 165 | kanji reading | kanji reading | 29 | てい 29, ち 20, ちゅう 9 | |
16 | 16 | circumfix | circumfix | 1 | უ- -ო 1, a- -ing 1, em- -en 1 | |
38 | 47 | adnominal | adnominal | 3 | その 3, とんだ 3, この 2 | |
1103 | 1243 | contraction | contraction | 6 | gotcha 6, i'w 4, thou'dst 4 | |
1335 | 2349 | syllable | syllable | 85 | 질 85, 가 44, 차 35 | |
249813 | 355376 | verb | verb, verb prefix, verb form | 251 | ἔχω 251, χράω 189, ἵστημι 152 | |
48532 | 95212 | participle | participle | 14 | iacens 14, iaciturus 14, productus 9 | |
1095 | 1389 | determiner | determiner | 30 | πλεῖστος 30, ἅπας 15, which 5 | |
1 | 1 | pinyin | pinyin | 1 | Bìxiù 1 | |
293 | 432 | root | root | 30 | स्था 30, बुध् 13, वच् 8 | |
108665 | 127533 | proper noun | proper noun | 34 | Neustadt 34, こうじ 33, たかし 32 | |
44 | 82 | predicative | predicative | 5 | громко 5, ясно 5, должен 4 | |
3978 | 4760 | symbol | symbol | 30 | A 30, C 9, z 9 | |
12535 | 13291 | kanji | kanji | 11 | 随 11, 精 11, 獲 8 | |
53330 | 63982 | adverb | adverb | 157 | ὅπως 157, πλήν 65, ἔπειτα 63 | |
11 | 31 | preverb | preverb | 6 | ჩა- 6, მი- 4, წა- 4 | |
466 | 572 | article | article | 15 | të 15, a 7, des 5 | |
2476 | 2684 | idiom | idiom | 4 | 不上不下 4, 生拉硬拽 4, 三更燈火五更雞 3 | |
6136 | 7295 | interjection | interjection | 10 | oh 10, а 7, no worries 7 | |
138 | 195 | counter | counter | 8 | か 8, 本 4, だい 4 | |
4140 | 4988 | abbreviation | abbreviation | 11 | D 11, T 11, lv 10 | |
125 | 144 | classifier | classifier | 5 | 道 5, ដៃ 3, 组 3 | |
1974 | 2094 | proverb | proverb | 4 | 새옹지마 4, 三十年河东,三十年河西 4, 三十年河東,三十年河西 4 | |
5310 | 5780 | phrase | phrase | 7 | rejse sig 7, aaniish naa 5, the fuck 4 | |
9296 | 9619 | numeral | ordinal numeral, numeral, ordinal number, cardinal numeral, number, cardinal number | 19 | शत 19, viens 13, vienu 6 | |
1343 | 1393 | gismu | gismu | 4 | bongu 4, fanva 3, fanza 2 | |
8341 | 11575 | pronoun | pronoun | 185 | ὅσος 185, ὅδε 126, ὅσπερ 37 | |
711 | 1005 | acronym | acronym | 24 | CED 24, CET 22, KOS 17 | |
980 | 1441 | particle | particle | 52 | ἕως 52, אין 18, lai 7 | |
275915 | 392355 | adjective | adjective, quasi-adjective, adjectival noun | 101 | ὕστερος 101, περισσός 63, ἱκανός 47 | |
437 | 598 | prepositional phrase | prepositional phrase | 7 | in hand 7, in front 5, at the high port 5 | |
7 | 15 | gerund | gerund | 4 | laborandum 4, definiendum 3, sufflaminandum 2 | |
50 | 51 | katakana character | katakana character | 2 | ッ 2, ヲ 1, ィ 1 | |
8 | 8 | correlative | correlative | 1 | nenial 1, neniel 1, nenies 1 | |
32986 | 50711 | han character | han character | 17 | 正 17, 㴴 13, 䌛 13 | |
4175 | 7387 | preposition | preposition | 196 | of 196, ὑπό 160, ל־ 38 | |
1 | 1 | hanja reading | hanja reading | 1 | 겸 1 | |
3943 | 5101 | prefix | prefix | 61 | πρό 61, μετά 9, meta- 9 | |
545 | 709 | postposition | postposition | 7 | vasten 7, için 6, को 6 |
Polysemy information
Rows in the table: 52
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 3081 | 1130 | 3902 | 1,66 | 3,45 |
hanzi | 40978 | 234 | 564 | 1,01 | 2,41 |
suffix | 4109 | 939 | 2698 | 1,35 | 2,87 |
hanja | 9127 | 179 | 453 | 1,03 | 2,53 |
conjunction | 2310 | 544 | 1935 | 1,49 | 3,56 |
interfix | 31 | 0 | 0 | 1,0 | -1,0 |
letter | 3975 | 249 | 1284 | 1,25 | 5,16 |
expression | 12 | 1 | 2 | 1,08 | 2,0 |
noun | 720849 | 146736 | 396060 | 1,29 | 2,7 |
affix | 144 | 17 | 43 | 1,16 | 2,53 |
measure word | 5 | 0 | 0 | 1,0 | -1,0 |
infix | 57 | 3 | 6 | 1,05 | 2,0 |
lujvo | 8 | 0 | 0 | 1,0 | -1,0 |
kanji reading | 8 | 23 | 157 | 5,32 | 6,83 |
circumfix | 16 | 0 | 0 | 1,0 | -1,0 |
adnominal | 31 | 7 | 16 | 1,24 | 2,29 |
contraction | 995 | 108 | 248 | 1,13 | 2,3 |
syllable | 1256 | 79 | 1093 | 1,76 | 13,84 |
verb | 195210 | 54603 | 160166 | 1,42 | 2,93 |
participle | 26987 | 21545 | 68225 | 1,96 | 3,17 |
determiner | 922 | 173 | 467 | 1,27 | 2,7 |
pinyin | 1 | 0 | 0 | 1,0 | -1,0 |
root | 236 | 57 | 196 | 1,47 | 3,44 |
proper noun | 96913 | 11752 | 30620 | 1,17 | 2,61 |
predicative | 22 | 22 | 60 | 1,86 | 2,73 |
symbol | 3535 | 443 | 1225 | 1,2 | 2,77 |
kanji | 12137 | 398 | 1154 | 1,06 | 2,9 |
adverb | 46269 | 7061 | 17713 | 1,2 | 2,51 |
preverb | 3 | 8 | 28 | 2,82 | 3,5 |
article | 408 | 58 | 164 | 1,23 | 2,83 |
idiom | 2295 | 181 | 389 | 1,08 | 2,15 |
interjection | 5309 | 827 | 1986 | 1,19 | 2,4 |
counter | 104 | 34 | 91 | 1,41 | 2,68 |
abbreviation | 3685 | 455 | 1303 | 1,2 | 2,86 |
classifier | 113 | 12 | 31 | 1,15 | 2,58 |
proverb | 1869 | 105 | 225 | 1,06 | 2,14 |
phrase | 4935 | 375 | 845 | 1,09 | 2,25 |
numeral | 9072 | 224 | 547 | 1,03 | 2,44 |
gismu | 1297 | 46 | 96 | 1,04 | 2,09 |
pronoun | 6710 | 1631 | 4865 | 1,39 | 2,98 |
acronym | 593 | 118 | 412 | 1,41 | 3,49 |
particle | 763 | 217 | 678 | 1,47 | 3,12 |
adjective | 213327 | 62588 | 179028 | 1,42 | 2,86 |
prepositional phrase | 322 | 115 | 276 | 1,37 | 2,4 |
gerund | 3 | 4 | 12 | 2,14 | 3,0 |
katakana character | 49 | 1 | 2 | 1,02 | 2,0 |
correlative | 8 | 0 | 0 | 1,0 | -1,0 |
han character | 22155 | 10831 | 28556 | 1,54 | 2,64 |
preposition | 3033 | 1142 | 4354 | 1,77 | 3,81 |
hanja reading | 1 | 0 | 0 | 1,0 | -1,0 |
prefix | 3256 | 687 | 1845 | 1,29 | 2,69 |
postposition | 439 | 106 | 270 | 1,3 | 2,55 |
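The two average columns can be reproduced from the three count columns. This is a minimal sketch, assuming that "Average Polysemy Including Monosemous Words" is total senses divided by total words, that "Excluding" is polysemous senses divided by polysemous words, and that `-1,0` is a placeholder used when a POS has no polysemous words. These assumptions match the rows tried here, but the original computation is not shown on this page. The function name `polysemy_averages` is hypothetical. The table writes decimals with a comma; the code uses a decimal point.

```python
def polysemy_averages(mono: int, poly_words: int, poly_senses: int):
    """Derive word/sense totals and both averages from one table row.

    mono        -- monosemous words (each contributes exactly one sense)
    poly_words  -- words with two or more senses
    poly_senses -- senses belonging to those polysemous words
    """
    words = mono + poly_words    # "Unique Strings" in the counts table
    senses = mono + poly_senses  # "Total Word-Sense Pairs" in the counts table
    including = senses / words
    # Assumed sentinel: -1.0 when there are no polysemous words.
    excluding = poly_senses / poly_words if poly_words else -1.0
    return words, senses, round(including, 2), round(excluding, 2)


# Values copied from the "initialism", "noun" and "interfix" rows above.
print(polysemy_averages(3081, 1130, 3902))        # (4211, 6983, 1.66, 3.45)
print(polysemy_averages(720849, 146736, 396060))  # (867585, 1116909, 1.29, 2.7)
print(polysemy_averages(31, 0, 0))                # (31, 31, 1.0, -1.0)
```

The first two returned values also tie this table to the "Number of words and senses" table: for each POS, monosemous plus polysemous words gives "Unique Strings", and monosemous words plus polysemous senses gives "Total Word-Sense Pairs".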
English entries
Number of words with unknown POS: 4006
Number of words and senses
Rows in the table: 31
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
3483 | 6199 | initialism | 71 | AAA 71, CCA 40, AC 26 |
22700 | 31590 | proper noun | 33 | Franklin 33, Mid-Atlantic 32, Greenville 25 |
665 | 978 | suffix | 18 | -able 18, -ist 18, -es 8 |
90 | 139 | symbol | 30 | A 30, b 5, d 3 |
14490 | 16763 | adverb | 57 | quite 57, peculiarly 30, up 14 |
195 | 371 | conjunction | 78 | and 78, as 11, but 7 |
4 | 4 | interfix | 1 | -i- 1, -k- 1, -n- 1 |
56 | 57 | letter | 2 | k 2, I 1, a 1 |
209509 | 279399 | noun | 130 | heaven 130, head 94, mark 78 |
16 | 29 | article | 8 | the 8, a 7, an 1 |
24 | 25 | idiom | 2 | old enough to vote 2, be absorbed by 1, bring sand to the beach 1 |
1704 | 2027 | interjection | 10 | oh 10, no worries 7, huh 6 |
1561 | 2230 | abbreviation | 11 | D 11, T 11, lv 10 |
5 | 5 | affix | 1 | -i- 1, -kin- 1, -o- 1 |
46 | 48 | infix | 2 | -h- 2, -ma- 2, -a- 1 |
470 | 512 | proverb | 3 | it's a long road that has no turning 3, there but for the grace of God go I 3, six of one, half a dozen of the other 2 |
1316 | 1482 | phrase | 5 | excuse me 5, the fuck 4, what's up 4 |
3 | 3 | circumfix | 1 | a- -ing 1, em- -en 1, en- -en 1 |
359 | 413 | numeral | 3 | billion 3, eighty-eight 3, novemdecillion 3 |
490 | 599 | contraction | 6 | gotcha 6, thou'dst 4, whatcha 4 |
363 | 476 | pronoun | 12 | him 12, me 9, myself 5 |
610 | 898 | acronym | 24 | CED 24, CET 22, KOS 17 |
69693 | 95334 | verb | 165 | go 165, run 121, take 116 |
84680 | 104199 | adjective | 66 | gay 66, free 48, deep 44 |
18 | 25 | particle | 4 | like 4, 's 2, no 2 |
395 | 551 | prepositional phrase | 7 | in hand 7, in front 5, at the high port 5 |
1 | 1 | participle | 1 | bawn 1 |
105 | 165 | determiner | 6 | some 6, all 6, which 5 |
426 | 1069 | preposition | 196 | of 196, in 40, over 30 |
1021 | 1348 | prefix | 9 | be- 9, meta- 9, paleo- 5 |
1 | 1 | postposition | 1 | aside 1 |
Polysemy information
Rows in the table: 31
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 2401 | 1082 | 3798 | 1,78 | 3,51 |
proper noun | 17998 | 4702 | 13592 | 1,39 | 2,89 |
suffix | 486 | 179 | 492 | 1,47 | 2,75 |
symbol | 78 | 12 | 61 | 1,54 | 5,08 |
adverb | 12902 | 1588 | 3861 | 1,16 | 2,43 |
conjunction | 141 | 54 | 230 | 1,9 | 4,26 |
interfix | 4 | 0 | 0 | 1,0 | -1,0 |
letter | 55 | 1 | 2 | 1,02 | 2,0 |
noun | 174902 | 34607 | 104497 | 1,33 | 3,02 |
article | 14 | 2 | 15 | 1,81 | 7,5 |
idiom | 23 | 1 | 2 | 1,04 | 2,0 |
interjection | 1484 | 220 | 543 | 1,19 | 2,47 |
abbreviation | 1233 | 328 | 997 | 1,43 | 3,04 |
affix | 5 | 0 | 0 | 1,0 | -1,0 |
infix | 44 | 2 | 4 | 1,04 | 2,0 |
proverb | 430 | 40 | 82 | 1,09 | 2,05 |
phrase | 1182 | 134 | 300 | 1,13 | 2,24 |
circumfix | 3 | 0 | 0 | 1,0 | -1,0 |
numeral | 318 | 41 | 95 | 1,15 | 2,32 |
contraction | 411 | 79 | 188 | 1,22 | 2,38 |
pronoun | 303 | 60 | 173 | 1,31 | 2,88 |
acronym | 498 | 112 | 400 | 1,47 | 3,57 |
verb | 59562 | 10131 | 35772 | 1,37 | 3,53 |
adjective | 72841 | 11839 | 31358 | 1,23 | 2,65 |
particle | 13 | 5 | 12 | 1,39 | 2,4 |
prepositional phrase | 285 | 110 | 266 | 1,39 | 2,42 |
participle | 1 | 0 | 0 | 1,0 | -1,0 |
determiner | 73 | 32 | 92 | 1,57 | 2,88 |
preposition | 317 | 109 | 752 | 2,51 | 6,9 |
prefix | 818 | 203 | 530 | 1,32 | 2,61 |
postposition | 1 | 0 | 0 | 1,0 | -1,0 |
Russian entries
Number of words with unknown POS: 84
Number of words and senses
Rows in the table: 27
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
13 | 14 | initialism | 2 | ВДНХ 2, ЛГБТ 1, НПО 1 |
1808 | 1987 | proper noun | 6 | Русь 6, Тура 6, Виктория 4 |
42 | 52 | suffix | 4 | -ок 4, -ев 3, -ик 2 |
41 | 77 | predicative | 5 | громко 5, ясно 5, должен 4 |
2 | 2 | symbol | 1 | × 1, @ 1 |
1094 | 1469 | adverb | 9 | в целом 9, неясно 7, так 7 |
63 | 82 | conjunction | 3 | и 3, как 3, или 3 |
46 | 53 | letter | 3 | ь 3, у 2, а 2 |
12890 | 18243 | noun | 13 | путаница 13, ход 12, полнота 11 |
36 | 39 | idiom | 2 | лёгок на помине 2, на славу 2, не в своей тарелке 2 |
246 | 307 | interjection | 5 | ведь 5, есть 4, на здоровье 4 |
1 | 1 | affix | 1 | -ун- 1 |
205 | 249 | abbreviation | 10 | Л 10, РКК 5, ср. 4 |
179 | 191 | phrase | 3 | без ума 3, что это такое 3, а то 2 |
81 | 83 | proverb | 2 | с глаз долой — из сердца вон 2, куй железо, пока горячо 2, неча на зеркало пенять, коли рожа крива 1 |
73 | 80 | numeral | 7 | один 7, сорока 2, биллион 1 |
1 | 1 | syllable | 1 | 대 1 |
183 | 261 | pronoun | 4 | этот 4, своей 4, той 3 |
2839 | 4250 | adjective | 13 | успокаивающий 13, раздражающий 12, предшествующий 11 |
33 | 49 | particle | 5 | же 5, не 5, вряд ли 2 |
3048 | 6160 | verb | 13 | лезть 13, развести 11, проводить 10 |
20 | 20 | acronym | 1 | СПИД 1, Би-би-си 1, ЦРУ 1 |
11 | 14 | prepositional phrase | 2 | по меньшей мере 2, в ссоре 2, с первого взгляда 2 |
67 | 122 | participle | 7 | отклоняющийся 7, отходя 6, засевшей 4 |
17 | 23 | determiner | 3 | весь 3, всякий 3, иной 2 |
112 | 186 | preposition | 18 | по 18, за 6, через 6 |
76 | 104 | prefix | 5 | за- 5, раз- 5, под- 3 |
Polysemy information
Rows in the table: 27
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 12 | 1 | 2 | 1,08 | 2,0 |
proper noun | 1658 | 150 | 329 | 1,1 | 2,19 |
suffix | 36 | 6 | 16 | 1,24 | 2,67 |
predicative | 21 | 20 | 56 | 1,88 | 2,8 |
symbol | 2 | 0 | 0 | 1,0 | -1,0 |
adverb | 835 | 259 | 634 | 1,34 | 2,45 |
conjunction | 48 | 15 | 34 | 1,3 | 2,27 |
letter | 40 | 6 | 13 | 1,15 | 2,17 |
noun | 9664 | 3226 | 8579 | 1,42 | 2,66 |
idiom | 33 | 3 | 6 | 1,08 | 2,0 |
interjection | 203 | 43 | 104 | 1,25 | 2,42 |
affix | 1 | 0 | 0 | 1,0 | -1,0 |
abbreviation | 181 | 24 | 68 | 1,21 | 2,83 |
phrase | 169 | 10 | 22 | 1,07 | 2,2 |
proverb | 79 | 2 | 4 | 1,02 | 2,0 |
numeral | 71 | 2 | 9 | 1,1 | 4,5 |
syllable | 1 | 0 | 0 | 1,0 | -1,0 |
pronoun | 121 | 62 | 140 | 1,43 | 2,26 |
adjective | 2090 | 749 | 2160 | 1,5 | 2,88 |
particle | 25 | 8 | 24 | 1,48 | 3,0 |
verb | 1510 | 1538 | 4650 | 2,02 | 3,02 |
acronym | 20 | 0 | 0 | 1,0 | -1,0 |
prepositional phrase | 8 | 3 | 6 | 1,27 | 2,0 |
participle | 38 | 29 | 84 | 1,82 | 2,9 |
determiner | 13 | 4 | 10 | 1,35 | 2,5 |
preposition | 82 | 30 | 104 | 1,66 | 3,47 |
prefix | 58 | 18 | 46 | 1,37 | 2,56 |
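
The two average columns above are consistent with a simple ratio of senses to words. The following Python sketch is inferred from the table values and is not the code that generated this page. The name `average_polysemy` is hypothetical, and `-1.0` is assumed to be a placeholder for "no polysemous words". Results use a decimal point, whereas the table uses a decimal comma.

```python
def average_polysemy(mono, poly_words, poly_senses):
    """Return (average including monosemous, average excluding monosemous).

    mono        -- monosemous words (each contributes exactly one sense)
    poly_words  -- words with two or more senses
    poly_senses -- total senses of those polysemous words
    Assumes the row has at least one word (mono + poly_words > 0).
    """
    including = round((mono + poly_senses) / (mono + poly_words), 2)
    # Assumption: -1.0 marks an undefined average (no polysemous words).
    excluding = round(poly_senses / poly_words, 2) if poly_words else -1.0
    return including, excluding


# Rows taken from the Russian polysemy table above.
print(average_polysemy(12, 1, 2))          # initialism
print(average_polysemy(2, 0, 0))           # symbol
print(average_polysemy(1510, 1538, 4650))  # verb
```

For the three sample rows this reproduces the listed values: 1,08 and 2,0 for initialism, 1,0 and -1,0 for symbol, 2,02 and 3,02 for verb.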
Finnish entries
Number of words with unknown POS: 20
Number of words and senses
Rows in the table: 23
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
52 | 55 | initialism | 2 | BKT 2, DI 2, NKT 2 |
2272 | 2634 | proper noun | 4 | Jumalan Karitsa 4, Aura 4, Lappi 4 |
155 | 295 | suffix | 27 | -nne 27, -si 9, -nsä 9 |
48 | 52 | proverb | 2 | parempi katsoa kuin katua 2, poissa silmistä, poissa mielestä 2, yhteistyö on voimaa 2 |
348 | 380 | phrase | 4 | närhen munat 4, ottaa lämpöä 4, sen paremmin 4 |
202 | 207 | numeral | 3 | seiska 3, yhdes 2, kybä 2 |
3 | 3 | symbol | 1 | Gt 1, Mt 1, kt 1 |
2445 | 2789 | adverb | 8 | hajalle 8, hajallaan 6, niin 6 |
56 | 69 | conjunction | 5 | kun 5, niin kuin 3, sekä 2 |
37 | 37 | contraction | 1 | son 1, ehkei 1, ehken 1 |
34 | 34 | letter | 1 | I 1, C 1, X 1 |
148 | 230 | pronoun | 10 | niiden 10, kuka 4, itse 4 |
4 | 4 | acronym | 1 | Kela 1, SKY 1, STT 1 |
11 | 39 | particle | 6 | -hän 6, -kaan 5, -kö 4 |
11085 | 13865 | verb | 17 | avata 17, purkaa 16, kiertää 16 |
6637 | 7916 | adjective | 15 | sopiva 15, heikko 11, tavallinen 8 |
41603 | 48241 | noun | 13 | kuori 13, juttu 12, varsi 11 |
14 | 14 | idiom | 1 | ennen aikojaan 1, koko nimi 1, saada vihiä 1 |
16 | 19 | preposition | 2 | kiinni 2, kautta 2, kuin mitä 2 |
253 | 274 | interjection | 4 | älä 4, hei 3, terve 2 |
177 | 199 | prefix | 4 | yli- 4, avo- 3, vasta- 3 |
114 | 157 | postposition | 7 | vasten 7, vastaan 4, asti 4 |
75 | 86 | abbreviation | 3 | jkn 3, kk 3, huom 2 |
Polysemy information
Rows in the table: 23
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 49 | 3 | 6 | 1,06 | 2,0 |
proper noun | 1937 | 335 | 697 | 1,16 | 2,08 |
suffix | 114 | 41 | 181 | 1,9 | 4,41 |
proverb | 44 | 4 | 8 | 1,08 | 2,0 |
phrase | 322 | 26 | 58 | 1,09 | 2,23 |
numeral | 198 | 4 | 9 | 1,02 | 2,25 |
symbol | 3 | 0 | 0 | 1,0 | -1,0 |
adverb | 2185 | 260 | 604 | 1,14 | 2,32 |
conjunction | 48 | 8 | 21 | 1,23 | 2,62 |
contraction | 37 | 0 | 0 | 1,0 | -1,0 |
letter | 34 | 0 | 0 | 1,0 | -1,0 |
pronoun | 94 | 54 | 136 | 1,55 | 2,52 |
acronym | 4 | 0 | 0 | 1,0 | -1,0 |
particle | 1 | 10 | 38 | 3,55 | 3,8 |
verb | 9455 | 1630 | 4410 | 1,25 | 2,71 |
adjective | 5741 | 896 | 2175 | 1,19 | 2,43 |
noun | 36961 | 4642 | 11280 | 1,16 | 2,43 |
idiom | 14 | 0 | 0 | 1,0 | -1,0 |
preposition | 13 | 3 | 6 | 1,19 | 2,0 |
interjection | 236 | 17 | 38 | 1,08 | 2,24 |
prefix | 161 | 16 | 38 | 1,12 | 2,38 |
postposition | 92 | 22 | 65 | 1,38 | 2,95 |
abbreviation | 66 | 9 | 20 | 1,15 | 2,22 |
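
The two tables for a language appear to describe the same counts from different angles: "Unique Strings" matches monosemous plus polysemous words, and "Total Word-Sense Pairs" matches monosemous words plus polysemous senses. The sketch below checks that relation on a few Finnish rows. It is an observation about the published figures, not a documented property of the database, and `totals_from_polysemy` is a hypothetical helper name.

```python
def totals_from_polysemy(mono, poly_words, poly_senses):
    """Derive (unique strings, total word-sense pairs) from a polysemy row."""
    unique_strings = mono + poly_words
    word_sense_pairs = mono + poly_senses
    return unique_strings, word_sense_pairs


# (POS, unique strings, word-sense pairs, mono, poly words, poly senses),
# copied from the two Finnish tables above.
rows = [
    ("initialism", 52, 55, 49, 3, 6),
    ("suffix", 155, 295, 114, 41, 181),
    ("particle", 11, 39, 1, 10, 38),
    ("noun", 41603, 48241, 36961, 4642, 11280),
]

for pos, unique, pairs, mono, poly_words, poly_senses in rows:
    ok = totals_from_polysemy(mono, poly_words, poly_senses) == (unique, pairs)
    print(pos, "consistent" if ok else "mismatch")
```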
Ukrainian entries
Number of words with unknown POS: 1
Number of words and senses
Rows in the table: 19
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
1 | 1 | initialism | 1 | НКАУ 1 |
228 | 240 | proper noun | 3 | Русь 3, Борис 2, Брест 2 |
8 | 10 | phrase | 3 | будь ласка 3, у мене є питання 1, кожному своє 1 |
36 | 36 | numeral | 1 | один 1, вісім 1, вісімдесят 1 |
1 | 2 | symbol | 2 | ’ 2 |
35 | 54 | adverb | 11 | навмисно 11, дальше 4, зараз 3 |
10 | 10 | conjunction | 1 | а 1, і 1, й 1 |
34 | 51 | pronoun | 9 | себе 9, ніхто 2, своє 2 |
10 | 10 | letter | 1 | а 1, й 1, є 1 |
1 | 1 | acronym | 1 | ЗМІ 1 |
146 | 189 | adjective | 4 | гарний 4, минулий 4, м'який 4 |
4 | 5 | particle | 2 | чи 2, і 1, не 1 |
87 | 128 | verb | 6 | припадати 6, іти 4, бувати 4 |
1787 | 2327 | noun | 8 | міць 8, дід 6, керівник 6 |
13 | 14 | determiner | 2 | чий 2, четверо 1, ваш 1 |
5 | 6 | preposition | 2 | серед 2, під 1, для 1 |
1 | 1 | prefix | 1 | авіа- 1 |
16 | 18 | interjection | 3 | гей 3, а 1, ах 1 |
1 | 1 | abbreviation | 1 | КПРС 1 |
Polysemy information
Rows in the table: 19
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 1 | 0 | 0 | 1,0 | -1,0 |
proper noun | 217 | 11 | 23 | 1,05 | 2,09 |
phrase | 7 | 1 | 3 | 1,25 | 3,0 |
numeral | 36 | 0 | 0 | 1,0 | -1,0 |
symbol | 0 | 1 | 2 | 2,0 | 2,0 |
adverb | 28 | 7 | 26 | 1,54 | 3,71 |
conjunction | 10 | 0 | 0 | 1,0 | -1,0 |
pronoun | 24 | 10 | 27 | 1,5 | 2,7 |
letter | 10 | 0 | 0 | 1,0 | -1,0 |
acronym | 1 | 0 | 0 | 1,0 | -1,0 |
adjective | 117 | 29 | 72 | 1,29 | 2,48 |
particle | 3 | 1 | 2 | 1,25 | 2,0 |
verb | 64 | 23 | 64 | 1,47 | 2,78 |
noun | 1451 | 336 | 876 | 1,3 | 2,61 |
determiner | 12 | 1 | 2 | 1,08 | 2,0 |
preposition | 4 | 1 | 2 | 1,2 | 2,0 |
prefix | 1 | 0 | 0 | 1,0 | -1,0 |
interjection | 15 | 1 | 3 | 1,12 | 3,0 |
abbreviation | 1 | 0 | 0 | 1,0 | -1,0 |
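
To reuse the polysemy rows programmatically, note that they are pipe-delimited and write decimals with a comma. The following is a minimal Python sketch for reading one such row. The function name `parse_polysemy_row` and the dictionary keys are hypothetical, and the sketch assumes the six-column layout of the polysemy tables on this page.

```python
def parse_polysemy_row(line):
    """Split one pipe-delimited polysemy row into typed fields."""
    cells = [cell.strip() for cell in line.split("|")]
    cells = [cell for cell in cells if cell]  # drop the empty trailing cell
    pos, mono, poly_words, poly_senses, avg_incl, avg_excl = cells
    return {
        "pos": pos,
        "mono": int(mono),
        "poly_words": int(poly_words),
        "poly_senses": int(poly_senses),
        # The page writes decimals with a comma, e.g. "1,05".
        "avg_incl": float(avg_incl.replace(",", ".")),
        "avg_excl": float(avg_excl.replace(",", ".")),
    }


row = parse_polysemy_row("proper noun | 217 | 11 | 23 | 1,05 | 2,09 |")
print(row["pos"], row["avg_incl"], row["avg_excl"])
```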
French entries
Number of words with unknown POS: 336
Number of words and senses
Rows in the table: 27
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
116 | 123 | initialism | 3 | PQ 3, IA 2, CPM 2 |
2381 | 2736 | proper noun | 5 | Atlas 5, Mérida 4, Katanga 4 |
172 | 208 | suffix | 6 | -is 6, -ant 3, -aud 3 |
8 | 8 | symbol | 1 | a 1, Go 1, Mo 1 |
2819 | 3201 | adverb | 8 | grossièrement 8, singulièrement 6, carrément 5 |
72 | 86 | conjunction | 5 | que 5, comme 3, afin 2 |
21 | 21 | letter | 1 | a 1, x 1, S 1 |
26101 | 33173 | noun | 18 | tampon 18, verre 14, masse 11 |
10 | 20 | article | 5 | des 5, de la 2, les 2 |
7 | 9 | idiom | 2 | et ainsi de suite 2, coude à coude 2, il n'y a pas mort d'homme 1 |
231 | 257 | interjection | 3 | beuh 3, tiens 3, ouais 3 |
1 | 1 | affix | 1 | -un- 1 |
94 | 101 | abbreviation | 5 | AP 5, cie 2, g 2 |
2 | 2 | infix | 1 | -iss- 1, t 1 |
57 | 60 | proverb | 2 | à la guerre comme à la guerre 2, l'habit ne fait pas le moine 2, les chiens aboient, la caravane passe 2 |
217 | 250 | phrase | 4 | dans la foulée 4, huis clos 4, avant la lettre 2 |
49 | 49 | numeral | 1 | billion 1, trillion 1, deux 1 |
64 | 67 | contraction | 2 | du 2, c'est 2, t'es 2 |
132 | 191 | pronoun | 7 | se 7, vous 7, y 3 |
6058 | 9802 | verb | 31 | passer 31, pogner 14, relever 12 |
9943 | 11579 | adjective | 8 | grossier 8, chaleureux 8, fini 7 |
11 | 15 | particle | 4 | ne 4, est-ce que 2, genre 1 |
1 | 1 | acronym | 1 | PEB 1 |
7 | 9 | prepositional phrase | 2 | en tête 2, à gauche 2, de guingois 1 |
19 | 19 | determiner | 1 | son 1, ton 1, ma 1 |
109 | 190 | preposition | 14 | à 14, sur 5, dans 4 |
157 | 164 | prefix | 2 | beau- 2, dys- 2, photo- 2 |
Polysemy information
Rows in the table: 27
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 110 | 6 | 13 | 1,06 | 2,17 |
proper noun | 2066 | 315 | 670 | 1,15 | 2,13 |
suffix | 148 | 24 | 60 | 1,21 | 2,5 |
symbol | 8 | 0 | 0 | 1,0 | -1,0 |
adverb | 2515 | 304 | 686 | 1,14 | 2,26 |
conjunction | 63 | 9 | 23 | 1,19 | 2,56 |
letter | 21 | 0 | 0 | 1,0 | -1,0 |
noun | 21353 | 4748 | 11820 | 1,27 | 2,49 |
article | 5 | 5 | 15 | 2,0 | 3,0 |
idiom | 5 | 2 | 4 | 1,29 | 2,0 |
interjection | 208 | 23 | 49 | 1,11 | 2,13 |
affix | 1 | 0 | 0 | 1,0 | -1,0 |
abbreviation | 90 | 4 | 11 | 1,07 | 2,75 |
infix | 2 | 0 | 0 | 1,0 | -1,0 |
proverb | 54 | 3 | 6 | 1,05 | 2,0 |
phrase | 189 | 28 | 61 | 1,15 | 2,18 |
numeral | 49 | 0 | 0 | 1,0 | -1,0 |
contraction | 61 | 3 | 6 | 1,05 | 2,0 |
pronoun | 89 | 43 | 102 | 1,45 | 2,37 |
verb | 3935 | 2123 | 5867 | 1,62 | 2,76 |
adjective | 8639 | 1304 | 2940 | 1,16 | 2,25 |
particle | 9 | 2 | 6 | 1,36 | 3,0 |
acronym | 1 | 0 | 0 | 1,0 | -1,0 |
prepositional phrase | 5 | 2 | 4 | 1,29 | 2,0 |
determiner | 19 | 0 | 0 | 1,0 | -1,0 |
preposition | 78 | 31 | 112 | 1,74 | 3,61 |
prefix | 150 | 7 | 14 | 1,04 | 2,0 |
German entries
Number of words with unknown POS: 48
Number of words and senses
Rows in the table: 29
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
82 | 88 | initialism | 3 | BVG 3, VfB 3, PM 2 |
2654 | 2970 | proper noun | 34 | Neustadt 34, Atlas 5, Bosch 4 |
87 | 110 | suffix | 6 | -e 6, -en 5, -ig 3 |
9 | 10 | symbol | 2 | – 2, J 1, ☉ 1 |
984 | 1253 | adverb | 6 | momentan 6, gar nicht 5, rechts 5 |
65 | 81 | conjunction | 4 | als 4, bis 2, infolgedessen 2 |
4 | 4 | interfix | 1 | -en- 1, -es- 1, -n- 1 |
29 | 29 | letter | 1 | I 1, a 1, C 1 |
31690 | 36279 | noun | 24 | H. 24, Minderung 15, c. 15 |
28 | 37 | article | 2 | 'n 2, den 2, n 2 |
27 | 27 | idiom | 1 | arm wie eine Kirchenmaus 1, Kopf und Kragen 1, Reiz und Reaktion 1 |
140 | 155 | interjection | 5 | ach 5, bitte 3, servus 3 |
206 | 224 | abbreviation | 3 | OP 3, Fr. 3, KG 3 |
151 | 162 | phrase | 3 | Holzweg 3, Das Reich 2, Kraut und Rüben 2 |
35 | 35 | proverb | 1 | Arbeit macht frei 1, Blut ist dicker als Wasser 1, Geduld ist eine Tugend 1 |
1 | 1 | circumfix | 1 | ge- -t 1 |
155 | 161 | numeral | 4 | einer 4, einem 2, einen 2 |
22 | 22 | contraction | 1 | am 1, ans 1, aufs 1 |
147 | 263 | pronoun | 26 | anderen 26, deiner 5, meiner 5 |
6091 | 9961 | adjective | 27 | gelsten 27, hässlichsten 27, langsamsten 27 |
4276 | 6387 | verb | 22 | abkommen 22, brennen 12, lösen 9 |
6 | 7 | particle | 2 | oder 2, zu 1, dritt 1 |
8 | 10 | acronym | 2 | BND 2, DAU 2, CDU 1 |
1 | 1 | prepositional phrase | 1 | von der Stange 1 |
3 | 5 | participle | 3 | verzaubert 3, gewesen 1 |
11 | 14 | determiner | 2 | sein 2, ihr 2, welche 2 |
91 | 146 | preposition | 8 | bei 8, nach 5, durch 5 |
97 | 123 | prefix | 4 | ab- 4, um- 4, nach- 4 |
7 | 8 | postposition | 2 | durch 2, nach 1, entlang 1 |
Polysemy information
Rows in the table: 29
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 78 | 4 | 10 | 1,07 | 2,5 |
proper noun | 2398 | 256 | 572 | 1,12 | 2,23 |
suffix | 74 | 13 | 36 | 1,26 | 2,77 |
symbol | 8 | 1 | 2 | 1,11 | 2,0 |
adverb | 784 | 200 | 469 | 1,27 | 2,35 |
conjunction | 51 | 14 | 30 | 1,25 | 2,14 |
interfix | 4 | 0 | 0 | 1,0 | -1,0 |
letter | 29 | 0 | 0 | 1,0 | -1,0 |
noun | 28484 | 3206 | 7795 | 1,14 | 2,43 |
article | 19 | 9 | 18 | 1,32 | 2,0 |
idiom | 27 | 0 | 0 | 1,0 | -1,0 |
interjection | 130 | 10 | 25 | 1,11 | 2,5 |
abbreviation | 191 | 15 | 33 | 1,09 | 2,2 |
phrase | 141 | 10 | 21 | 1,07 | 2,1 |
proverb | 35 | 0 | 0 | 1,0 | -1,0 |
circumfix | 1 | 0 | 0 | 1,0 | -1,0 |
numeral | 151 | 4 | 10 | 1,04 | 2,5 |
contraction | 22 | 0 | 0 | 1,0 | -1,0 |
pronoun | 92 | 55 | 171 | 1,79 | 3,11 |
adjective | 4836 | 1255 | 5125 | 1,64 | 4,08 |
verb | 3007 | 1269 | 3380 | 1,49 | 2,66 |
particle | 5 | 1 | 2 | 1,17 | 2,0 |
acronym | 6 | 2 | 4 | 1,25 | 2,0 |
prepositional phrase | 1 | 0 | 0 | 1,0 | -1,0 |
participle | 2 | 1 | 3 | 1,67 | 3,0 |
determiner | 8 | 3 | 6 | 1,27 | 2,0 |
preposition | 67 | 24 | 79 | 1,6 | 3,29 |
prefix | 80 | 17 | 43 | 1,27 | 2,53 |
postposition | 6 | 1 | 2 | 1,14 | 2,0 |
Serbian entries
Tatar entries
Number of words with unknown POS: null
Number of words and senses
Rows in the table: 11
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
1 | 1 | letter | 1 | ү 1 |
1 | 1 | pronoun | 1 | sin 1 |
5 | 5 | verb | 1 | абын 1, сабир 1, сабыр 1 |
58 | 59 | adjective | 2 | авыр 2, яшь 1, фәрештә 1 |
55 | 56 | proper noun | 2 | Neptun 2, Austria 1, Bangladesh 1 |
366 | 392 | noun | 3 | çirek 3, дәвер 3, заман 3 |
2 | 3 | suffix | 2 | -ле 2, -лы 1 |
60 | 61 | numeral | 2 | nol 2, өч йөз 1, öç yöz 1 |
3 | 5 | interjection | 3 | әйдә 3, әйе 1 |
11 | 13 | adverb | 2 | зерә 2, әрмәнчә 2, әкрен 1 |
2 | 2 | conjunction | 1 | белән 1, билан 1 |
Polysemy information
Rows in the table: 11
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
letter | 1 | 0 | 0 | 1,0 | -1,0 |
pronoun | 1 | 0 | 0 | 1,0 | -1,0 |
verb | 5 | 0 | 0 | 1,0 | -1,0 |
adjective | 57 | 1 | 2 | 1,02 | 2,0 |
proper noun | 54 | 1 | 2 | 1,02 | 2,0 |
noun | 343 | 23 | 49 | 1,07 | 2,13 |
suffix | 1 | 1 | 2 | 1,5 | 2,0 |
numeral | 59 | 1 | 2 | 1,02 | 2,0 |
interjection | 2 | 1 | 3 | 1,67 | 3,0 |
adverb | 9 | 2 | 4 | 1,18 | 2,0 |
conjunction | 2 | 0 | 0 | 1,0 | -1,0 |
Esperanto entries
Number of words with unknown POS: 118
Number of words and senses
Rows in the table: 21
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
4 | 4 | initialism | 1 | UAE 1, MEP 1, IFA 1 |
838 | 869 | proper noun | 3 | Rioĥo 3, Venuso 2, Britio 2 |
105 | 132 | suffix | 4 | -iĝi 4, -ad- 3, -ant- 3 |
49 | 50 | phrase | 2 | mi petas 2, mi malsatas 1, mi soifas 1 |
114 | 114 | numeral | 1 | dek 1, tricent 1, cent 1 |
749 | 817 | adverb | 3 | nepre 3, tiel 3, okaze 3 |
23 | 24 | conjunction | 2 | ĉar 2, plus 1, kaj 1 |
67 | 67 | letter | 1 | i 1, I 1, a 1 |
62 | 71 | pronoun | 3 | kiun 3, vi 2, ĉiuj 2 |
11 | 16 | particle | 3 | ne 3, ĉi 3, ĉu 2 |
1821 | 1966 | adjective | 4 | morta 4, feliĉa 4, sama 3 |
1760 | 1973 | verb | 4 | pagi 4, valori 4, zorgi 4 |
3 | 3 | expression | 1 | aŭ 1, ĉu 1, ĉu 1 |
7097 | 7603 | noun | 4 | amaso 4, baskulo 4, batilo 4 |
8 | 8 | correlative | 1 | nenial 1, neniel 1, nenies 1 |
2 | 2 | article | 1 | la 1, l' 1 |
39 | 43 | determiner | 3 | kelkaj 3, ĉia 3, ties 1 |
51 | 75 | preposition | 4 | kun 4, dum 4, antaŭ 3 |
43 | 45 | interjection | 2 | he 2, gesinjoroj 2, ek 1 |
23 | 29 | prefix | 3 | ge- 3, pra- 3, eks- 2 |
6 | 6 | abbreviation | 1 | 2-a 1, V 1, n.b. 1 |
Polysemy information
Rows in the table: 21
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 4 | 0 | 0 | 1,0 | -1,0 |
proper noun | 808 | 30 | 61 | 1,04 | 2,03 |
suffix | 86 | 19 | 46 | 1,26 | 2,42 |
phrase | 48 | 1 | 2 | 1,02 | 2,0 |
numeral | 114 | 0 | 0 | 1,0 | -1,0 |
adverb | 687 | 62 | 130 | 1,09 | 2,1 |
conjunction | 22 | 1 | 2 | 1,04 | 2,0 |
letter | 67 | 0 | 0 | 1,0 | -1,0 |
pronoun | 54 | 8 | 17 | 1,15 | 2,12 |
particle | 8 | 3 | 8 | 1,45 | 2,67 |
adjective | 1696 | 125 | 270 | 1,08 | 2,16 |
verb | 1571 | 189 | 402 | 1,12 | 2,13 |
expression | 3 | 0 | 0 | 1,0 | -1,0 |
noun | 6643 | 454 | 960 | 1,07 | 2,11 |
correlative | 8 | 0 | 0 | 1,0 | -1,0 |
article | 2 | 0 | 0 | 1,0 | -1,0 |
determiner | 37 | 2 | 6 | 1,1 | 3,0 |
preposition | 34 | 17 | 41 | 1,47 | 2,41 |
interjection | 41 | 2 | 4 | 1,05 | 2,0 |
prefix | 19 | 4 | 10 | 1,26 | 2,5 |
abbreviation | 6 | 0 | 0 | 1,0 | -1,0 |
Latin entries
Number of words with unknown POS: 38
Number of words and senses
Rows in the table: 27
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
7 | 9 | initialism | 2 | RIP 2, IHS 2, SPD 1 |
899 | 1010 | proper noun | 5 | peniculus 5, Smalcius 4, Uranus 3 |
89 | 106 | suffix | 3 | -ve 3, -icus 3, -brum 3 |
5 | 5 | symbol | 1 | HS 1, Ↄ 1, Ⅎ 1 |
1117 | 1682 | adverb | 12 | etiam 12, ultro 7, continenter 5 |
64 | 140 | conjunction | 29 | an 29, nam 5, autem 4 |
1 | 1 | interfix | 1 | -o- 1 |
17 | 24 | letter | 5 | ꝑ 5, C 1, Q 1 |
9143 | 14845 | noun | 16 | manus 16, terra 9, caput 9 |
6 | 6 | idiom | 1 | a lacte cunisque 1, ab acia et acu 1, semel pro semper 1 |
51 | 57 | interjection | 3 | en 3, o 2, hui 2 |
40 | 45 | abbreviation | 2 | R 2, S. 2, p 2 |
1 | 2 | infix | 2 | -n- 2 |
30 | 31 | proverb | 2 | obsta principiis 2, albus an ater sit 1, asinus in tegulis 1 |
154 | 181 | phrase | 3 | Socratici viri 3, gratia gratiam parit 3, in medio 3 |
103 | 111 | numeral | 3 | secundum 3, sexagesima 3, sexagesimum 3 |
8 | 8 | contraction | 1 | vin 1, viden 1, eapse 1 |
107 | 141 | pronoun | 3 | sui 3, ea 3, memet 3 |
5417 | 11569 | verb | 30 | agar 30, iacuerint 28, iacueris 28 |
7264 | 10043 | adjective | 10 | raptus 10, residuus 8, socius 6 |
3 | 5 | particle | 2 | in- 2, -ne 2, non 1 |
2 | 2 | prepositional phrase | 1 | a posteriori 1, sub clave 1 |
7 | 15 | gerund | 4 | laborandum 4, definiendum 3, sufflaminandum 2 |
7684 | 11018 | participle | 14 | iacens 14, iaciturus 14, productus 9 |
8 | 12 | determiner | 3 | ille 3, ambo 2, idem 1 |
60 | 120 | preposition | 8 | pro 8, erga 7, iuxta 4 |
28 | 38 | prefix | 6 | a- 6, se- 2, co- 2 |
Polysemy information
Rows in the table: 27
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 5 | 2 | 4 | 1,29 | 2,0 |
proper noun | 808 | 91 | 202 | 1,12 | 2,22 |
suffix | 77 | 12 | 29 | 1,19 | 2,42 |
symbol | 5 | 0 | 0 | 1,0 | -1,0 |
adverb | 725 | 392 | 957 | 1,51 | 2,44 |
conjunction | 34 | 30 | 106 | 2,19 | 3,53 |
interfix | 1 | 0 | 0 | 1,0 | -1,0 |
letter | 15 | 2 | 9 | 1,41 | 4,5 |
noun | 5641 | 3502 | 9204 | 1,62 | 2,63 |
idiom | 6 | 0 | 0 | 1,0 | -1,0 |
interjection | 46 | 5 | 11 | 1,12 | 2,2 |
abbreviation | 35 | 5 | 10 | 1,12 | 2,0 |
infix | 0 | 1 | 2 | 2,0 | 2,0 |
proverb | 29 | 1 | 2 | 1,03 | 2,0 |
phrase | 133 | 21 | 48 | 1,18 | 2,29 |
numeral | 99 | 4 | 12 | 1,08 | 3,0 |
contraction | 8 | 0 | 0 | 1,0 | -1,0 |
pronoun | 77 | 30 | 64 | 1,32 | 2,13 |
verb | 2453 | 2964 | 9116 | 2,14 | 3,08 |
adjective | 5359 | 1905 | 4684 | 1,38 | 2,46 |
particle | 1 | 2 | 4 | 1,67 | 2,0 |
prepositional phrase | 2 | 0 | 0 | 1,0 | -1,0 |
gerund | 3 | 4 | 12 | 2,14 | 3,0 |
participle | 5698 | 1986 | 5320 | 1,43 | 2,68 |
determiner | 5 | 3 | 7 | 1,5 | 2,33 |
preposition | 35 | 25 | 85 | 2,0 | 3,4 |
prefix | 23 | 5 | 15 | 1,36 | 3,0 |
Italian entries
Number of words with unknown POS: 8
Number of words and senses
Rows in the table: 23
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
41 | 47 | initialism | 3 | PG 3, GdF 2, P.G. 2 |
3762 | 4231 | proper noun | 4 | Virgilio 4, Annunziata 3, Bodoni 3 |
321 | 384 | suffix | 5 | -ata 5, -ite 5, -i 4 |
30 | 35 | proverb | 3 | non piangere sul latte versato 3, più siamo meglio è 3, meglio un uovo oggi che una gallina domani 1 |
81 | 87 | phrase | 2 | che figata 2, detto, fatto 2, non c'è che dire 2 |
2 | 2 | numeral | 1 | uno 1, otto 1 |
3 | 3 | symbol | 1 | × 1, Z 1, W 1 |
3918 | 4844 | adverb | 10 | immediatamente 10, subito 8, di basso rango 5 |
128 | 159 | conjunction | 3 | se 3, che 3, se non 3 |
62 | 64 | contraction | 2 | dal 2, dai 2, della 1 |
158 | 220 | pronoun | 4 | ci 4, qualcuno 4, il quale 4 |
27 | 27 | letter | 1 | I 1, C 1, X 1 |
30185 | 36855 | verb | 12 | abbattere 12, sbattere 9, caricare 9 |
22194 | 26376 | adjective | 10 | minore 10, brado 8, grossolano 8 |
7 | 7 | prepositional phrase | 1 | a mano armata 1, a rischio 1, in ballo 1 |
50216 | 63158 | noun | 12 | titolo 12, tiro 9, fascio 9 |
11 | 11 | article | 1 | i 1, le 1, lo 1 |
6 | 6 | idiom | 1 | andare coi piedi di piombo 1, la quiete prima della tempesta 1, campa cavallo 1 |
309 | 375 | preposition | 9 | di 9, per 6, su 5 |
193 | 251 | interjection | 4 | buon giorno 4, accidenti 3, d'accordo 3 |
417 | 458 | prefix | 4 | filo- 4, acro- 3, ana- 3 |
68 | 72 | abbreviation | 2 | AC 2, PS 2, S. 2 |
1 | 1 | affix | 1 | -un- 1 |
Polysemy information
Rows in the table: 23
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 36 | 5 | 11 | 1,15 | 2,2 |
proper noun | 3323 | 439 | 908 | 1,12 | 2,07 |
suffix | 273 | 48 | 111 | 1,2 | 2,31 |
proverb | 27 | 3 | 8 | 1,17 | 2,67 |
phrase | 75 | 6 | 12 | 1,07 | 2,0 |
numeral | 2 | 0 | 0 | 1,0 | -1,0 |
symbol | 3 | 0 | 0 | 1,0 | -1,0 |
adverb | 3186 | 732 | 1658 | 1,24 | 2,27 |
conjunction | 100 | 28 | 59 | 1,24 | 2,11 |
contraction | 60 | 2 | 4 | 1,03 | 2,0 |
pronoun | 110 | 48 | 110 | 1,39 | 2,29 |
letter | 27 | 0 | 0 | 1,0 | -1,0 |
verb | 25710 | 4475 | 11145 | 1,22 | 2,49 |
adjective | 18944 | 3250 | 7432 | 1,19 | 2,29 |
prepositional phrase | 7 | 0 | 0 | 1,0 | -1,0 |
noun | 40782 | 9434 | 22376 | 1,26 | 2,37 |
article | 11 | 0 | 0 | 1,0 | -1,0 |
idiom | 6 | 0 | 0 | 1,0 | -1,0 |
preposition | 276 | 33 | 99 | 1,21 | 3,0 |
interjection | 148 | 45 | 103 | 1,3 | 2,29 |
prefix | 384 | 33 | 74 | 1,1 | 2,24 |
abbreviation | 64 | 4 | 8 | 1,06 | 2,0 |
affix | 1 | 0 | 0 | 1,0 | -1,0 |
Swedish entries
Number of words with unknown POS: 11
Number of words and senses
Rows in the table: 26
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
20 | 22 | initialism | 2 | HD 2, OS 2, SAP 1 |
1769 | 1881 | proper noun | 3 | Hanna 3, Europa 3, Ester 3 |
80 | 110 | suffix | 5 | -t 5, -e 5, -en 4 |
711 | 820 | adverb | 7 | tillgodo 7, precis 5, så 5 |
46 | 57 | conjunction | 6 | och 6, emellertid 3, då 2 |
1 | 1 | interfix | 1 | -s- 1 |
11 | 11 | letter | 1 | a 1, o 1, w 1 |
9814 | 12429 | noun | 11 | bryt 11, rot 9, tur 9 |
7 | 7 | article | 1 | de 1, en 1, den 1 |
36 | 38 | idiom | 2 | måla fan på väggen 2, tårta på tårta 2, det är rena grekiskan 1 |
108 | 121 | interjection | 3 | god fortsättning 3, varsågod 3, ha 2 |
288 | 311 | abbreviation | 5 | BK 5, f. 3, tekn. 3 |
89 | 90 | phrase | 2 | sida upp och sida ned 2, glad påsk 1, gott nytt år 1 |
28 | 28 | proverb | 1 | Gå inte över ån efter vatten 1, Rom byggdes inte på en dag 1, nära skjuter ingen hare 1 |
156 | 157 | numeral | 2 | artonhundra 2, tionde 1, tolfte 1 |
1 | 1 | contraction | 1 | venne 1 |
135 | 159 | pronoun | 4 | vilken 4, det 3, som 2 |
2541 | 3723 | verb | 23 | gå 23, hålla 14, lägga 13 |
2226 | 2712 | adjective | 8 | hård 8, öppen 8, lös 7 |
10 | 10 | acronym | 1 | ABF 1, EU 1, DI 1 |
2 | 2 | particle | 1 | om 1, att 1 |
1 | 1 | prepositional phrase | 1 | i mannaminne 1 |
11 | 12 | determiner | 2 | var 2, de där 1, dessa 1 |
79 | 123 | preposition | 6 | om 6, på 5, runt 5 |
44 | 52 | prefix | 3 | över- 3, bi- 2, engångs- 2 |
1 | 1 | postposition | 1 | ut 1 |
Polysemy information
Rows in the table: 26
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 18 | 2 | 4 | 1,1 | 2,0 |
proper noun | 1665 | 104 | 216 | 1,06 | 2,08 |
suffix | 63 | 17 | 47 | 1,38 | 2,76 |
adverb | 638 | 73 | 182 | 1,15 | 2,49 |
conjunction | 40 | 6 | 17 | 1,24 | 2,83 |
interfix | 1 | 0 | 0 | 1,0 | -1,0 |
letter | 11 | 0 | 0 | 1,0 | -1,0 |
noun | 8173 | 1641 | 4256 | 1,27 | 2,59 |
article | 7 | 0 | 0 | 1,0 | -1,0 |
idiom | 34 | 2 | 4 | 1,06 | 2,0 |
interjection | 97 | 11 | 24 | 1,12 | 2,18 |
abbreviation | 271 | 17 | 40 | 1,08 | 2,35 |
phrase | 88 | 1 | 2 | 1,01 | 2,0 |
proverb | 28 | 0 | 0 | 1,0 | -1,0 |
numeral | 155 | 1 | 2 | 1,01 | 2,0 |
contraction | 1 | 0 | 0 | 1,0 | -1,0 |
pronoun | 115 | 20 | 44 | 1,18 | 2,2 |
verb | 1853 | 688 | 1870 | 1,47 | 2,72 |
adjective | 1898 | 328 | 814 | 1,22 | 2,48 |
acronym | 10 | 0 | 0 | 1,0 | -1,0 |
particle | 2 | 0 | 0 | 1,0 | -1,0 |
prepositional phrase | 1 | 0 | 0 | 1,0 | -1,0 |
determiner | 10 | 1 | 2 | 1,09 | 2,0 |
preposition | 60 | 19 | 63 | 1,56 | 3,32 |
prefix | 37 | 7 | 15 | 1,18 | 2,14 |
postposition | 1 | 0 | 0 | 1,0 | -1,0 |
Spanish entries
Number of words with unknown POS: 245
Number of words and senses
Rows in the table: 25
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
52 | 53 | initialism | 2 | RAE 2, ARN 1, AXJ 1 |
1848 | 2075 | proper noun | 5 | Amazonas 5, Mérida 4, Jacobo 4 |
189 | 270 | suffix | 12 | -ón 12, -ón 7, -azo 4 |
10 | 11 | symbol | 2 | @ 2, ∃ 1, ⸘ 1 |
1343 | 1521 | adverb | 9 | ya 9, encima 6, al revés 5 |
55 | 80 | conjunction | 6 | que 6, no obstante 4, mientras 4 |
62 | 62 | letter | 1 | i 1, I 1, a 1 |
20475 | 26618 | noun | 15 | medio 15, casco 13, tope 10 |
8 | 8 | article | 1 | lo 1, la 1, el 1 |
4 | 5 | idiom | 2 | en vilo 2, no se ganó Zamora en una hora 1, quisqui 1 |
197 | 242 | interjection | 6 | ojalá 6, hombre 3, bueno 3 |
93 | 98 | abbreviation | 3 | SL 3, dpto 2, s/n 2 |
1 | 1 | affix | 1 | -un- 1 |
193 | 207 | phrase | 4 | a la buena de Dios 4, al fin y al cabo 4, de pe a pa 4 |
58 | 60 | proverb | 2 | el hábito no hace al monje 2, del dicho al hecho hay mucho trecho 2, a caballo regalado no le mires el diente 1 |
59 | 59 | numeral | 1 | trescientos 1, quince 1, once 1 |
7 | 8 | contraction | 2 | desdel 2, na 1, del 1 |
127 | 167 | pronoun | 4 | se 4, les 4, me 3 |
6537 | 9155 | verb | 17 | dar 17, sacar 15, tirar 14 |
6809 | 8379 | adjective | 8 | duro 8, verde 7, pegado 6 |
9 | 11 | acronym | 2 | AFI 2, PAN 2, OTAN 1 |
1 | 2 | participle | 2 | amado 2 |
4 | 4 | determiner | 1 | uno 1, cada 1, cierto 1 |
57 | 86 | preposition | 5 | en 5, por 5, según 3 |
110 | 113 | prefix | 2 | re- 2, bis- 2, ferro- 2 |
Polysemy information
Rows in the table: 25
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
initialism | 51 | 1 | 2 | 1,02 | 2,0 |
proper noun | 1659 | 189 | 416 | 1,12 | 2,2 |
suffix | 144 | 45 | 126 | 1,43 | 2,8 |
symbol | 9 | 1 | 2 | 1,1 | 2,0 |
adverb | 1211 | 132 | 310 | 1,13 | 2,35 |
conjunction | 41 | 14 | 39 | 1,45 | 2,79 |
letter | 62 | 0 | 0 | 1,0 | -1,0 |
noun | 16493 | 3982 | 10125 | 1,3 | 2,54 |
article | 8 | 0 | 0 | 1,0 | -1,0 |
idiom | 3 | 1 | 2 | 1,25 | 2,0 |
interjection | 163 | 34 | 79 | 1,23 | 2,32 |
abbreviation | 89 | 4 | 9 | 1,05 | 2,25 |
affix | 1 | 0 | 0 | 1,0 | -1,0 |
phrase | 185 | 8 | 22 | 1,07 | 2,75 |
proverb | 56 | 2 | 4 | 1,03 | 2,0 |
numeral | 59 | 0 | 0 | 1,0 | -1,0 |
contraction | 6 | 1 | 2 | 1,14 | 2,0 |
pronoun | 102 | 25 | 65 | 1,31 | 2,6 |
verb | 5045 | 1492 | 4110 | 1,4 | 2,75 |
adjective | 5675 | 1134 | 2704 | 1,23 | 2,38 |
acronym | 7 | 2 | 4 | 1,22 | 2,0 |
participle | 0 | 1 | 2 | 2,0 | 2,0 |
determiner | 4 | 0 | 0 | 1,0 | -1,0 |
preposition | 40 | 17 | 46 | 1,51 | 2,71 |
prefix | 107 | 3 | 6 | 1,03 | 2,0 |
Mandarin entries
Number of words with unknown POS: 10629
Number of words and senses
Rows in the table: 23
Unique Strings | Total Word-Sense Pairs | POS | Max Senses | Entry |
---|---|---|---|---|
4 | 4 | measure word | 1 | 遍 1, 杯 1, 盆 1 |
339 | 395 | proper noun | 4 | 姑苏 4, 姑蘇 4, 二里岗 3 |
23900 | 24110 | hanzi | 10 | 即 10, 场 7, 托 7 |
1 | 4 | suffix | 4 | 者 4 |
415 | 452 | proverb | 4 | 三十年河东,三十年河西 4, 三十年河東,三十年河西 4, 十年河东,十年河西 3 |
21 | 21 | phrase | 1 | PK就PK 1, 己立立人,己達達人 1, 己立立人,己达达人 1 |
21 | 26 | classifier | 3 | 組 3, 组 3, 条 1 |
3 | 3 | symbol | 1 | ; 1, { 1, ¥ 1 |
34 | 44 | adverb | 4 | 剛 4, 刚 4, 便 2 |
4 | 4 | conjunction | 1 | 似 1, 如 1, 併 1 |
25 | 25 | letter | 1 | ㄅ 1, ㄆ 1, ㄇ 1 |
11 | 18 | pronoun | 3 | 誰 3, 谁 3, 幾 3 |
163 | 239 | adjective | 4 | 洋洋 4, 美 3, 倭 3 |
17 | 24 | particle | 5 | 啊 5, 唄 2, 哩 2 |
184 | 320 | verb | 7 | 叫 7, 搞 7, 开 7 |
327 | 543 | noun | 12 | 帅 12, 烟 6, 點 6 |
3 | 3 | determiner | 1 | 更多 1, 誰家 1, 谁家 1 |
150 | 163 | idiom | 2 | 三下五除二 2, 东逃西窜 2, 周瑜打黃蓋 2 |
9 | 11 | preposition | 2 | 裏 2, 裡 2, 叫 1 |
1 | 1 | pinyin | 1 | Bìxiù 1 |
5 | 7 | prefix | 2 | 超 2, 新 2, 沒 1 |
21 | 22 | interjection | 2 | 哦 2, 555 1, 88 1 |
3 | 3 | postposition | 1 | 間 1, 以來 1, 间 1 |
Polysemy information
Rows in the table: 23
POS | Monosemous Words and Senses | Polysemous Words | Polysemous Senses | Average Polysemy Including Monosemous Words | Average Polysemy Excluding Monosemous Words |
---|---|---|---|---|---|
measure word | 4 | 0 | 0 | 1,0 | -1,0 |
proper noun | 297 | 42 | 98 | 1,17 | 2,33 |
hanzi | 23761 | 139 | 349 | 1,01 | 2,51 |
suffix | 0 | 1 | 4 | 4,0 | 4,0 |
proverb | 385 | 30 | 67 | 1,09 | 2,23 |
phrase | 21 | 0 | 0 | 1,0 | -1,0 |
classifier | 18 | 3 | 8 | 1,24 | 2,67 |
symbol | 3 | 0 | 0 | 1,0 | -1,0 |
adverb | 28 | 6 | 16 | 1,29 | 2,67 |
conjunction | 4 | 0 | 0 | 1,0 | -1,0 |
letter | 25 | 0 | 0 | 1,0 | -1,0 |
pronoun | 7 | 4 | 11 | 1,64 | 2,75 |
adjective | 110 | 53 | 129 | 1,47 | 2,43 |
particle | 13 | 4 | 11 | 1,41 | 2,75 |
verb | 116 | 68 | 204 | 1,74 | 3,0 |
noun | 217 | 110 | 326 | 1,66 | 2,96 |
determiner | 3 | 0 | 0 | 1,0 | -1,0 |
idiom | 137 | 13 | 26 | 1,09 | 2,0 |
preposition | 7 | 2 | 4 | 1,22 | 2,0 |
pinyin | 1 | 0 | 0 | 1,0 | -1,0 |
prefix | 3 | 2 | 4 | 1,4 | 2,0 |
interjection | 20 | 1 | 2 | 1,05 | 2,0 |
postposition | 3 | 0 | 0 | 1,0 | -1,0 |
References
- ^ This (or a more recent) database should be available at the wikokit project site; see the Download section on the page whinger.krc.karelia.ru.