Module:families/data/documentation
This module contains definitions for all language family codes on Wiktionary. The entries are listed by their family code, which is usually an ISO 639 code, but there are also some exceptional codes. These are listed on Wiktionary:Families; that page also explains how to create new exceptional codes.
This module must not be imported using require. Instead, it is imported like this:
local families = mw.loadData("Module:families/data")
This ensures that the data is loaded only once per page, rather than once for every module invocation, as would normally happen.
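The difference between loading once per page and loading once per invocation can be sketched in plain Lua. This is only an illustration under stated assumptions: it does not call the real Scribunto API, and the names load_data_once, fake_loader, page_cache and the sample entry are hypothetical stand-ins. The real mw.loadData also returns a read-only table, which this sketch does not model.

```lua
-- Hypothetical stand-in for a data module; it counts how often it is evaluated.
local load_count = 0
local function fake_loader()
	load_count = load_count + 1
	return { gmq = { canonicalName = "North Germanic" } }
end

-- Cache shared by every simulated invocation on the same page.
local page_cache = {}

-- Returns the cached table if present; otherwise evaluates the loader once.
local function load_data_once(title, loader)
	if page_cache[title] == nil then
		page_cache[title] = loader()
	end
	return page_cache[title]
end

-- Three simulated invocations on one page all receive the same table.
local first, last
for i = 1, 3 do
	last = load_data_once("Module:families/data", fake_loader)
	first = first or last
end
```

After the loop, the loader has been evaluated a single time and every invocation shares the same table, which is the saving that the loading mechanism above is meant to provide.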
Errors
The following errors were detected by Module:data consistency check:
- Norwegian Bokmål (nb) has Middle Norwegian (gmq-mno) set as an ancestor, but is not in the West Scandinavian languages (gmq-wes).
- Norwegian Bokmål (nb) has Danish (da) set as an ancestor, but is not in the East Scandinavian languages (gmq-eas).
- Caribbean Hindustani (hns) has Bhojpuri (bho) set as an ancestor, but is not in the Bihari languages (inc-bih).
- Caribbean Hindustani (hns) has Awadhi (awa) set as an ancestor, but is not in the Eastern Hindi languages (inc-hie).
- Proto-Central Togo (alv-gtm-pro) does not have the expected name "Proto-Ghana-Togo Mountain", even though it is the proto-language of the Ghana-Togo Mountain languages (alv-gtm).
- Proto-Arawa (auf-pro) does not have the expected name "Proto-Arauan", even though it is the proto-language of the Arauan languages (auf).
- Proto-Amuesha-Chamicuro (awd-amc-pro) has a proto-language code associated with the invalid code "awd-amc".
- Proto-Kampa (awd-kmp-pro) has a proto-language code associated with the invalid code "awd-kmp".
- Proto-Arawak (awd-pro) does not have the expected name "Proto-Arawakan", even though it is the proto-language of the Arawakan languages (awd).
- Proto-Paresi-Waura (awd-prw-pro) has a proto-language code associated with the invalid code "awd-prw".
- Proto-Ta-Arawak (awd-taa-pro) does not have the expected name "Proto-Ta-Arawakan", even though it is the proto-language of the Ta-Arawakan languages (awd-taa).
- Proto-Rukai (dru-pro) has a proto-language code associated with Rukai (dru), which is not a family.
- Proto-Basque (euq-pro) does not have the expected name "Proto-Vasconic", even though it is the proto-language of the Vasconic languages (euq).
- Proto-Norse (gmq-pro) does not have the expected name "Proto-North Germanic", even though it is the proto-language of the North Germanic languages (gmq).
- Proto-Kamta (inc-krn-pro) does not have the expected name "Proto-KRNB lects", even though it is the proto-language of the KRNB lects (inc-krn).
- Kelantan Peranakan Hokkien (mis-hkl) has its canonical name ("Kelantan Peranakan Hokkien") repeated in the table of aliases.
- Proto-Chumash (nai-chu-pro) does not have the expected name "Proto-Chumashan", even though it is the proto-language of the Chumashan languages (nai-chu).
- Proto-Maidun (nai-mdu-pro) does not have the expected name "Proto-Maiduan", even though it is the proto-language of the Maiduan languages (nai-mdu).
- Proto-Mixe-Zoque (nai-miz-pro) does not have the expected name "Proto-Mixe-Zoquean", even though it is the proto-language of the Mixe-Zoquean languages (nai-miz).
- Proto-Pomo (nai-pom-pro) does not have the expected name "Proto-Pomoan", even though it is the proto-language of the Pomoan languages (nai-pom).
- Proto-Mazatec (omq-maz-pro) does not have the expected name "Proto-Mazatecan", even though it is the proto-language of the Mazatecan languages (omq-maz).
- Proto-Ossetic (os-pro) has a proto-language code associated with Ossetian (os), which is not a family.
- Proto-North Sarawak (poz-swa-pro) does not have the expected name "Proto-North Sarawakan", even though it is the proto-language of the North Sarawakan languages (poz-swa).
- Proto-Salish (sal-pro) does not have the expected name "Proto-Salishan", even though it is the proto-language of the Salishan languages (sal).
- Proto-Puroik (sit-khp-pro) has a proto-language code associated with the invalid code "sit-khp".
- Proto-Samic (smi-pro) does not have the expected name "Proto-Sami", even though it is the proto-language of the Sami languages (smi).
- Proto-Kuki-Chin (tbq-kuk-pro) does not have the expected name "Proto-Kukish", even though it is the proto-language of the Kukish languages (tbq-kuk).
- Proto-Saka (xsc-sak-pro) does not have the expected name "Proto-Sakan", even though it is the proto-language of the Sakan languages (xsc-sak).
- Proto-Sarmatian (xsc-sar-pro) has a proto-language code associated with the invalid code "xsc-sar".
- Literary Chinese (lzh-lit) has a canonical name that is not unique; it is also used by the code lzh.
- The data key preprocess_links for Hacked Thai (th-new) is invalid.
- Old Indo-Aryan languages (inc-old) has no child families or languages.
- Literary Chinese, the canonical name for the code lzh-lit, is wrong; it should be Literary Chinese.
- The code ira-mid and the canonical name Middle Iranian should be removed; they are not found in Module:families/data.
- The code ira-old and the canonical name Old Iranian should be removed; they are not found in Module:families/data.
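Many of the messages above follow one pattern: a proto-language is expected to be named "Proto-" followed by the canonical name of its family. The sketch below shows one way such a check could work. It is not the code of Module:data consistency check; the naming rule is inferred from the messages, and the function check_proto_names and both sample tables are hypothetical.

```lua
-- Hypothetical sample data; the names are taken from the messages above.
local families = {
	smi = { canonicalName = "Sami" },
	ine = { canonicalName = "Indo-European" },
}
local languages = {
	["smi-pro"] = { canonicalName = "Proto-Samic" },
	["ine-pro"] = { canonicalName = "Proto-Indo-European" },
}

-- Reports proto-languages whose name differs from "Proto-" .. family name.
-- Assumption: the family code is the language code without its "-pro" suffix.
local function check_proto_names(langs, fams)
	local messages = {}
	for code, lang in pairs(langs) do
		local family_code = code:match("^(.+)%-pro$")
		local family = family_code and fams[family_code]
		if family then
			local expected = "Proto-" .. family.canonicalName
			if lang.canonicalName ~= expected then
				table.insert(messages, string.format(
					'%s (%s) does not have the expected name "%s".',
					lang.canonicalName, code, expected))
			end
		end
	end
	table.sort(messages) -- pairs() has no fixed order, so sort for stable output
	return messages
end

local report = check_proto_names(languages, families)
```

With this sample data, only smi-pro is reported, because "Proto-Samic" differs from the derived name "Proto-Sami", while ine-pro matches its derived name.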
Required values
Every entry in the table must contain the following properties:
canonicalName
- The "canonical" name of the family. This is the name that is used in Wiktionary entries and category names.
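As an illustration of the required property, the following sketch shows a minimal entry and a simple check for entries that lack it. The table layout (a table m keyed by family code, with named fields) follows the description above but is an assumption about the exact syntax; the code xx-bad and the function missing_canonical_name are made up for this example.

```lua
-- Hypothetical excerpt of a data table keyed by family code.
local m = {}

m["gmq"] = {
	canonicalName = "North Germanic",
}

m["xx-bad"] = {} -- made-up code, deliberately missing canonicalName

-- Collects the codes of entries that lack a non-empty canonicalName.
local function missing_canonical_name(data)
	local bad = {}
	for code, entry in pairs(data) do
		if type(entry.canonicalName) ~= "string" or entry.canonicalName == "" then
			table.insert(bad, code)
		end
	end
	table.sort(bad) -- pairs() has no fixed order, so sort for stable output
	return bad
end

local problems = missing_canonical_name(m)
```

Here problems contains only the made-up code xx-bad, since gmq supplies the required name.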
Optional values
aliases
- A list of aliases/synonyms for the family, other than the canonical name.
varieties
- A table of family varieties that are subsumed under the family. This should not include those varieties for which separate family codes have been assigned. If a given variety has several names, they can all be listed by including a sublist in the overall list, where the first element is the canonical name that you want the variety to be known by, and the remainder are aliases. This facility is more commonly used for individual languages than families, but the facility is also provided for families. See Template:language extradata documentation for an example.
otherNames (deprecated)
- A table of all non-canonical names that this family is known by, including both synonyms and varieties. This should not be used in new families, and existing families should have the entries in this list moved into either aliases or varieties.
protoLanguage
- The language code for the proto-language of the family; that is, the common ancestor of all languages in the family. If this is not specified, Module:families will attempt to create the proto-language code by adding -pro to the family code (for instance, ine (Indo-European languages) → ine-pro (Proto-Indo-European language)) and then use Module:languages to check if the code is valid.
family
- The superfamily that the language family belongs to, if any.
wikidata_item
- The number in the Wikidata item id for the language family.
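The fallback described for protoLanguage can be sketched as follows. This is a simplified illustration, not the code of Module:families: the function proto_language_code, the valid_languages set (standing in for the lookup in Module:languages), and the xx- codes are all hypothetical. The sketch also assumes that an explicitly specified protoLanguage is used as given, which the description above does not spell out.

```lua
-- Hypothetical family entries; only ine comes from the text above.
local families = {
	ine = { canonicalName = "Indo-European" },
	["xx-fam"] = { canonicalName = "Example", protoLanguage = "xx-old" },
	["xx-iso"] = { canonicalName = "Isolated example" },
}

-- Hypothetical set of valid language codes (stands in for Module:languages).
local valid_languages = { ["ine-pro"] = true, ["xx-old"] = true }

-- Returns the proto-language code for a family, or nil if none can be found.
local function proto_language_code(family_code)
	local entry = families[family_code]
	if not entry then
		return nil -- unknown family code
	end
	if entry.protoLanguage then
		return entry.protoLanguage -- explicit value takes precedence
	end
	-- Fallback: append "-pro" and keep the result only if it is a valid code.
	local candidate = family_code .. "-pro"
	if valid_languages[candidate] then
		return candidate
	end
	return nil
end
```

In this sketch, ine resolves to ine-pro through the fallback, xx-fam resolves to its explicit value xx-old, and xx-iso yields nil because xx-iso-pro is not a valid code.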