Template talk:ja-spellings

Turn this into a morphology template?

Latest comment: 5 years ago4 comments3 people in discussion

withdrawing this

As you know, I've always wanted to remodel the Japanese entry layout after the Chinese one, in order to solve the two problems which makes the Wiktionary-default entry layout unsuitable for Japanese:

A Japanese entry usually have more than one spelling, and therefore must appear on multiple entries;
Homographs are common, especially when ancient words are included.

The solution I see is to use a soft-redirect system similar to Chinese entries: the lemma entry uses {{ja-forms}} to link to non-lemma spellings, and non-lemma spellings use {{ja-see}} to link to the lemma form, with these two templates modeled after {{zh-forms}} and {{zh-see}}.

Thus, Japanese entries are expected to have the following structure:

entry with one etymology

==Japanese==
<lexeme>

entry with multiple etymologies (e.g. homographs)

==Japanese==
===Etymology 1===
<lexeme 1>
===Etymology 2===
<lexeme 2>
===Etymology 3===
<lexeme 3>
...

where each <lexeme> is either a lemma entry or a non-lemma entry, signified whether it begins with {{ja-forms}} or {{ja-see}} respectively.

I created this template ({{ja-kanji spellings}}) as a substitute of what was to be {{ja-forms}} (whose name was already taken). It had the basic feature to list all the spellings of an entry as well as to link to its non-lemma forms. Unfortunately (1) I was not good at web design and it looked ugly, and (2) the problem of where to place {{ja-kanjitab}}s remained. However, it fell short of the features of {{zh-forms}} which it was modeled after: to show the internal structure of the term. For example, the Chinese entry 國際音標 had |type=22 to show it was a compound of 國際 + 音標, but the similar Japanese entry 国際音声記号 has no such information, so the word structure must be shown in the etymology section.

Therefore I'd like to propose adding morphology information to this template:

	international; (obs) diplomatic intercourse	phonetic symbol
hiragana	こくさい	おんせいきごう
kanji (国際音声記号)	国際	音声記号

(The resulting template is meant to be put on the lemma entry. Information about individual kanji is still to be handled by {{ja-kanjitab}}, which can be either centralized or listed on the individual non-lemma entries.)

What do you think about such an approach? 草草頓首

(Notifying Eirikr, Wyang, TAKASUGI Shinji, Nibiko, Atitarev, Suzukaze-c, Poketalker, Cnilep, Britannic124, Fumiko Take, Nardog, Marlin Setia1, AstroVulpes, Tsukuyone, Aogaeru4): --Dine2016 (talk) 07:41, 14 November 2018 (UTC)Reply

I’m not against using it, but isn’t it enough to explain word formation in the etymology section? I think there aren’t so many cases where word boundaries are different. — TAKASUGI Shinji (talk) 08:53, 14 November 2018 (UTC)Reply

@TAKASUGI Shinji: Thanks for your reply. For compounds, word formation is obvious in most cases. If you explain it in the etymology section, you probably need to type something like

{{com|ja|インド|ヨーロッパ|語族|tr1=Indo|t1=[[India]]|tr2=Yōroppa|t2=[[Europe]]|tr3=gozoku|t3=[[language family]]}}

. On the other hand, you could just have {{ja-forms|インド・ヨーロッパ語族|h=インド・ヨーロッパごぞく}} and the template will fill the rest automatically. Of course, there are many problems and difficulties with such an approach, especially when concerning 和語. --Dine2016 (talk) 16:06, 14 November 2018 (UTC)Reply

Although this proposal is withdrawn, I think it is a good idea to show information such as 国際 + 音声記号, perhaps as an extension under {{ja-kanjitab}}. There is too much information if I were to click on each kanji: 国 + 際 + 音 + 声 + 記 + 号. By the way, this proposal works best for Sino-Japanese compounds that are written without okurigana. KevinUp (talk) 21:55, 30 March 2019 (UTC)Reply

Accent reference carried from kana entry

Latest comment: 6 years ago2 comments2 people in discussion

@Dine2016, please take a look at 加留多, is this normal? ～ POKéTalker（═◉═） 22:04, 29 January 2019 (UTC)Reply

@Poketalker: It's fixed; thanks for reporting the problem. (It was a problem in Module:ja-parse, which only removed references in the form <ref...</ref>, not <ref.../>.) --Dine2016 (talk) 02:10, 30 January 2019 (UTC)Reply

kyūjitai

Latest comment: 5 years ago18 comments3 people in discussion

@Dine2016: Is kyūjitai ~~under~~within the scope of this template? —Suzukaze-c ◇◇ 06:00, 6 March 2019 (UTC)Reply

@Suzukaze-c: Thanks for the question. I considered kyūjitai (and other orthographical variations such as 日々 vs 日日) to be specific to spellings rather than words, so I left them out when making this template, leaving them for {{ja-kanjitab}} for handle.

{{ja-spellings|ひろがる|広がる|拡がる}} // to be put on whatever is chosen as the main entry
{{ja-kanjitab|k=廣がる|ひろ|yomi=k}} // to be put on 広がる
{{ja-kanjitab|k=擴がる|ひろ|yomi=k}} // to be put on 拡がる

I'm not sure if this is the best approach, though. What's your thought? --Dine2016 (talk) 06:29, 6 March 2019 (UTC)Reply

Ah, I forgot that my original plan was to make them generated automatically (except for ambiguous cases like 弁 and 芸). --Dine2016 (talk) 06:38, 6 March 2019 (UTC)Reply

@Dine2016: I had thought that such variations might be within the scope (being mere alternative spellings), but including them in your concept of {{ja-kanjitab}} also makes sense. It would be more concise, as well. —Suzukaze-c ◇◇ 08:16, 11 March 2019 (UTC)Reply

Actually, I put off kyūjitai because there are other issues to be solved (such as how to display the kyūjitai if it's unified with the shinjitai in Unicode, e.g. 漢). --Dine2016 (talk) 08:40, 11 March 2019 (UTC)Reply

ah, my favorite part of unicode~ they did such a great job with han unification. —Suzukaze-c ◇◇ 09:01, 11 March 2019 (UTC)Reply

Um…, a few Hongkongese are unhappy about it: [1] [2]. I guess it has something to do with the lack of an official standard of kyūjitai. --Dine2016 (talk) 10:41, 11 March 2019 (UTC)Reply

(is sarcasm _(：3 」∠ )_ —Suzukaze-c ◇◇ 17:14, 11 March 2019 (UTC))Reply

@Suzukaze-c Which kyūjitai standard do you prefer - JIS X 0208 / JIS X 0213 or Unicode? If the latter, we might as well use pictures for unified characters such as 漢. Also, do you happen to be aware of any shin-to-kyū conversion table which follows one of the listed standards? --Dine2016 (talk) 10:39, 26 March 2019 (UTC)Reply

@Dine2016: I'm not sure (´・ω・｀) Perhaps we should follow JIS use of codepoints, to align with Japanese computing, but the shape of the characters is also important … I like how the Chinese Wikipedia page for 旧字体 uses CJK compatibility characters, and there is also the possibility of using Unicode variation selectors. wikisource:ja:ヘルプ:異体字 might be worth looking at. As for lists, what about jaglyphwiki:Group:常用漢字の旧字体 or [3]? —Suzukaze-c ◇◇ 19:23, 26 March 2019 (UTC)Reply

@Dine2016 Hey there. I compiled the following shinjitai to kyūjitai conversion list:

No. 1 to No. 362 are from Jōyō kanji (2010).

No. 363 to No. 380 are from Jinmeiyō kanji (2015) (212 kanji that already appear in Jōyō kanji (2010) are omitted).

No. 381 to No. 398 are from Hyōgai kanji (2000) (4 kanji already listed in Jōyō kanji or Jinmeiyō kanji are omitted).

Codepoints of compatibility ideographs have been bolded. Also, note that shinjitai 弁 (U+5F01) corresponds to three different kyūjitai. For Jōyō kanji, all codepoints are obtained by converting PDF files into text files. KevinUp (talk) 12:39, 29 March 2019 (UTC)Reply

More information

亜 (U+4E9C) → 亞 (U+4E9E)
悪 (U+60AA) → 惡 (U+60E1)
圧 (U+5727) → 壓 (U+58D3)
囲 (U+56F2) → 圍 (U+570D)
医 (U+533B) → 醫 (U+91AB)
為 (U+70BA) → 爲 (U+7232)
壱 (U+58F1) → 壹 (U+58F9)
逸 (U+9038) → 逸 (U+FA67)
隠 (U+96A0) → 隱 (U+96B1)
栄 (U+6804) → 榮 (U+69AE)
営 (U+55B6) → 營 (U+71DF)
衛 (U+885B) → 衞 (U+885E)
駅 (U+99C5) → 驛 (U+9A5B)
謁 (U+8B01) → 謁 (U+FA62)
円 (U+5186) → 圓 (U+5713)
塩 (U+5869) → 鹽 (U+9E7D)
縁 (U+7E01) → 緣 (U+7DE3)
艶 (U+8276) → 艷 (U+8277)
応 (U+5FDC) → 應 (U+61C9)
欧 (U+6B27) → 歐 (U+6B50)
殴 (U+6BB4) → 毆 (U+6BC6)
桜 (U+685C) → 櫻 (U+6AFB)
奥 (U+5965) → 奧 (U+5967)
横 (U+6A2A) → 橫 (U+6A6B)
温 (U+6E29) → 溫 (U+6EAB)
穏 (U+7A4F) → 穩 (U+7A69)
仮 (U+4EEE) → 假 (U+5047)
価 (U+4FA1) → 價 (U+50F9)
禍 (U+798D) → 禍 (U+FA52)
画 (U+753B) → 畫 (U+756B)
会 (U+4F1A) → 會 (U+6703)
悔 (U+6094) → 悔 (U+FA3D)
海 (U+6D77) → 海 (U+FA45)
絵 (U+7D75) → 繪 (U+7E6A)
壊 (U+58CA) → 壞 (U+58DE)
懐 (U+61D0) → 懷 (U+61F7)
慨 (U+6168) → 慨 (U+FA3E)
概 (U+6982) → 槪 (U+69EA)
拡 (U+62E1) → 擴 (U+64F4)
殻 (U+6BBB) → 殼 (U+6BBC)
覚 (U+899A) → 覺 (U+89BA)
学 (U+5B66) → 學 (U+5B78)
岳 (U+5CB3) → 嶽 (U+5DBD)
楽 (U+697D) → 樂 (U+6A02)
喝 (U+559D) → 喝 (U+FA36)
渇 (U+6E07) → 渴 (U+6E34)
褐 (U+8910) → 褐 (U+FA60)
缶 (U+7F36) → 罐 (U+7F50)
巻 (U+5DFB) → 卷 (U+5377)
陥 (U+9665) → 陷 (U+9677)
勧 (U+52E7) → 勸 (U+52F8)
寛 (U+5BDB) → 寬 (U+5BEC)
漢 (U+6F22) → 漢 (U+FA47)
関 (U+95A2) → 關 (U+95DC)
歓 (U+6B53) → 歡 (U+6B61)
観 (U+89B3) → 觀 (U+89C0)
気 (U+6C17) → 氣 (U+6C23)
祈 (U+7948) → 祈 (U+FA4E)
既 (U+65E2) → 既 (U+FA42)
帰 (U+5E30) → 歸 (U+6B78)
亀 (U+4E80) → 龜 (U+9F9C)
器 (U+5668) → 器 (U+FA38)
偽 (U+507D) → 僞 (U+50DE)
戯 (U+622F) → 戲 (U+6232)
犠 (U+72A0) → 犧 (U+72A7)
旧 (U+65E7) → 舊 (U+820A)
拠 (U+62E0) → 據 (U+64DA)
挙 (U+6319) → 擧 (U+64E7)
虚 (U+865A) → 虛 (U+865B)
峡 (U+5CE1) → 峽 (U+5CFD)
挟 (U+631F) → 挾 (U+633E)
狭 (U+72ED) → 狹 (U+72F9)
郷 (U+90F7) → 鄕 (U+9115)
響 (U+97FF) → 響 (U+FA69)
暁 (U+6681) → 曉 (U+66C9)
勤 (U+52E4) → 勤 (U+FA34)
謹 (U+8B39) → 謹 (U+FA63)
区 (U+533A) → 區 (U+5340)
駆 (U+99C6) → 驅 (U+9A45)
勲 (U+52F2) → 勳 (U+52F3)
薫 (U+85AB) → 薰 (U+85B0)
径 (U+5F84) → 徑 (U+5F91)
茎 (U+830E) → 莖 (U+8396)
恵 (U+6075) → 惠 (U+60E0)
掲 (U+63B2) → 揭 (U+63ED)
渓 (U+6E13) → 溪 (U+6EAA)
経 (U+7D4C) → 經 (U+7D93)
蛍 (U+86CD) → 螢 (U+87A2)
軽 (U+8EFD) → 輕 (U+8F15)
継 (U+7D99) → 繼 (U+7E7C)
鶏 (U+9D8F) → 鷄 (U+9DC4)
芸 (U+82B8) → 藝 (U+85DD)
撃 (U+6483) → 擊 (U+64CA)
欠 (U+6B20) → 缺 (U+7F3A)
研 (U+7814) → 硏 (U+784F)
県 (U+770C) → 縣 (U+7E23)
倹 (U+5039) → 儉 (U+5109)
剣 (U+5263) → 劍 (U+528D)
険 (U+967A) → 險 (U+96AA)
圏 (U+570F) → 圈 (U+5708)
検 (U+691C) → 檢 (U+6AA2)
献 (U+732E) → 獻 (U+737B)
権 (U+6A29) → 權 (U+6B0A)
顕 (U+9855) → 顯 (U+986F)
験 (U+9A13) → 驗 (U+9A57)
厳 (U+53B3) → 嚴 (U+56B4)
広 (U+5E83) → 廣 (U+5EE3)
効 (U+52B9) → 效 (U+6548)
恒 (U+6052) → 恆 (U+6046)
黄 (U+9EC4) → 黃 (U+9EC3)
鉱 (U+9271) → 鑛 (U+945B)
号 (U+53F7) → 號 (U+865F)
国 (U+56FD) → 國 (U+570B)
黒 (U+9ED2) → 黑 (U+9ED1)
穀 (U+7A40) → 穀 (U+FA54)
砕 (U+7815) → 碎 (U+788E)
済 (U+6E08) → 濟 (U+6FDF)
斎 (U+658E) → 齋 (U+9F4B)
剤 (U+5264) → 劑 (U+5291)
殺 (U+6BBA) → 殺 (U+F970)
雑 (U+96D1) → 雜 (U+96DC)
参 (U+53C2) → 參 (U+53C3)
桟 (U+685F) → 棧 (U+68E7)
蚕 (U+8695) → 蠶 (U+8836)
惨 (U+60E8) → 慘 (U+6158)
賛 (U+8CDB) → 贊 (U+8D0A)
残 (U+6B8B) → 殘 (U+6B98)
糸 (U+7CF8) → 絲 (U+7D72)
祉 (U+7949) → 祉 (U+FA4D)
視 (U+8996) → 視 (U+FA61)
歯 (U+6B6F) → 齒 (U+9F52)
児 (U+5150) → 兒 (U+5152)
辞 (U+8F9E) → 辭 (U+8FAD)
湿 (U+6E7F) → 濕 (U+6FD5)
実 (U+5B9F) → 實 (U+5BE6)
写 (U+5199) → 寫 (U+5BEB)
社 (U+793E) → 社 (U+FA4C)
者 (U+8005) → 者 (U+FA5B)
煮 (U+716E) → 煮 (U+FA48)
釈 (U+91C8) → 釋 (U+91CB)
寿 (U+5BFF) → 壽 (U+58FD)
収 (U+53CE) → 收 (U+6536)
臭 (U+81ED) → 臭 (U+FA5C)
従 (U+5F93) → 從 (U+5F9E)
渋 (U+6E0B) → 澁 (U+6F81)
獣 (U+7363) → 獸 (U+7378)
縦 (U+7E26) → 縱 (U+7E31)
祝 (U+795D) → 祝 (U+FA51)
粛 (U+7C9B) → 肅 (U+8085)
処 (U+51E6) → 處 (U+8655)
暑 (U+6691) → 暑 (U+FA43)
署 (U+7F72) → 署 (U+FA5A)
緒 (U+7DD2) → 緖 (U+7DD6)
諸 (U+8AF8) → 諸 (U+FA22)
叙 (U+53D9) → 敍 (U+654D)
将 (U+5C06) → 將 (U+5C07)
祥 (U+7965) → 祥 (U+FA1A)
称 (U+79F0) → 稱 (U+7A31)
渉 (U+6E09) → 涉 (U+6D89)
焼 (U+713C) → 燒 (U+71D2)
証 (U+8A3C) → 證 (U+8B49)
奨 (U+5968) → 奬 (U+596C)
条 (U+6761) → 條 (U+689D)
状 (U+72B6) → 狀 (U+72C0)
乗 (U+4E57) → 乘 (U+4E58)
浄 (U+6D44) → 淨 (U+6DE8)
剰 (U+5270) → 剩 (U+5269)
畳 (U+7573) → 疊 (U+758A)
縄 (U+7E04) → 繩 (U+7E69)
壌 (U+58CC) → 壤 (U+58E4)
嬢 (U+5B22) → 孃 (U+5B43)
譲 (U+8B72) → 讓 (U+8B93)
醸 (U+91B8) → 釀 (U+91C0)
触 (U+89E6) → 觸 (U+89F8)
嘱 (U+5631) → 囑 (U+56D1)
神 (U+795E) → 神 (U+FA19)
真 (U+771F) → 眞 (U+771E)
寝 (U+5BDD) → 寢 (U+5BE2)
慎 (U+614E) → 愼 (U+613C)
尽 (U+5C3D) → 盡 (U+76E1)
図 (U+56F3) → 圖 (U+5716)
粋 (U+7C8B) → 粹 (U+7CB9)
酔 (U+9154) → 醉 (U+9189)
穂 (U+7A42) → 穗 (U+7A57)
随 (U+968F) → 隨 (U+96A8)
髄 (U+9AC4) → 髓 (U+9AD3)
枢 (U+67A2) → 樞 (U+6A1E)
数 (U+6570) → 數 (U+6578)
瀬 (U+702C) → 瀨 (U+7028)
声 (U+58F0) → 聲 (U+8072)
斉 (U+6589) → 齊 (U+9F4A)
静 (U+9759) → 靜 (U+975C)
窃 (U+7A83) → 竊 (U+7ACA)
摂 (U+6442) → 攝 (U+651D)
節 (U+7BC0) → 節 (U+FA56)
専 (U+5C02) → 專 (U+5C08)
浅 (U+6D45) → 淺 (U+6DFA)
戦 (U+6226) → 戰 (U+6230)
践 (U+8DF5) → 踐 (U+8E10)
銭 (U+92AD) → 錢 (U+9322)
潜 (U+6F5C) → 潛 (U+6F5B)
繊 (U+7E4A) → 纖 (U+7E96)
禅 (U+7985) → 禪 (U+79AA)
祖 (U+7956) → 祖 (U+FA50)
双 (U+53CC) → 雙 (U+96D9)
壮 (U+58EE) → 壯 (U+58EF)
争 (U+4E89) → 爭 (U+722D)
荘 (U+8358) → 莊 (U+838A)
捜 (U+635C) → 搜 (U+641C)
挿 (U+633F) → 插 (U+63D2)
巣 (U+5DE3) → 巢 (U+5DE2)
曽 (U+66FD) → 曾 (U+66FE)
痩 (U+75E9) → 瘦 (U+7626)
装 (U+88C5) → 裝 (U+88DD)
僧 (U+50E7) → 僧 (U+FA31)
層 (U+5C64) → 層 (U+FA3B)
総 (U+7DCF) → 總 (U+7E3D)
騒 (U+9A12) → 騷 (U+9A37)
増 (U+5897) → 增 (U+589E)
憎 (U+618E) → 憎 (U+FA3F)
蔵 (U+8535) → 藏 (U+85CF)
贈 (U+8D08) → 贈 (U+FA65)
臓 (U+81D3) → 臟 (U+81DF)
即 (U+5373) → 卽 (U+537D)
属 (U+5C5E) → 屬 (U+5C6C)
続 (U+7D9A) → 續 (U+7E8C)
堕 (U+5815) → 墮 (U+58AE)
対 (U+5BFE) → 對 (U+5C0D)
体 (U+4F53) → 體 (U+9AD4)
帯 (U+5E2F) → 帶 (U+5E36)
滞 (U+6EDE) → 滯 (U+6EEF)
台 (U+53F0) → 臺 (U+81FA)
滝 (U+6EDD) → 瀧 (U+7027)
択 (U+629E) → 擇 (U+64C7)
沢 (U+6CA2) → 澤 (U+6FA4)
担 (U+62C5) → 擔 (U+64D4)
単 (U+5358) → 單 (U+55AE)
胆 (U+80C6) → 膽 (U+81BD)
嘆 (U+5606) → 嘆 (U+FA37)
団 (U+56E3) → 團 (U+5718)
断 (U+65AD) → 斷 (U+65B7)
弾 (U+5F3E) → 彈 (U+5F48)
遅 (U+9045) → 遲 (U+9072)
痴 (U+75F4) → 癡 (U+7661)
虫 (U+866B) → 蟲 (U+87F2)
昼 (U+663C) → 晝 (U+665D)
鋳 (U+92F3) → 鑄 (U+9444)
著 (U+8457) → 著 (U+FA5F)
庁 (U+5E81) → 廳 (U+5EF3)
徴 (U+5FB4) → 徵 (U+5FB5)
聴 (U+8074) → 聽 (U+807D)
懲 (U+61F2) → 懲 (U+FA40)
勅 (U+52C5) → 敕 (U+6555)
鎮 (U+93AE) → 鎭 (U+93AD)
塚 (U+585A) → 塚 (U+FA10)
逓 (U+9013) → 遞 (U+905E)
鉄 (U+9244) → 鐵 (U+9435)
点 (U+70B9) → 點 (U+9EDE)
転 (U+8EE2) → 轉 (U+8F49)
伝 (U+4F1D) → 傳 (U+50B3)
都 (U+90FD) → 都 (U+FA26)
灯 (U+706F) → 燈 (U+71C8)
当 (U+5F53) → 當 (U+7576)
党 (U+515A) → 黨 (U+9EE8)
盗 (U+76D7) → 盜 (U+76DC)
稲 (U+7A32) → 稻 (U+7A3B)
闘 (U+95D8) → 鬭 (U+9B2D)
徳 (U+5FB3) → 德 (U+5FB7)
独 (U+72EC) → 獨 (U+7368)
読 (U+8AAD) → 讀 (U+8B80)
突 (U+7A81) → 突 (U+FA55)
届 (U+5C4A) → 屆 (U+5C46)
難 (U+96E3) → 難 (U+FA68)
弐 (U+5F10) → 貳 (U+8CB3)
悩 (U+60A9) → 惱 (U+60F1)
脳 (U+8133) → 腦 (U+8166)
覇 (U+8987) → 霸 (U+9738)
拝 (U+62DD) → 拜 (U+62DC)
廃 (U+5EC3) → 廢 (U+5EE2)
売 (U+58F2) → 賣 (U+8CE3)
梅 (U+6885) → 梅 (U+FA44)
麦 (U+9EA6) → 麥 (U+9EA5)
発 (U+767A) → 發 (U+767C)
髪 (U+9AEA) → 髮 (U+9AEE)
抜 (U+629C) → 拔 (U+62D4)
繁 (U+7E41) → 繁 (U+FA59)
晩 (U+6669) → 晚 (U+665A)
蛮 (U+86EE) → 蠻 (U+883B)
卑 (U+5351) → 卑 (U+FA35)
秘 (U+79D8) → 祕 (U+7955)
碑 (U+7891) → 碑 (U+FA4B)
浜 (U+6D5C) → 濱 (U+6FF1)
賓 (U+8CD3) → 賓 (U+FA64)
頻 (U+983B) → 頻 (U+FA6A)
敏 (U+654F) → 敏 (U+FA41)
瓶 (U+74F6) → 甁 (U+7501)
侮 (U+4FAE) → 侮 (U+FA30)
福 (U+798F) → 福 (U+FA1B)
払 (U+6255) → 拂 (U+62C2)
仏 (U+4ECF) → 佛 (U+4F5B)
併 (U+4F75) → 倂 (U+5002)
並 (U+4E26) → 竝 (U+7ADD)
塀 (U+5840) → 塀 (U+FA39)
餅 (U+9905) → 餠 (U+9920)
辺 (U+8FBA) → 邊 (U+908A)
変 (U+5909) → 變 (U+8B8A)
弁 (U+5F01) → 辨 (U+8FA8), 瓣 (U+74E3), 辯 (U+8FAF)
勉 (U+52C9) → 勉 (U+FA33)
歩 (U+6B69) → 步 (U+6B65)
宝 (U+5B9D) → 寶 (U+5BF6)
豊 (U+8C4A) → 豐 (U+8C50)
褒 (U+8912) → 襃 (U+8943)
墨 (U+58A8) → 墨 (U+FA3A)
翻 (U+7FFB) → 飜 (U+98DC)
毎 (U+6BCE) → 每 (U+6BCF)
万 (U+4E07) → 萬 (U+842C)
満 (U+6E80) → 滿 (U+6EFF)
免 (U+514D) → 免 (U+FA32)
麺 (U+9EBA) → 麵 (U+9EB5)
黙 (U+9ED9) → 默 (U+9ED8)
弥 (U+5F25) → 彌 (U+5F4C)
訳 (U+8A33) → 譯 (U+8B6F)
薬 (U+85AC) → 藥 (U+85E5)
与 (U+4E0E) → 與 (U+8207)
予 (U+4E88) → 豫 (U+8C6B)
余 (U+4F59) → 餘 (U+9918)
誉 (U+8A89) → 譽 (U+8B7D)
揺 (U+63FA) → 搖 (U+6416)
様 (U+69D8) → 樣 (U+6A23)
謡 (U+8B21) → 謠 (U+8B20)
来 (U+6765) → 來 (U+4F86)
頼 (U+983C) → 賴 (U+8CF4)
乱 (U+4E71) → 亂 (U+4E82)
覧 (U+89A7) → 覽 (U+89BD)
欄 (U+6B04) → 欄 (U+F91D)
竜 (U+7ADC) → 龍 (U+9F8D)
隆 (U+9686) → 隆 (U+F9DC)
虜 (U+865C) → 虜 (U+F936)
両 (U+4E21) → 兩 (U+5169)
猟 (U+731F) → 獵 (U+7375)
緑 (U+7DD1) → 綠 (U+7DA0)
涙 (U+6D99) → 淚 (U+6DDA)
塁 (U+5841) → 壘 (U+58D8)
類 (U+985E) → 類 (U+F9D0)
礼 (U+793C) → 禮 (U+79AE)
励 (U+52B1) → 勵 (U+52F5)
戻 (U+623B) → 戾 (U+623E)
霊 (U+970A) → 靈 (U+9748)
齢 (U+9F62) → 齡 (U+9F61)
暦 (U+66A6) → 曆 (U+66C6)
歴 (U+6B74) → 歷 (U+6B77)
恋 (U+604B) → 戀 (U+6200)
練 (U+7DF4) → 練 (U+FA57)
錬 (U+932C) → 鍊 (U+934A)
炉 (U+7089) → 爐 (U+7210)
労 (U+52B4) → 勞 (U+52DE)
郎 (U+90CE) → 郞 (U+90DE)
朗 (U+6717) → 朗 (U+F929)
廊 (U+5ECA) → 廊 (U+F928)
楼 (U+697C) → 樓 (U+6A13)
録 (U+9332) → 錄 (U+9304)
湾 (U+6E7E) → 灣 (U+7063)
亘 (U+4E98) → 亙 (U+4E99)
凜 (U+51DC) → 凛 (U+51DB)
尭 (U+5C2D) → 堯 (U+582F)
巌 (U+5DCC) → 巖 (U+5DD6)
晃 (U+6643) → 晄 (U+6644)
桧 (U+6867) → 檜 (U+6A9C)
槙 (U+69D9) → 槇 (U+69C7)
渚 (U+6E1A) → 渚 (U+FA46)
猪 (U+732A) → 猪 (U+FA16)
琢 (U+7422) → 琢 (U+FA4A)
祢 (U+7962) → 禰 (U+79B0)
祐 (U+7950) → 祐 (U+FA4F)
祷 (U+7977) → 禱 (U+79B1)
禄 (U+7984) → 祿 (U+797F)
禎 (U+798E) → 禎 (U+FA53)
穣 (U+7A63) → 穰 (U+7A70)
萌 (U+840C) → 萠 (U+8420)
遥 (U+9065) → 遙 (U+9059)
唖 (U+5516) → 啞 (U+555E)
頴 (U+9834) → 穎 (U+7A4E)
鴎 (U+9D0E) → 鷗 (U+9DD7)
撹 (U+64B9) → 攪 (U+652A)
麹 (U+9EB9) → 麴 (U+9EB4)
鹸 (U+9E78) → 鹼 (U+9E7C)
噛 (U+565B) → 嚙 (U+5699)
繍 (U+7E4D) → 繡 (U+7E61)
蒋 (U+848B) → 蔣 (U+8523)
醤 (U+91A4) → 醬 (U+91AC)
掻 (U+63BB) → 搔 (U+6414)
屏 (U+5C4F) → 屛 (U+5C5B)
并 (U+5E76) → 幷 (U+5E77)
桝 (U+685D) → 枡 (U+67A1)
沪 (U+6CAA) → 濾 (U+6FFE)
芦 (U+82A6) → 蘆 (U+8606)
蝋 (U+874B) → 蠟 (U+881F)
弯 (U+5F2F) → 彎 (U+5F4E)

@KevinUp: Thanks a lot! Though I would like to add 欠（けつ゠缺、けん゠欠）・芸（げい゠藝、うん゠芸）・缶（かん゠罐、ふ゠缶） to the one-shin-to-multiple-kyū list. (Well, there are also cases like 虫（ちゅう゠蟲、き゠虫）・糸（し゠絲、べき゠糸）, in terms that reference the original glyphs such as 虫部 and 糸部.)

By the way, very curious why 篭-籠 isn't listed _(:зゝ∠)_ --Dine2016 (talk) 04:51, 30 March 2019 (UTC)Reply

籠 is still the official form in Jōyō kanji (2010) and no shinjitai is listed for it. Japan has strict rules regarding character simplification and only shinjitai listed in official documents are considered official. Apparently, 篭 is still considered an extended shinjitai character. KevinUp (talk) 21:55, 30 March 2019 (UTC)Reply

On an unrelated note, I noticed that the radical 辶 is written as three strokes for Jōyō kanji characters but reverts to its original form as four strokes (辶) for non Jōyō kanji characters such as Jinmeiyō kanji. KevinUp (talk) 21:55, 30 March 2019 (UTC)Reply

@KevinUp: Very nice. Out of curiosity, which PDF files were specifically used? —Suzukaze-c ◇◇ 20:48, 10 April 2019 (UTC)Reply

@Suzukaze-c: (1) Jōyō kanji (2010) [4] (2) Jinmeiyō kanji (2015) [5] (3) Hyōgai kanji (2000) [6] - I obtained the codepoints manually for glyphs that are not part of Jōyō kanji. Also, the simplified forms in hyōgai kanji are known as 簡易慣用字体 (variant kanji that can be used in place of 印刷標準字体).

Anyway, these are the official approved shinjitai. Hopefully we could have it all automated. I think we can consider shinjitai not found in the list above as 拡張新字体 (extended or unofficial shinjitai). KevinUp (talk) 02:03, 12 April 2019 (UTC)Reply

@suzukaze-c should 𥳑 be listed as a kyujitai of 簡? This character isn't even supported on my Android phone (which uses Source Han Sans :) Dine2016 (talk) 12:15, 9 April 2019 (UTC)Reply

@Dine2016: I don't know orz —Suzukaze-c ◇◇ 20:48, 10 April 2019 (UTC)Reply

romaji

Latest comment: 5 years ago2 comments2 people in discussion

Should we add romaji information to this template? Then we may drop spelling information from headings. This is an example layout:

Hiragana	modern	まっとう (mattō)
Hiragana	historical	まつたう
Kanji	全う真っ当 ateji, adjective only 完う literary

--115.27.198.88 21:52, 8 April 2019 (UTC)Reply

Hi, thanks for bringing up this issue. Unlike the kana spellings, the romaji is dependent on the POS. For example, 本 (ほん) is romanized as “hon” as a noun (“book”), but “-hon” as a counter. 紫 (むらさき) is “murasaki” as a noun, but “Murasaki” as a proper noun. So I think it might be advantageous to let the romaji remain in the POS headers. On the other hand, I recognize that doing so would mean repeating the same information in every POS header, which can be tedious and error-prone. Which is why I proposed a “unified Japanese” entry layout similar to that of Chinese. Such an entry layout puts the romaji in the pronunciation template, because the romaji is closer to actual pronunciation than to modern kana spelling (gendai kanazukai). Finally, if we still want to put the romaji in the ja-spellings box, it is better to give the romaji a separate row, below kana and kanji. By doing so there will be abundant space to add both Hepburn and Kunrei-shiki romanizations. I personally don't like adding anything to the right of any kana or kanji spelling listed in the ja-spellings box, because it would make the kana and kanji spellings no longer centered in the box and vertically aligned. Thanks. --Dine2016 (talk) 00:28, 9 April 2019 (UTC)Reply