Talk:

From Wiktionary, the free dictionary

zh-forms Box


@Wyang Would it be possible for you to insert font lang="zh-hant" and font lang="zh-hans" to Module:zh-forms so that the characters and (both encoded under the same code point) can be displayed correctly? Thanks. KevinUp (talk) 11:53, 7 June 2018 (UTC)

@KevinUp Lol, added. I had to remove class="Hani" so that the appropriate trad/simp fonts can be displayed. I hope that is the correct/proper way to do it. Wyang (talk) 12:10, 7 June 2018 (UTC)
@Wyang: Thanks. Problem solved. KevinUp (talk) 12:15, 7 June 2018 (UTC)
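For readers following the exchange above, the idea of tagging one code point with two different language tags can be sketched roughly as follows. This is only an illustration in Python, not the actual code of Module:zh-forms (which is a Lua module); the helper names `lang_span` and `trad_simp_markup` are hypothetical, and the example character U+73CA is borrowed from later in this thread.

```python
from html import escape


def lang_span(text, lang):
    """Wrap text in a span that carries only a lang attribute.

    No class attribute is emitted, mirroring the point made above that
    class="Hani" had to be dropped so the lang-specific fonts apply.
    """
    return f'<span lang="{escape(lang, quote=True)}">{escape(text)}</span>'


def trad_simp_markup(char):
    """Return (traditional, simplified) markup for the same code point.

    Hypothetical helper: both spans contain the identical character and
    differ only in the language tag.
    """
    return lang_span(char, "zh-hant"), lang_span(char, "zh-hans")


trad, simp = trad_simp_markup("\u73ca")  # U+73CA, a unified character
print(trad)
print(simp)
```

With markup like this, it is the browser's font selection for each language tag, not the code point, that decides which glyph shape is shown.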
@Wyang, @Zcreator alt, @Justinrleung, @Suzukaze-c Hi. After the most recent edit in Module:zh-forms [1], the zh-forms box isn't displaying the proper traditional and simplified forms based on the language tag. Also, I think it would be preferable to add in such characters manually rather than letting it do so automatically. If you look carefully in revision 49664286 [2], some characters added to the unified_char list such as (huì), (zhōu) show no significant difference between traditional and simplified forms in the Unicode charts. I don't think it is necessary to split between traditional and simplified forms for characters that show only minor cosmetic differences (mostly in the stroke direction) such as /, /, /, /, /, /, /, /, /. Instead, these differences should be noted in the translingual section (either as alternative forms or in their respective ids). KevinUp (talk) 13:08, 8 June 2018 (UTC)
I think it would be much better to add something such as "zh-forms|s=t" to characters such as 珊 (U+73CA), 琤 (U+7424), 猺 (U+733A), 瘟 (U+761F) and 莒 (U+8392). These are special exceptions that have been unified, whereas comparable characters such as 姍 (U+59CD)/姗 (U+59D7), 睜 (zhēng) (U+775C)/睁 (zhēng) (U+7741), 搖 (yáo) (U+6416)/摇 (yáo) (U+6447), 溫 (wēn) (U+6EAB)/温 (wēn) (U+6E29) and 宮 (gōng) (U+5BAE)/宫 (gōng) (U+5BAB) have been disunified. It should also be noted that Han unification is slightly inconsistent: characters derived from 冊 (U+518A)/册 (U+518C), 爭 (U+722D)/争 (U+4E89), 䍃 (U+4343), 𥁕 (U+25055)/昷 (U+6637) and 呂 (U+5442)/吕 (U+5415), for example, are not unified consistently. Some frequently used characters are split into separate code points whereas rarely used characters are unified. I would suggest adding "zh-forms|s=t" to such anomalies when encountered. KevinUp (talk) 10:22, 9 June 2018 (UTC)
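As a rough illustration of the unified/disunified distinction described above, the following Python sketch compares the code points of a few of the pairs cited in this thread. The names `code_point`, `is_unified` and `PAIRS` are made up for this example and do not correspond to anything in Module:zh-forms; the sketch also says nothing about how a parameter such as "s=t" would actually be handled.

```python
def code_point(ch):
    """Format a single character as U+XXXX (at least four hex digits)."""
    return f"U+{ord(ch):04X}"


def is_unified(trad, simp):
    """A pair counts as unified here if both forms share one code point."""
    return trad == simp


# (traditional, simplified) pairs taken from the code points cited above.
PAIRS = [
    ("\u73ca", "\u73ca"),  # U+73CA: one code point for both forms
    ("\u59cd", "\u59d7"),  # U+59CD / U+59D7: separate code points
    ("\u775c", "\u7741"),  # U+775C / U+7741: separate code points
]

for trad, simp in PAIRS:
    status = "unified" if is_unified(trad, simp) else "disunified"
    print(code_point(trad), code_point(simp), status)
```

Only pairs reported as unified would need the language tag (or a manual override) to tell the two forms apart; disunified pairs are already distinguished by their code points.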