Module talk:th-pron

From Wiktionary, the free dictionary
Latest comment: 3 years ago by ShakespeareFan00 in topic Requesting page protection...
Jump to navigation Jump to search

New features

[edit]

@Octahedron80, Wyang Hi. @Octahedron80, could you document new features and symbols introduced into the module, please? --Anatoli T. (обсудить/вклад) 04:13, 5 March 2017 (UTC)Reply

About little bar, is it? According to the standard, there is no mark to indicate mid tone of dead syllables. So I introduce U+0304 COMBINING MACRON to function this. (It must be better than degree sign I used firstly.) It must be put over consonant like other tone marks.

The mid tone of dead syllables often occurs in "-ะ" (and some other short vowels without final) which is not the last syllable; it becomes mid tone in speaking. It is like unstressed as they called in English, but Thai does not use stress concept. But not every "-ะ" becomes mid tone. วัฒนธรรม and ลักษณนาม are the best examples.

I also make IPA righter and simpler to read since we use / / for wide pronunciation, as suggested by a thwikt linguist there. --Octahedron80 (talk) 08:30, 5 March 2017 (UTC)Reply

@Octahedron80 Thanks. Isn't it what it is in the pronunciation of the first syllable in สวัสดี (sà-wàt-dii)? Phonetic: ส̄ะ-หฺวัด-ดี /sa˧.wat˨˩.diː˧/? --Anatoli T. (обсудить/вклад) 09:25, 5 March 2017 (UTC)Reply
Lots of words apply this. Must manually check. --Octahedron80 (talk) 09:58, 5 March 2017 (UTC)Reply

@Atitarev, Wyang Saek language also use the (-) in the same concept either. See the research [1] (any supplement). --Octahedron80 (talk) 07:55, 17 August 2017 (UTC)Reply

Thanks, very interesting. Is this the official orthography for Saek? I wonder what tone this sign mark represents. I can also see a tent-like (^) diacritic - a tone mark as well? Wyang (talk) 10:15, 17 August 2017 (UTC)Reply
This is the first-and-only writing system of Saek, finished in 2013, because the language has been only spoken. The research is the result of the contribution of local people. They start studying and teaching with this orthography for conservation. (I cannot call it "official" but the people already accept.) Of course about the tone mark. --Octahedron80 (talk) 10:16, 17 August 2017 (UTC)Reply

Question

[edit]

@Wyang, Octahedron80 Hi. I have three questions: First, is the code shown here what is actually running on Wiktionary or can this be out of sync with production? Second, I'm studying how transliteration is done and I'm having trouble running this locally (this was before the Jun26 edit). Is there a stand-alone version of this script that can be run with Lua only, without any PHP integration? Third, what font/IDE can you use to make editing Thai scripts easier? It is very difficult trying to understand the regular expressions when Thai has so many zero-width characters. Many thanks for your work. Stutterbub (talk) 07:00, 27 June 2018 (UTC)Reply

@Stutterbub Hi. With the questions above: (1) The code shown here is what is currently running on (the English) Wiktionary. (2) Unfortunately I'm only familiar with Lua in a Wiktionary context, and am not sure how this relates to standalone Lua interpreters. (3) I'm not sure about the font either; Octahedron80 would probably know. Please note that all Thai entries/texts here should be free of zero-width characters, and this module does not accept ZWSP in the input. Automatic Thai text segmentation is infeasible using ZWSP-free texts, and this module requires a priori segmentation/syllabification in the input. More documentation can be found on {{th-pron}}. Any other questions please let me know. Wyang (talk) 07:12, 27 June 2018 (UTC)Reply
(1) There is another version of th-pron running on Thai Wiktionary managed by me. I do not know if there are more wikis use it in their own ways.
(2) Lua scripts here are definitely not able run on standalone environment, especially MediaWiki-ralated libraries. You have to convert some parts and replace with equivalent functions yourself. However, you still can study their algorithm for re-writing. No shortcut.
(3) You can use any editor (except Notepad) but neither font will make you code easier; marks will only stack up in one place. First thing you can use is logical thinking what you just type in. Or you might want to escape them into codepoints but it will result in much longer expressions. The easiest way is to copy them here quote-to-quote.

--Octahedron80 (talk) 09:35, 27 June 2018 (UTC)Reply

PS. BabelMap may help you understand already-written regex by pasting on it. --Octahedron80 (talk) 09:41, 27 June 2018 (UTC)Reply
BabelPad has a "simple rendering" option that makes combining characters non-combining. —Suzukaze-c 18:07, 27 June 2018 (UTC)Reply
If you want to develop an off-wiki version of the module, it may be possible if you download the libraries that the module uses and install them where your Lua interpreter can find and load them. A mirror of libraryUtil, mw.html, and mw.text is available here, and the Lua version of the mw.ustring library here. libraryUtil is used by the libraries to check arguments. — Eru·tuon 18:18, 27 June 2018 (UTC)Reply

Text popup

[edit]

Line 464: "(...) กร, ทร, ธร, มร, and หร are pronounced 'gaaw ra', 'thaaw ra', 'maaw ra' and 'haaw ra', respectively." One pronunciation seems missing. Could someone please take a look? Thanks! --PiefPafPier (talk) 12:10, 6 December 2019 (UTC)Reply

Rhymes

[edit]

I add rhyme detection into this module. The rhyme link (R) will be shown next to its IPA. This is the first step for collecting to Rhymes namespace. --Octahedron80 (talk) 07:00, 6 December 2020 (UTC)Reply

Requesting page protection...

[edit]

This is a module used on more than a 1000 pages, It should be protected. ShakespeareFan00 (talk) 12:20, 18 January 2021 (UTC)Reply