Thokit
桃橘(THOKIT,Tong-uán Hokkien Orthography toolKIT)
Install / Use
/learn @TongUanLab/ThokitREADME
THOKIT / 桃橘
「桃橘」(THOKIT,Tong-uán Hokkien Orthography toolKIT)是東苑實驗室个閩南方言拼寫方案處理工具套件。
「桃橘」漢字號名取漳州話 Thô-kit,毋過若讀做泉廈腔 Thô-kiat 嘛解使着。
支持个編程語言
- Python
- JavaScript
主要功能
- 臺羅/白話字个數字式(ASCII)/閏符式(Unicode)互轉
- 臺羅數字式/白話字數字式互轉
- 臺羅閏符式/白話字閏符式互轉
- 支持全大寫處理
- 支持加種白話字標準
- 支持自定義羅馬字調符
開始
Python
from thokit import ThoKit
thokit = ThoKit()
print(thokit.tailoAscii2Unicode('Sann1 TE2 khoo3 khuah4; lang5 lau6 phinn7 tit8. Hann9?'))
print(thokit.pojAscii2Unicode('SANN te2 khoo3 khuah; lang5 lau6 phinn7 tit8. Hann9?'))
HTML
<script type="module">
import { ThoKit } from '../../thokit.js';
const thokit = new ThoKit();
console.log(thokit.tailoAscii2Unicode('Sann1 TE2 khoo3 khuah4; lang5 lau6 phinn7 tit8. Hann9?'))
console.log(thokit.pojAscii2Unicode('SANN te2 khoo3 khuah; lang5 lau6 phinn7 tit8. Hann9?'))
</script>
Node.js
npm install
npm run build
const { ThoKit } = require('./dist/thokit.cjs');
const thokit = new ThoKit();
console.log(thokit.tailoAscii2Unicode('Sann1 TE2 khoo3 khuah4; lang5 lau6 phinn7 tit8. Hann9?'))
console.log(thokit.pojAscii2Unicode('SANN te2 khoo3 khuah; lang5 lau6 phinn7 tit8. Hann9?'))
測試
Python
python test/py/oj.py
HTML
試用 test/html/demo.html(着注意 thokit.js 个導入),或者「韻彙」網站搭个頁面。
Node.js
node test/node/oj.js
白話字標準說明
THOKIT 今支持下底即幾若款白話字標準,字母佮標調規則小可有縒。
默認標準
大致照信望愛台語客語輸入法(FHL Taigi-Hakka IME)白話字个拼寫方式。
TL => default POJ
ts/tsh=>ch/chhua/ue=>oa/oeing/ik=>eng/eknn=>ⁿNN(nn个大寫) =>ᴺoo=>o͘nnh=>ⁿha̋ (a9)=>ă
佮信望愛輸入法个主要差異:
auh8/aunnh8:a̍uh/a̍uⁿhta̍uh-ta̍uh-á(o)tau̍h-tau̍h-á(x)
ere佮irinn(老泉腔)調符放咧央元音
甘爲霖標準('campbell')
照甘爲霖《廈門音新字典》(1913)个拼寫方式佮標調規則。
TL => Campbell POJ
ch/chh:tsh=>chh- ⚠️
tsa, tso, tsu=>tsa, tso, tsu tsi, tse=>chi, che
ua/ue=>oa/oeing/ik=>eng/eknn=>ⁿ- ⚠️
nnh=>hⁿ oo=>o͘onn=>o͘ⁿmoo/ngoo=>mo͘/ngo͘noo=>no͘ⁿoa/oe:頭前接聲母,後壁無接元音或者塞音韻尾,標o;若無標a/e- ⚠️
uā, uē, uānn=>oā, oē, oāⁿ tuā, tuē, tuānn=>tōa, tōe, tōaⁿua̍h, hua̍h, uāi, uān,huāi, huān=>oa̍h, hoa̍h, oāi, oān, hoāi, hoān
- ⚠️
- ⚠️
a̍innh=>ai̍hⁿ
杜嘉德標準(douglas)
照杜嘉德《廈英大辭典》(1873)个拼寫方式佮標調規則。
TL => Douglas POJ
ch/chh:tsh=>chh- ⚠️
tsa, tso, tsu=>tsa, tso, tsu tsi, tse=>chi, che
ua/ue=>oa/oeing/ik=>eng/eknn=>ⁿ- ⚠️
nnh=>hⁿ oo=>o͘onn=>o͘ⁿ- ✅
ir, er, ee=>ü, ö, ɛ oa/oe:oa*標o;oe*標o- ⚠️
uā, ua̍h, uānn, tuā, tuānn=>oā, oa̍h, oāⁿ, toā, toāⁿ - ⚠️
uē, tuē=>ōe, o̍eh, tōe
- ⚠️
- ⚠️
ainn頭前若無字母,標i
授權協議
MIT
Related Skills
docs-writer
99.2k`docs-writer` skill instructions As an expert technical writer and editor for the Gemini CLI project, you produce accurate, clear, and consistent documentation. When asked to write, edit, or revie
model-usage
337.1kUse CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
arscontexta
2.9kClaude Code plugin that generates individualized knowledge systems from conversation. You describe how you think and work, have a conversation and get a complete second brain as markdown files you own.
zola-ai
An autonomous Solana wallet agent that executes payments via Twitter mentions and an in-app dashboard, powered by Claude.
