Pinyin2hanzi

End-to-end translation of Chinese phonetics to characters using bi-directional RNN (LSTM/GRU)

Generate Convert Improve

Install / Use

/learn @taiqing/Pinyin2hanzi

About this skill

Quality Score

0/100

README

Bi-directional Gated RNN for End-to-End Translation of Chinese Phonetic Letters to Characters

Architecture

The input is a sequence of romanized phonetic letters, and the output is a sequence of Chinese characters. For example, the input is womenyouxinxinyingdezhecibisai, and the output should be 我们有信心赢得这次比赛("we have confidence to win the game").

Bi-directional Gated RNN for End-to-End Translation of Chinese Phonetic Letters to Characters

Pipeline

setup_corpus.py --> handle_rare_words.py --> label_corpus.py --> align_corpus.py --> calc_vocab.py & format_dataset.py --> pinyin_to_hanzi_bigru.py

Experiments

Error rate is measured by the number of edits divided by the number of Chinese characters, averaged for all the testing sentences.

The models were trained on 67,000 (phonetic letters, Chinese character sentence) pairs.

| | #test sentences | #hidden nodes | error rate | |----------------|-----------------|---------------|------------| | single-way GRU | 1,687 | 256 | 33.38% | | Bi-GRU | 1,687 | 256 | 13.72% |

Examples

input: womenyouxinxinyingdezhecibisai

output: 我们有信心赢得这次比赛

input: youyujingyanbuzu

output: 由于经验不足

input: tebieshizuijin_nianlai

output: 特别是最近_年来

Dataset

You can download the dataset from this link: https://www.dropbox.com/s/fzahr4euzu8crvz/dataset.zip?dl=0

Related Skills

node-connect

352.9k

Diagnose OpenClaw node connection and pairing failures for Android, iOS, and macOS companion apps

frontend-design

111.5k

Create distinctive, production-grade frontend interfaces with high design quality. Use this skill when the user asks to build web components, pages, or applications. Generates creative, polished code that avoids generic AI aesthetics.

openai-whisper-api

352.9k

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

qqbot-media

352.9k

QQBot 富媒体收发能力。使用 <qqmedia> 标签，系统根据文件扩展名自动识别类型（图片/语音/视频/文件）。