A native-speaker validation form for the project's Nepali grapheme-to-phoneme (G2P) gold lexicon. The lexicon underpins every TTS evaluation we publish.
The frontend produces candidate pronunciations from rules and from a
source lexicon (Google's language-resources/ne, CC BY 4.0).
Native speakers and linguists validate those candidates one word at a
time. Validated entries become the project's gold lexicon.
Typing IPA in a browser is hard. The project's labels (ax,
tx, dz, etc.) are ASCII, easy to type, and
unambiguously map to a single IPA equivalent. The
cheatsheet has the full table.
The default mode is spoken_nepali. Per the primary
references (Khatiwada 2009, Regmi 2025):
s in spoken Nepali.n.i; ू (long ū) reads as u
— spoken Nepali has no length contrast.If you think a specific word needs the careful Sanskritized pronunciation, please note that in the comments field; don't try to encode it in default-mode phones.
gold_lexicon.tsv.Any phone string that doesn't validate (e.g., a typo'd label) is flagged for follow-up before promotion. Nothing reaches the gold without passing the inventory check.
reviewer = <name>.The sources behind the v1.0 phone inventory and rules:
Questions about a specific entry, the workflow, or wanting to contribute: hello@ampixa.com.
© 2026 AMPIXA LABS Pvt. Ltd. — ampixa.com