Pinyin Pronunciation Guide — Initials, Finals, Tones
The complete pinyin pronunciation guide with audio examples. The foundation for everything else in Mandarin.
What is pinyin?
Pinyin is the romanization system for Mandarin Chinese — the way you spell Chinese sounds with the Latin alphabet. Every Chinese character has a pinyin syllable that tells you how to pronounce it, plus a tone mark (ā, á, ǎ, à) for the pitch contour.
你好 → nǐ hǎo
Two parts make up each syllable: an initial (the starting consonant, optional) and a final (the vowel-based ending, required).
The four tones — quick overview
Same syllable, different tone, different word. The four tones plus a neutral fifth:
| mā妈 | mother (high level) |
| má麻 | hemp (rising) |
| mǎ马 | horse (low dipping) |
| mà骂 | scold (falling) |
| ma吗 | question particle (neutral) |
Each tone in detail with tone-change rules: The four tones, explained.
Initials — the starting consonants
21 initials, grouped by where in the mouth they're produced:
Bilabial & labiodental
| b | unaspirated — like "p" in "spy" |
| p | aspirated — like "p" in "pie" |
| m | like English "m" |
| f | like English "f" |
Alveolar (tongue tip behind the teeth)
| d | unaspirated — like "t" in "stop" |
| t | aspirated — like "t" in "top" |
| n | like English "n" |
| l | like English "l" |
Retroflex (tongue curled back)
| zh | like "j" in "judge" but tongue curled |
| ch | aspirated zh — like "ch" in "church" |
| sh | like "sh" in "shoe" with curled tongue |
| r | between English "r" and "zh" — tongue curled |
Alveolopalatal (tongue forward, against hard palate)
| j | like "j" in "jeep" — softer, tongue forward |
| q | aspirated j — like "ch" in "cheek" |
| x | like "sh" in "she" — softer, tongue forward |
Dental sibilants (tongue against upper teeth)
| z | like "ds" in "kids" |
| c | aspirated z — like "ts" in "cats" |
| s | like English "s" |
Velar (back of tongue)
| g | unaspirated — like "g" in "go" |
| k | aspirated — like "k" in "kick" |
| h | like English "h" but slightly harsher in the throat |
Finals — the vowels and endings
Simple finals
| a | "ah" — like "father" |
| o | "oh" — like "more" |
| e | "uh" — like "the" |
| i | "ee" — like "see" (after z/c/s/zh/ch/sh/r it's a buzzed "uh") |
| u | "oo" — like "boot" |
| ü | round your lips for "oo" and try to say "ee" — no English equivalent |
Compound finals
| ai | "eye" |
| ei | "ay" |
| ao | "ow" |
| ou | "oh" |
| ia | "yah" |
| ie | "yeh" |
| ua | "wah" |
| uo | "woh" |
| üe | ü + "eh" |
Nasal finals
| an | "ahn" |
| en | "un" |
| in | "een" |
| ang | "ahng" |
| eng | "ung" |
| ing | "eeng" |
| ong | "oong" |
The tricky sounds for English speakers
ü vs u
Round your lips like you're saying "oo" but try to say "ee" — that's ü. It's a single distinct vowel, not "u + e". When ü follows j, q, x, or y, the umlaut drops in spelling (ju, qu, xu, yu) — but it's still ü, not u. So 居 (jū) sounds like "jü", not "joo".
q, j, x — palatal, not English ch/j/sh
Tongue forward, against the hard palate. j ≈ soft "j" in "jeep"; q ≈ "ch" in "cheek" (aspirated); x ≈ "sh" in "she" (soft). Different family from zh/ch/sh.
zh, ch, sh, r — retroflex
Tongue curls back so the tip points toward the roof of the mouth. English speakers often default to alveolar — which turns these into j/ch/sh, a totally different set. Practice in a mirror: the tongue should look visibly curled.
c is "ts", not "k"
Pinyin c is "ts" as in "cats". So 才 (cái) is "tsai", not "kai". The "k" sound is written k.
ian sounds like "yen"
Even though it's spelled with "a", -ian endings come out as "yen": 钱 (qián) = "chyen", 见 (jiàn) = "jyen".
Related
Hear it, then say it
Reading the chart is one thing — getting the retroflex curl or the ü right takes hearing and producing the sound. Kango drills pinyin pronunciation with real-time feedback, syllable by syllable.
Download Kango on iOS