Skip navigation

Complete guide

Thai pronunciation —
tones, sounds and romanization

Thai pronunciation presents three challenges for English speakers: the five tones, the aspirated sounds we're not used to, and a romanization system you need to use consistently. This guide explains everything.

Section 1

How Thai pronunciation differs from English

Thai pronunciation differs from English on three fundamental points. All of them can be confusing at first — but they are all systematic and learnable.

First: pitch determines meaning. In English we use intonation for feeling and emphasis. In Thai, pitch is a property of individual words. The word maa can have five meanings depending on the tone. Say it on the wrong tone, and you're literally saying a different word.

Second: aspiration. Thai makes a strict distinction between aspirated sounds (kh, ph, th) and unaspirated sounds (k, p, t). In Thai, that distinction is meaning-differentiating.

Third: no final consonant clusters. Thai always ends a syllable on a single consonant or a vowel. Thai final consonants are also pronounced differently — they are not fully released but cut off.

Section 2

The five tones

Thai has five tones. For English speakers, the high tone and rising tone are the trickiest — our brains interpret a high voice as emotion, not as part of a word. The key is to treat your voice as an instrument, not as an expression of feeling.

Mid tone

a (no mark)

horizontal →

Flat, in the middle of your voice

มา maa = to come

Low tone

à

low, flat →

Low and flat, bottom of your range

ม่า màa = widow

Falling tone

â

downward ↘

Starts high, drops sharply

ข้าว khâao = rice

High tone

á

high, flat →

High and tense, top of your range

ม้า máa = horse

Rising tone

ǎ

upward ↗

Starts low-mid, rises at the end

หมา mǎa = dog

Common mistake: confusing tones with emotion

Beginners sometimes speak softer or louder to imitate a tone. But in Thai it's exclusively about pitch, not volume. Use Pasaa's tone analysis to get real feedback on your pitch.

Section 3

Aspiration: kh, ph, th versus k, p, t

Thai makes a fundamental distinction: aspirated versus unaspirated voiceless stops. In Thai that distinction is meaning-differentiating: กา (kaa = crow) versus ขา (khǎa = leg).

Practice tip: Hold a piece of paper in front of your mouth. For aspirated sounds (kh, ph, th) the paper should move. For unaspirated sounds (k, p, t) it doesn't move.

k vs kh

k — กา (kaa) = crow
kh — ขา (khǎa) = leg

p vs ph

p — ปาก (bpàak) = mouth
ph — พ่อ (phâw) = father

t vs th

t — ตา (dtaa) = eye
th — ถาม (thǎam) = to ask

In addition to kh, ph and th, Thai also has two special stops: dt (halfway between d and t) and bp (halfway between b and p). With practice you'll learn to hear and produce the difference.

Section 4

Paiboon+ romanization

Pasaa uses Paiboon+: a consistent romanization system with tone markers for every tone. Every word has a fixed spelling.

The 5 tone markers in Paiboon+

a

Mid tone

มา maa = to come, กา kaa = crow

à

Low tone

ข่า khàa = galangal, ผ่า phàa = to cut

â

Falling tone

ข้าว khâao = rice, ห้า hâa = five

á

High tone

ม้า máa = horse, น้ำ náam = water

ǎ

Rising tone

หมา mǎa = dog, ถาม thǎam = to ask

A complete overview can be found in the Paiboon+ romanization guide.

Section 5

Tone sandhi: tones in connected speech

In isolated pronunciation, every word sounds as the dictionary describes. In real connected speech, tones sometimes change. This phenomenon is called tone sandhi.

As a beginner you don't need to actively learn tone sandhi, but you need to recognize it when you hear it. In Pasaa you build this up gradually: Phase 1 uses audio at 75% speed without sandhi effects. Phases 2 and 3 use normal native audio.

More about tone sandhi in the article Thai tone sandhi.

Section 6

How tone analysis works in Pasaa

Pasaa uses real-time tone analysis via your microphone. We measure the pitch and compare it to the expected tone contour of the word. Since every voice is different, Pasaa automatically calibrates to your voice register.

After each speaking moment, Pasaa gives visual feedback: a tone contour of what you produced versus what was expected. This lets you see whether you started too low, dropped too early, or the rise wasn't steep enough.

1

Microphone

Pasaa listens via your microphone. No special hardware needed.

2

Pitch detection

Real-time analysis of your fundamental pitch per syllable.

3

Calibration

Automatically adjusted to your voice register for accurate feedback.

Ready to get started?

11 lessons free.
No credit card.

Tones, sounds & basic grammar free · Upgrade when you're ready

Start for free →