Pahawh Hmong

From Wikipedia, the free encyclopedia
Pahawh Hmong
𖬖𖬰𖬝𖬵 𖬄𖬶𖬟 𖬌𖬣𖬵
Semisyllabary (onset-rime; vowel-centered equivalent of an abugida)
Languages Hmong Daw, Hmong Njua
Creator Shong Lue Yang
Time period
Parent systems
  • Pahawh Hmong
    𖬖𖬰𖬝𖬵 𖬄𖬶𖬟 𖬌𖬣𖬵
Direction Left-to-right
ISO 15924 Hmng, 450
Unicode alias
Pahawh Hmong


Final Accepted Script Proposal

Pahawh Hmong (RPA: Phajhauj Hmoob [pʰâ hâu m̥ɔ́ŋ], known also as Ntawv Pahawh, Ntawv Keeb, Ntawv Caub Fab, Ntawv Soob Lwj) is an indigenous semi-syllabic script, invented in 1959 by Shong Lue Yang, to write two Hmong languages, Hmong Daw (Hmoob Dawb White Miao) and Hmong Njua AKA Hmong Leng (Moob Leeg Green Miao).


Pahawh is written left to right. Each syllable is written with two letters, an onset (la, an initial consonant or consonant cluster) and a rime (yu, a vowel, diphthong, or vowel plus final consonant).[1] However, the order of these elements is rime-initial, the opposite of their spoken order. (That is, each syllable would seem to be written right to left, if it were transcribed literally into the Roman alphabet.) This is an indication that Shong conceived of the rimes as primary; Pahawh Hmong might therefore be thought of as a vowel-centered abugida. Tones and many onsets are distinguished by diacritics.

The onset k is not written, so that a rime letter (V) written by itself is read as kV. Nor is the rime au (on mid tone) written, so that an onset letter (C) written by itself is read Cau, except following a bare rime, as otherwise these could be read as a single syllable. The absence of an onset, however, is indicated with a null-onset letter. Again, this is similar to an abugida, but with the roles of consonant and vowel reversed.[2]

For an example of the positional variation, consider the phrase (in RPA orthography) kuv rau tshais rau koj noj "I serve you breakfast". Since the first word, kuv, starts with a k, it is written as the bare rime uv in Pahawh. The word rau, with mid-tone au as the rime, is normally written as a bare onset r, and indeed this is the case for the second instance in this sentence. However, since the first rau follows a bare rime, it cannot be written as a bare onset r, or the combination might be read as ruv rather than kuv rau. Therefore, the combination kuv rau is written uv rau rather than uv r, with the rime au made explicit (Smalley et al. 1990:58).

Orthographic conventions in Pahawh
Written order es e s 0e 0 e áus es# és
Read as [séŋ] [kéŋ] [sau] [éŋ] [au] [kéŋsau] [séŋséŋ] [nyéŋ] [sè’]
Pahawh onsets. Except for the null onset series at lower right, these are consistent for stages 2–4. Row 3 reads l-, dl-, dlh- in Hmong Njua.
Pahawh rimes, stage 2. The tone diacritics are irregular. Note that -v tone is sometimes written with the left-hand rime glyph, sometimes with the right. In stage 3, it is consistently written with the right-hand glyph, and all tones have the diacritics of koo above: [left glyph] -b none, -m dot, -j macron; [right glyph] -v none, - dot, -s macron, -g trema. When used, -d tone takes the left-hand glyphs with a stroke ⟨'⟩ diacritic. The ia and a rows may be read a and aa in Hmong Njua.

Pahawh has twenty onset letters to transcribe sixty phonemic onsets. This is accomplished with two diacritics, a dot and a tack, written above the onset. However, although there is some scattered similarity between the sounds of the resulting forms, there is no overall pattern to the system. For example, the letter for h with a dot is pronounced th, and with a tack is pronounced pl. The null consonant does not take diacritics in Hmong Daw, but does in Hmong Njua, for two onsets, ndl and ndlh, which only occur in Hmong Njua. (Similarly, Daw d and dh, which do not occur in Njua, are used for Njua dl and dlh, which do not occur in Daw.)

The rimes, in contrast, are over-specified. There are thirteen rime sounds, but twenty-six letters to represent them. One of each pair takes four of the eight tones, while the other takes the other four tones. Diacritics (none, dot, macron, and trema) distinguish the tones that each rime letter may carry. One of the tones, written -d in RPA, is not phonemic but is a prosodic unit-final allophone of the creaky register -m. It may be written in Pahawh by changing the dot diacritic to a short stroke, but it is not used by many people.

Shong used the rimes with the values kiab and kab in Hmong Daw for kab and kaab (/káŋ/) in Hmong Njua. However, Cwjmem retains the Daw values for Njua and adds a pipe (|) to the left of kab etc. to write kaab etc.

In addition to phonetic elements, Pahawh Hmong has a minor logographic component, with characters for

  • the numerals 0–10, ×102 (hundreds), ×104 (myriads), ×106 (millions), ×108, ×1010, and ×1012 (billions), though the higher numerals have been dropped leaving a positional decimal system
  • arithmetical signs
  • periods of time: year, season, month, day, date
  • the most common grammatical classifier, lub, which when written out phonetically consists of two very similar letters, and
  • eighteen clan signs. These were never disseminated, but were intended to clarify personal relationships in Hmong refugee camps, where people regularly met strangers of unknown clan. Strict taboos govern the behavior of Hmong men and women from the same clan.

Punctuation is derived from the Roman alphabet, presumably through French or Lao,[3] except for a sign introduced by one of Shong's disciples that replaced Shong's ⟨!⟩, but also includes a native sign for reduplication and a native cantillation mark.

Second and third stage tones

There are two orthographic systems in use for Pahawh Hmong, the second reduced stage from 1965 and the third reduced stage from 1970 (see history, below). Some Hmong communities consider second stage to be more authentic, while others prefer third stage as more regular. It would appear that stage two is more widespread.

The differences are primarily in tone assignment. Bare rimes—that is, rime letters without a tone diacritic—have various values in stage two, but are regularly high tone (-b) or rising tone (-v) in stage three. Likewise, although the pedagogic charts are organized so that each column corresponds to a single tone, the tonic diacritics are scattered about the columns in stage two, but correspond to them in stage three. (Stage 4, which today is only used for shorthand, dispenses with the -v rime letters, replacing them with additional diacritics on the -b rime letters, so that each rime and tone has a single dedicated glyph.)

Tone transcription is that of the Romanized Popular Alphabet.

Tone values of bare rimes
Stage Two Stage Three
keem keeb
kee keev
kim kib
ki kiv
kaum kaub
kau kauv
kum kub
ku kuv
kem keb
kev kev
kaim kaib
kai kaiv
koob koob
koov koov
kawb kawb
kaw kawv
kuam kuab
kua kuav
kom kob
kog kov
kiab kiab
kia kiav
kam kab
kav kav
kwm kwb
kwv kwv
kaam kaab
kaav kaav


Rimes of stage 1
(Hmong Daw values)

Pahawh Hmong was the product of a native messianic movement, based on the idea that, throughout history, God had given the Hmong power through the gift of writing, and revoked it as divine retribution.[citation needed]

In 1959 Shong Lue Yang (Soob Lwj Yaj), a Hmong spiritual leader from Laos, created Pahawh. Yang was not previously literate in any language.[4] An illiterate peasant, Shong claimed to be the Son of God, messiah of the Hmong and Khmu people, and that God had revealed Pahawh to him in 1959, in northern Vietnam near the border with Laos, to restore writing to the Hmong and Khmu people. Over the next twelve years he and his disciples taught it as part of a Hmong cultural revival movement, mostly in Laos after Shong had fled Communist Vietnam. The Khmuic version of the script never caught on, and has disappeared. Shong continually modified the Hmong script, producing four increasingly sophisticated versions, until he was assassinated by Laotian soldiers in 1971 to stop his growing influence as part of the opposition resistance. Knowledge of the later stages of Pahawh come to us through his disciple Chia Koua Vang, who corresponded with Shong in prison.

Onsets of stage 1
(Hmong Daw values)
  • The first stage of Pahawh, Pahawh Pa (Pajhawj Paj), common called the source version, had distinct glyphs for all 60 onsets and 91 rimes of both Hmong Daw and Hmong Njua. Although there were diacritics, there was no relationship between them and the sound values of the letters, and many of the diacritics are unique to a single letter. Among the rimes, there was a strong tendency for letters which differed only in diacritic to share the same vowel and differ in tone. However, this was not absolute. For example, a letter shaped like Ü stood for the rime iaj, while U, differing only in its diacritic, stood for the rime us. Plain U without a diacritic did not occur. Similarly, the letter that, without a diacritic, represents the rime ag, when combined with a diacritic dot represents the onset rh. Thus it can be seen that at this stage the diacritics were integral parts of their letters, with only the beginnings of an independent existence.[citation needed]
Stage 1 was abandoned after Shong revealed the second stage, with only the occasional glyph showing up when people who know it write using other versions. However, it is not considered obsolete, as people remember Shong's instructions to use this source of all later Pahawh as a sacred script.
  • The second stage, Pahawh Njia Dua O (Pajhawj Ntsiab Duas Ob) "second stage reduced version", was the first practical Hmong script. It was taught by Shong in 1965 and is supported today by the Australian Language Institute and Cwjmem (Everson 1999). The consonants are graphically regular, in that each column in the pedagogic charts contains the same diacritic, but are phonetically irregular, in that the diacritics have no consistent meaning. (This situation remained in all later stages.) Tone assignment is irregular, in that the diacritics do not represent specific tones with the rimes any more than they represent specific features with the consonants. For example, the trema sometimes represents the -b tone, sometimes -j, -v, or -g, depending on which rime it is added to. The one exception is the -d "tone", which is actually a prosodic inflection of the -m tone. Shong added a specific diacritic for this when Chia, who was familiar with RPA, asked him how RPA -d should be written, but it was treated as extraneous to the tone system, was not included in the rime charts, and was not always taught to Shong's disciples.
  • The third stage, Pahawh Njia Dua Pe (Pajhawj Ntsiab Duas Peb) "third stage reduced version", introduced in 1970, regularized tone assignment, which was irregular in the second stage. It restores the null onset, which with the addition of diacritics covers Hmong Njua consonants not found in Hmong Daw, that had been found in stage 1, but does not otherwise change the onsets. Chia believes the lack of this series in stage two was merely an oversight on his part in his prison correspondence with Shong (Smalley et al. 1990:70). It was not distributed as widely in Laos as the second stage, due to fear of admitting knowledge of the script after the Communist takeover. Both second and third stage are currently in use in different Hmong communities; however, because the third stage did not appear widely until after Shong's death, there is a suspicion in many communities that it and the fourth stage were invented by Shong's disciples, and therefore are not authentic Pahawh. In the third stage, there is also presence of different signs for month, tens, and zero.
  • The final version, Pahawh Tsa (Pajhawj Txha) "core version", published in 1971 just a month before Shong's death, was a radical simplification with one letter per rime and one diacritic per tone. The onsets were not changed. The only graphic addition was that of three new tone marks, for seven total, but half of the rimes were eliminated: The -b, -m, -d, -j tones are written as in stage 3; the -v, -, -s, -g tones now use the same rime letters as the other tones but with different diacritics: circumflex, underlined dot, underlined stroke, and diaeresis. (The diaeresis is retained from stage 3, so only the rime letter changes for this tone.) Stage 4 is not widely known, but is used as a kind of shorthand by some who do know it; indeed, it may be called "Hmong shorthand" in English.
Number of Pahawh glyphs at each stage
Sounds Stage 1 Stage 2 Stage 3 Stage 4
Rimes 91 91 based
on 26
26 13
Tones 7–8 (3–4 diacritics) 8 (7 diacritics)
Onsets 60 19×3 20×3 20×3

Pahawh is not as widespread as RPA romanization for writing Hmong, partially because of the difficulties in typesetting it, but it is a source of great pride for many Hmong who do not use it, as in Southeast Asia every respectable language has a script of its own, which RPA does not provide.[citation needed] However, for some educated Hmong, Pahawh is considered an embarrassing remnant of a superstitious past (Smalley et al. 1990:165).

Chao Fa (means "Lord of the Sky" in Lao, Hmong: Cob Fab ChaoFaPahawh.png[5]), an anti-Laotian government Hmong group, uses this writing system.[4]


Because Shong was illiterate, it is sometimes assumed that he invented Pahawh ex nihilo. However, Shong was acutely aware of writing and of the advantages that it provided; indeed, that was the basis of his messianic movement. It would appear that existing scripts provided his inspiration, even if he did not fully understand them, much as the Roman alphabet inspired the illiterate Sequoyah when he invented the Cherokee script, in a process called trans-cultural diffusion. Not only do the forms of the majority of the letters in the oldest stage of Pahawh closely resemble the letters of the local Lao alphabet and missionary scripts such as Pollard and Fraser, though they are independent in sound value (much like the relationship between Roman and Cherokee), but the appearance of vowel and tone diacritics in those scripts, which would appear nearly random to the illiterate, may explain the idiosyncratic use of diacritics in early Pahawh. Nevertheless, even if the graphic forms of Pahawh letters derive from other scripts, much of the typology of the script, with its primary rimes and secondary onsets, would appear to be Shong's invention.

The later stages of Pahawh became typologically more like Lao and the Roman alphabet, suggesting that perhaps they influenced its evolution. However, even from the start, Pahawh is "fascinatingly similar [...] and fascinatingly different" from the Lao alphabet (Smalley et al. 1990:90). For example, it resembles an abugida such as Lao where the order of writing does not reflect the order of speech, but with the roles of consonant and vowel reversed. There is an inherent vowel, as in Lao, though only on one tone, but also an inherent consonant. In Lao, tone depends on the consonant; it is modified with diacritics, but the patterns of modification are complex. In early Pahawh, tone depends on the rime and is modified with irregular diacritics. Starting with stage 2, there are two tone-classes of rime, just as in Lao there are two tone-classes of consonant.

Nearly all other scripts invented by illiterates are syllabaries like Cherokee. However, to represent Hmong as a syllabary, Pahawh would have needed 60×91 = 5460 letters. By breaking each syllable in two in the fashion of Chinese phonetics, Shong was able to write Hmong, in his original version, with a mere 60+91 = 151 letters.


The Pahawh Hmong alphabet was added to the Unicode Standard in June 2014 with the release of version 7.0.

The Unicode block for Pahawh Hmong is U+16B00–U+16B8F:

Pahawh Hmong[1][2]
Official Unicode Consortium code chart (PDF)
  0 1 2 3 4 5 6 7 8 9 A B C D E F
U+16B0x 𖬀 𖬁 𖬂 𖬃 𖬄 𖬅 𖬆 𖬇 𖬈 𖬉 𖬊 𖬋 𖬌 𖬍 𖬎 𖬏
U+16B1x 𖬐 𖬑 𖬒 𖬓 𖬔 𖬕 𖬖 𖬗 𖬘 𖬙 𖬚 𖬛 𖬜 𖬝 𖬞 𖬟
U+16B2x 𖬠 𖬡 𖬢 𖬣 𖬤 𖬥 𖬦 𖬧 𖬨 𖬩 𖬪 𖬫 𖬬 𖬭 𖬮 𖬯
U+16B3x 𖬰 𖬱 𖬲 𖬳 𖬴 𖬵 𖬶 𖬷 𖬸 𖬹 𖬺 𖬻 𖬼 𖬽 𖬾 𖬿
U+16B4x 𖭀 𖭁 𖭂 𖭃 𖭄 𖭅
U+16B5x 𖭐 𖭑 𖭒 𖭓 𖭔 𖭕 𖭖 𖭗 𖭘 𖭙 𖭛 𖭜 𖭝 𖭞 𖭟
U+16B6x 𖭠 𖭡 𖭣 𖭤 𖭥 𖭦 𖭧 𖭨 𖭩 𖭪 𖭫 𖭬 𖭭 𖭮 𖭯
U+16B7x 𖭰 𖭱 𖭲 𖭳 𖭴 𖭵 𖭶 𖭷 𖭽 𖭾 𖭿
U+16B8x 𖮀 𖮁 𖮂 𖮃 𖮄 𖮅 𖮆 𖮇 𖮈 𖮉 𖮊 𖮋 𖮌 𖮍 𖮎 𖮏
1.^ As of Unicode version 10.0
2.^ Grey areas indicate non-assigned code points


  1. ^ The only final consonant is [ŋ]. However, this is sometimes analyzed as vowel nasalization: oo, ee, aa /ɔŋ, ɛŋ, aŋ/ or /ɔ̃, ɛ̃, ã/. Thus Pahawh can be considered an alphabet where both consonants and vowels may be complex, much like x [ks] and i [aɪ] in English.
  2. ^ Coincidentally, RPA also has an inherent consonant which is not written, glottal stop, along with a null-consonant diacritic, the apostrophe: ai /ʔāi/, ’ai /āi/. Few Hmong words are vowel initial, so writing the absence of a consonant with an apostrophe is more economical than writing glottal stop with an apostrophe.
  3. ^ Punctuation was introduced in 1969 when Chia Koua Vang, who was literate in RPA, wrote to Shong in prison asking him what to put at the ends of sentences. Neither Shong nor any of his other disciples were literate in any other script at this point, so presumably someone taught him punctuation there. (Smalley et al. 1990:76–77)
  4. ^ a b Fadiman, Anne. "Note on Hmong Orthography, Pronunciation, and Quotations." The Spirit Catches You and You Fall Down. Farrar, Straus and Giroux. 1997. 291.
  5. ^ Smalley, William Allen, Chia Koua Vang (Txiaj Kuam Vaj ChiaKouaVangPahawh.png), and Gnia Yee Yang (Nyiaj Yig Yaj GniaYeeYangPahawh.png). Mother of Writing: The Origin and Development of a Hmong Messianic Script. University of Chicago Press, March 23, 1990. 10. Retrieved from Google Books on March 23, 2012. ISBN 0226762866, 9780226762869.


External links

  • Omniglot
Retrieved from ""
This content was retrieved from Wikipedia :
This page is based on the copyrighted Wikipedia article "Pahawh Hmong"; it is used under the Creative Commons Attribution-ShareAlike 3.0 Unported License (CC-BY-SA). You may redistribute it, verbatim or modified, providing that you comply with the terms of the CC-BY-SA