Y&#x012b;n & Y&aacute;ng and the I Ching

Although Chinese characters are originally and basically ideographic, writing whole words, the language over time has become more polysyllabic and many characters now do not occur in isolation. The system thus can be said to have become morphographic, writing semantic elements of words, morphemes, rather than ideas or words as wholes. [note]

Copyright (c) 1997, 2002, 2003, 2006, 2007, 2008, 2009, 2011 Kelley L. Ross, Ph.D. All Rights Reserved

Yīn & Yáng and the I Ching, Note

As it happens, there is a conspicuous mountain north-east of Los Angeles Valley College. Indeed, there is a whole mountain range, the San Gabriel Mountains. Beyond the lower Verdugo Mountains in the foreground, which rise to 3126 feet, there is the conspicuous Mt. Lukens in the San Gabriels, which is 5074 feet high. Behind Mt. Lukens runs Big Tujunga Canyon. There are much higher peaks in the San Gabriels (up to Mt. San Antonio, "Old Baldy," at 10,064 ft., which is east and outside of the image provided here), as can be seen in the image, but these are hidden from the perspective of Valley College. Unfortunately, there are no Buddhist temples, as far as I know, upon Mt. Lukens. Los Angeles could use the protection.

Return to text

Categories of Chinese Characters

Chinese characters are the last ancient ideographic writing system that survives in modern usage. This was a close call. In Vietnamese, the Latin alphabet is used; in Korean, the Hangŭl phonetic system is now used. Japanese has its own syllabaries, the kana, which could easily replace characters altogether, as in the past they sometimes did. Both China and Japan were contemplating a transition to the Latin alphabet (the Pinyin system prepared the way for this in Chinese). Ironically, it is the most modern technology which has saved the most ancient writing. Computer assisted writing makes the use of characters relatively convenient, and the need for vast metal fonts for printing and even typewriting has now been eliminated.

The characters and their definitions here are from Mathews' Chinese-English Dictionary [Harvard, 1972]. The pronunciation of each character, however, is rendered in Pinyin. There are, understandably, disputes over the classification system and over the assignment of individual characters. For instance, the very first example, dá, "big," is from the drawing of a man, and so can be considered "pictographic"; but since it doesn't mean "man," but "big," it might be considered "indicative" instead.

Pictographic: These are characters that originate with pictures of the objects in question. In the Shang Dyansty, these counted for 23% of all characters. By the Han they were down to only 4%, and during the Sung only 3%. The characters at right were all originally little pictures. "Great" was the picture of a man, while "mountain," "field," "woman," "horse," "shield," and "tree" were just that.
John DeFrancis [The Chinese Language, Fact and Fantasy, University of Hawaii Press, 1984, 1986, & Visible Speech, University of Hawaii Press, 1989], one of the greatest scholars of Chinese, has the view that language (or meaning) is essentially spoken (i.e. sound) and that pictograms really stand for the words rather than for the things. However, it seems the most natural to say that a picture of a man, a woman, or a tree simply represents those things directly. While all writing systems, including Chinese, develop phonetic elements, the thesis that meaning is essentially sound is destroyed by the use of sign language among the profoundly deaf, for whom language and meaning have no aural component at all. At one time, it was not believed that the profoundly deaf had any true language, just because sign language was not taken seriously; but this view is now insupportable. Indeed, from Plato we already have the observation that the deaf sign and that this is a logical accommodation to that condition:
SOCRATES: Answer me this: If we had no voice [φωνή, phoné] or tongue [γλῶττα, glôtta], and wished to make things [πράγματα, prágmata] known to one another, should we not try, as mute [and deaf] people [ἐνεοί, eneoí; singular ἐνεός, eneós] actually do, to make signs [σημαίνειν, sêmaínein] with our hands and head and body generally? ["Cratylus," 433 E, Cratylus, Parmenides, Greater Hippias, Lesser Hippias, translated by F.N. Fowler, Loeb Classical Library, Harvard, 1926, 1963, p.133; translation modified]

Sign languages are known to develop and exist with no connection to spoken language, and the form of signs has its own dynamic, unrelated to sounds. Thus, even as a Chinese character is classified by radical and phonetic, a sign can be specified by [1] the shape of the hand(s), [2] position(s), [3] orientation(s), and [4] motion(s) (if any).
Simple Indicative or Ideographic: Some abstract concepts can be suggested with certain diagrams, like simple lines for "one," "two," and "three." At right, we also have "under," "above," and "middle," all of which bear some relation, as diagrams, to the meaning. In the Shang Dynasty, only 2% of characters were like this. By the Han and Sung, it was down to only 1%. So these kinds of characters may be frequently used, but there aren't many of them.
Compound Indicative or Logical Aggregates: Multiple examples of the first two kinds of characters can be combined to suggest something semantically related to the original meanings. So at right, we see "sun" and "moon" combined to mean "bright," "light," or even "cleanse." Three "fields" can be combined to mean "fields divided by dikes." A "woman" under a "roof" means "quiet," "peace," "tranquility." Two "women" means "handsome" or "pretty," and also "cunning." This negative (misogynistic) suggestion emerges fully with three "women," which means "adultery," "fornication," "licentiousness," "debauch," "ravish." Two "trees" get us "forest," and three are "luxuriant," "overgrown," "dark." Three "stones" is "heap of stone, boulders." Note that there are altenative, radical and phonetic versions, given with the lei (boulders) and jiao (handsome) characters. In the Shang Dynasty, 41% of the characters were of this compound indicative type. In the Han it was 13%, and in the Sung only 3%. It is sometimes said that the Chinese character for "trouble" shows two women under one roof. Such a character is possible, and would look like this , but there actually is no such Chinese character, though I understand that the myth lives on the internet. Meanwhile, the character , which is a pig under a roof, means "a house, family, home, relatives," or a member of a class or school. We can imagine that this goes back to the conditions of rural life where people and farm animals might share the same dwelling, even as pork is still a conspicuous part of traditional Chinese cooking.
The most common Chinese characters are of the Radical and Phonetic or Phonetic Complex form. These combine other characters either side by side or above and below. The constituent character called the "Radical" gives some clue about the meaning and, more importantly, is the basis for the listing of the character in Chinese dictionaries (where 214 traditional Radicals are used). The constituent character called the "Phonetic" gives some clue about the pronunciation, which is usually similar to that of the original character. In the Shang Dynasty, only 34% of characters (or 334 actual characters) were of this type. By the Han Dynasty, it was up to 82% (or 7697), the Sung up to 93% (21,810), and in the Ch'ing radical and phonetic characters were 97% (or 47,141) of the total. Clearly, this device becomes the most productive way of generating new characters in Chinese. It is also unique among Old World ideographic writing systems. Nothing similar is seen in Egyptian hieroglyphics, for instance, where the phonology of a word is indicated by writing extra, purely phonetic, glyphs. The exception, however, is in the New World, where Mayan glyphs, recently deciphered, include both ideographic and phonetic elements, just like Chinese characters. Mayan glyphs, however, fully specify the phonology (according to the current understanding), not just suggest it, as with the Chinese.
In the diagram at right, the basic phonetic value of "horse" (mǎ) turns up in the purely phonetic interrogative particle, and in a word for "mother." The character for "to tie, bind" occurred as a phonetic in the alternative character given above for "heap of stone/boulders" (lei). The "fields" compound character above (lei again) occurs as a phonetic with the character for "stone" to mean "roll stones down hill." "Shield" (gan) occurs with "sun" in "sunset," with "woman" in "crafty," villainous," "false," and with "tree" in "shaft of a spear," "pole." "Middle" occurs with the radical "heart," zhong, to mean "conscientious," "loyal," "honest," etc. It is these characters that provide some of the evidence for the reconstruction of the pronunciation of earlier forms of Chinese.
Since radical and phonetic characters already exist in the Shang Dynasty, there clearly was a long period of development prior to this. But the evidence for this is scant, and the ultimate origin of Chinese characters is unclear.

"Mandarin" is a word from Sanskrit (, mantrin) by way of Malay (menteri) and Portuguese (mandarim). This meant "counselor." The word was applied because the Portuguese were originally dealing with traders along the southern coast of China, where, of course, many languages were spoken, but not Mandarin. When officials from the Capital came down to deal with the Portuguese, they spoke a different language, which the Portuguese had not otherwise encountered. Hence the name, the language of the "counselors". However, this may also have simply been a translation of what the counselors were calling their own language, which was the , "Official Language," or even "Language of the Officials," i.e. the Mandarins [note].

Copyright (c) 2000, 2005, 2006, 2010, 2013, 2015, 2016, 2017, 2019 Kelley L. Ross, Ph.D. All Rights Reserved

Categories of Chinese Characters, Note;
Ideograms vs. "Logograms"

Since Chinese characters originally wrote whole words, it is now fashionable to say that they are "logograms" (logos = "word") rather than "ideograms." On this view, Chinese characters (or the units of any such writing system) have no meaning apart from the words of Chinese. They are derivative of the words and are semantically, functionally, and even ontologically dependent on them. The notion that the characters could exist independently of the words, or of the Chinese language, is incomprehensible.

As noted, this is already rather behind the development of Chinese, where characters usually write morphemes. However, the principal reason for the change in terms is ideological rather than linguistic. Because of the influence of Ludwig Wittgenstein and Ferdinand de Saussure, the view has grown that language is a self-contained and self-referential system, without connection to the external world or to truth. Because of this, the notion that there are "ideas" or concepts that exist independently of language and embody meanings with a real relationship to the world has fallen into disfavor. So "ideogram" must go.

Unfortunately, those who are at pains to demonstrate their adherence to fashionable opinion have missed the point. The issue is not whether ideas or truth exist, but whether a writing system like Chinese characters directly matches up with spoken language. It doesn't. This is the most conspicuous in something like Ancient Egyptian hieroglyphics, where certain glyphs are "generic determinatives," which correspond to no words in the language but give a clue as to the general meaning of the word being written. As it happens, Chinese has something rather like generic determinatives, i.e. the "radical" which is that part of the character that gives a clue to the meaning and functions as the basis of classifying characters in a Chinese dictionary. These visual elements of the written language do a job where the written language may not fully represent the sounds of spoken language, which is what happens in Egyptian or Chinese. The written language does it in its own way, and so takes on a life of its own.

Since the fashionable view is that language is self-referential, we might wonder why opinion could not move over to the view that written language breaks away from the spoken language and takes on a self-contained life of its own. Clinging to the notion that written language refers to spoken language would seem to contradict part of the fashionable thesis, that there is no external reference. Indeed. But the move does not take place, perhaps because the connection of the written to the spoken language is too obvious (though one might think that their connection to the world would then be equally obvious, which it isn't to the bien pensants), but perhaps even more so because of an old prejudice that language can only exist as spoken language. This latter assertion is actually made by John DeFrancis in the work cited in the text above -- and reconfirmed to me in personal correspondence.

The notion that language can only truly consist of sounds is refuted by the existence of fully functioning sign languages among the profoundly deaf. Indeed, there are now cases where deaf children, with no previous contact with other deaf individuals, have been introduced together into new schools for the deaf and have spontaneously and quickly developed a completely new sign language between themselves. In the past, the possibility that sign languages could be the equivalent of spoken language was simply not believed, and even educators of the deaf thought that signs could properly only be used to spell the words of spoken languages. Word of the existence of true, semantically complete sign languages of the deaf has apparently still not reached everyone.

The truth is that visual (whether written or sign) and spoken languages match up to each other by way of meaning. There are ideas, concepts, and reference. That is why languages can be translated into each other -- though, indeed, there are philosophers, like W.V.O. Quine, in the self-referential tradition, who openly assert the "indeterminacy of translation," as though this were not contradicted by centuries of actual translating. The existence of meaning has been ably demonstrated by Jerrold Katz. Thus, Chinese characters, which write ideas, as spoken language speaks them (with, we might say, "ideophones" -- sounds that speak ideas), are ideograms. Since they historically correspond to Chinese words or morphemes, they can also be called logograms or morphograms. Since they often originally consisted of pictures of objects, they can also be called "pictograms," a term also in fashionable disfavor. If there are pictures of objects, after all, we might need to admit that there are objects, and that language has something to do with them. It is a shame when something so obvious becomes shocking to educated opinion.

Why there is now this ideological preference is a good question. Such theories, however, are conformable to the "deconstructionist" or "post-modern" view that everything is a matter of power relationships -- something about equally inspired by Marx and by Nietzsche -- and unrelated to any actual truth or reality, except a political reality. People writing about Chinese characters may not be aware of all the connections of the theories they promote, but it is usually the academic water within which they swim.

Return to text

The Dialects of Chinese

What are usually called the "dialects" of Chinese are really separate languages, all descended from the Chinese of the T'ang Dynasty. They are all about as far apart from each other now as English and Dutch. However, they are all written with the same characters (with some exceptions), which means that an educated person can understand (mostly) their written forms, and for cultural and political reasons, as well as their historical origin, are regarded by the Chinese as part of the same language. A new term has even been introduced for this unusual situation, calling the languages "topolects," i.e. speech of the "place," τόπος, topos. A Chinese equivalent term, , "speech of the place," not only is the official term for "dialect," but it is officially used for all the Chinese languages, whether they are actually languages or dialects.

The picture of the languages has changed somewhat over the years. Older sources (e.g. John DeFrancis, The Chinese Language, Fact and Fantasy, Hawaii, 1984; S. Robert Ramsey, The Languages of China, Princeton, 1987; and Nathan Sivin, editor, The Contemporary Atlas of China, Houghton Mifflin, 1988) say that there are seven different languages, or six, since sometimes Gan is linked with Hakka, or with Xiang. More recently, Lynn Pan, in The Encyclopedia of the Chinese Overseas [Harvard, 1999], lists ten languages, where Jin is separated from Mandarin, Hui from Wu, and Pinghua from Yue. Now, however, in The Sino-Tibetan Languages, edited by Graham Thurgood and Randy J. LaPolla [Routledge Language Family Series, Routledge, London, 2003], Jerry Norman ("The Chinese Dialects: Phonology") states, "If one takes mutual intelligibility as the criterion for defining the difference between dialect and language, then one would have to recognize not eight [or seven, etc.] but hundreds of 'languages' in China" [p.72]. This appears to resolve the issue. What previously were regarded as separate languages, like Cantonese, are in fact families of languages. It is therefore not surprising that the "splitters" (those who like to divide groups, as opposed to "lumpers," who like to combine groups -- a typological difference) should begin to divide the old languages into new ones. If there are really "hundreds" of languages involved, however, further splitting becomes pointless.

On the map at left, we see China of the late Empire divided by the ethnic principle of the "five peoples." While the Hui, , might be Turks or Uighurs, the term in general means "Muslims" and thus applies to ethnic Chinese Muslims. Those Hui speak Mandarin and tend to live in the area identified for the Han, , People on the map. Otherwise, the dialects of Chinese all refer to languages of the Han People. Manchurian has all but disappeared and been replaced by Mandarin.

Within each of the groups of Chinese languages, there are also true dialects, which means that they are mutually intelligible. In Pan's book and The Sino-Tibetan Languages many dialects are shown for the language groups. The confusion over all this -- couldn't everyone tell what forms of speech are mutually intelligible? -- was certainly due to the difficulties of doing research in China in the 20th century. From revolution, to war, to revolution, to totalitarianism, China until recently was not the best place for graduate students wandering around with tape recorders asking strange questions. Such behavior would often have evoked suspicion, arrest, or worse. Of course, there is also the problem of distinguishing dialects from languages in general, when dialects may be intelligible to those nearby, while those at the extreme ends of a range may be incomprehensible to each other.

The table gives a classification of languages and dialects based on a combination of The Sino-Tibetan Languages and other sources. The 10 languages identified on the map from Pan's The Encyclopedia of the Chinese Overseas are given in boldface; but the overall organization is in terms of the three groups and six "dialect familes" of The Sino-Tibetan Languages [p.6]. While Gan and Xiang and now definitely separated, Hakka has come to be included under Gan -- though this is not consistently seen in the book.

"Hakka" itself is an interesting term, in Mandarin, in Cantonese (客家, haak³gaa¹), meaning "guest, visitor, traveller, stranger, merchant," or "customer." Althought there is a concentrated area of Hakka speakers, the language is otherwise spoken in widely scattered areas, where it has been taken by, indeed, Hakka traders.

In Modern Chinese, official Mandarin is the , "Common Language," or the , "National Language." These same expressions are used in Cantonese, pronounced differently of course, where we also find , the "Beijing Language," 北京話, pronounced Bak¹-ging¹-waa⁶ -- or Beg⁷ging¹wa⁶, or Bùk-gìng-wa̿ (see variety of Cantonese transcriptions below, including the a/e/u alternation we see here). As we have seen, is literally used for everything else in the country, whether language or dialect.

Some population figures are given for the older seven language classification. These are given as percentages of the total Chinese speaking population, as a number in millions (M), and, from another source, as a number in thousands (k). These count those for whom the languages are their first languages. The figure of 952,000,000 speakers for Mandarin given elsewhere is for people who speak Mandarin at all. This is considerably larger than the 715 million number below, not just because the population has grown in the last twenty years, but also because Mandarin in the national language of China, taught in schools around the country. Areas where the languages are spoken are given after the language name(s). Names of cities and provinces in Pinyin are given in italics. I have now added new population figures, after a dash, which are taken from The World Almanac and Book of Facts 2008 [World Almanac Books, 2008, p.728]. The Almanac gives the first figures I've seen for Puxian, which is now evidently often broken off of Min, as Jin is broken off of Mandarin.

Northern
- Mandarin, , North, Southwest, 71.5%, 715 M, 679,250 k -- 873 M
  - Northern
    - Northern, Peking,
    - Jin(yu), (), Shanxi -- 45 M
    - Northwestern, Kansu [Gansu]
  - Southern, Nanking,
  - Southwestern, Szechwan [Sichuan]
Central
- Wú, , Shanghai, Zhejiang, 8.5%, 85 M, 80,750 k -- 77 M
  - Wú (I), Suzhou, Shanghai
  - Wú (II), Wenzhou, Chekiang [Zhejiang]
  - Hui(yu), (), Anhui
- Gàn, , Kiangsi [Jiangxi], 2.4%, 24 M, 22,800 k -- 20 M
  - Hakka, , Guandong, Jiangxi, scattered, 3.7%, 37 M, 35,150 k -- 29 M
- Xiang, , Hunan, 4.8%, 48 M, 45,600 k -- 36 M
  - Old Xiang, countryside
  - New Xiang, NW Hunan, cities
Southern
- Min, , Fukien [Fujian], 4.1%, 41 M, 38,950 k
  - Northern Min, Foochow [Fuzhou], 1.3%, 13 M
    - Northern Min, -- 10 M
    - Eastern Min, , Fuzhou -- 9 M
  - Puxian Min, , Putian & Xianyou -- 2 M
  - Southern Min, , "Amoy-Swatow," 2.8%, 28 M -- 46 M
    - Hokkien, Fukien [Fujian]
      - Quanzhou
      - Taiwanese -- 16M
      - Amoy [Xiamen]
      - Zhangzhou
    - Chaoshan, Teochew, Shantou, Swatow
    - Hainanese
- Yuè, , Cantonese, Guandong, Guangxi, 5.0%, 50 M, 47,500 k -- 54 M
  - Pingua, , Guangxi

Dialect Family Initials Finals Tones Syllables

Mandarin, 16 39 4 2496

Gan, 19 59 6 6726

Hakka, 17 69 6 7038

Xiang, 23 37 6 5106

Min, 15 57 7 5985

Wu/Shanghai, 27 50 7 9450

Yue/Cantonese, 20 53 9 9540

Dialect Family	Initials	Finals	Tones	Syllables
Mandarin,	16	39	4	2496
Gan,	19	59	6	6726
Hakka,	17	69	6	7038
Xiang,	23	37	6	5106
Min,	15	57	7	5985
Wu/Shanghai,	27	50	7	9450
Yue/Cantonese,	20	53	9	9540

It is noteworthy that the extension of Mandarin into the Southwest was in part the result of veterans being settled there after the Mongols were ejected from China and the Ming Dynasty founded.

The table is a comparison of dialect families from The Sino-Tibetan Languages [p.127]. The statistics, of course, are from representative languages in each group. I have rearranged the list to move the apparently more conservative languages towards the bottom of the table, though, of course, not all the indications are consistent. With the largest number of tones and of syllables, Cantonese wins as the most conservative, but then Xiang and Shanghai both have more initials than Cantonese -- and Hakka has an anomalously large number of finals and syllables. Mandarin has clearly undergone the greatest phonetic simplification.

For some idea of how the languages different, the character for "south" in Mandarin is , which, we see, is pronounced nán -- occurring in the table above and in the names of many dynasties. The same character and word becomes in Cantonese, in Hakka, in Southern Min, in Northern Min, in Eastern Min, and in Gan. In speech, one would be at a loss to identify most of these as the same word. With Gan, not a single letter is the same. With this going on, it is not hard to understand how the Chinese "dialects" are different languages. The character is borrowed into Japanese as nan (minami in Japanese) and into Vietnamese as nam (expressed as hướng nam, literally "south direction").

Back in the 1990's, I had a student from Singapore in one of my Introduction to Philosophy classes. She was a delightful person and enthusiastic student and decided that we should take Cantonese at UCLA. I had to decline the offer. I later learned that she had been "Miss Singapore" at some point. When the class got to Chinese philosophy, and I talked about the different "dialects" of Chinese, it turned out that the form of Chinese she learned growing up in Singapore was not Mandarin, but she did not know what it was. I don't think we were able to figure that out at the time.

Now I see that, although Singapore has officially been promoting Mandarin, Wikipedia says that among Chinese languages, "Hokkien (Min Nan) used to be an unofficial language of business until the 1980s. Hokkien is also used as a lingua franca among Chinese Singaporeans, and also among Malays and Indians to communicate with the Chinese majority." So my student almost certainly grew up speaking Hokkien.

The actress Michelle Yeoh (b.1962), well known for her role in Crouching Tiger, Hidden Dragon [2000], was herself born in Malaysia from a family of "Hokkien and Cantonese ancestry." She must have grown up speaking Hokkien and perhaps Cantonese also. She had to learn Mandarin for her part in the movie.

Hokkien is glossed as a version of Southern Min, . My older sources identified this as "Amoy-Swatow" and as based in the city of Xiamen, as in the list above. Wikipedia now says that Hokkien originated in "part of Fujian Province in Southeastern China and [is] spoken widely there. It is also spoken widely in Taiwan, where it is usually known as Taiwanese or Holo, and by the Chinese diaspora in Malaysia, Singapore, Indonesia, the Philippines and other parts of Southeast Asia and by other overseas Chinese all over the world." "Hokkien" itself is the Southern Min pronunciation of Mandarin Fujian, .

We then see that "The Amoy dialect is the main dialect spoken in the Chinese city of Xiamen (formerly romanized and natively pronounced as 'Amoy') and its surrounding regions of Tong'an and Xiang'an, both of which are now included in the greater Xiamen area."

In turn, we hear about Swatow thus: "The Shantou dialect, formerly known as the Swatow dialect, is a Chinese dialect mostly spoken in Shantou in Guangdong, China. It is a dialect of Chaoshan Min language." In turn, we then hear that Chaoshan "is a Southern Min language spoken by the Teochew people of the Chaoshan region of eastern Guangdong province, China, and by their diaspora around the world. It is closely related to Hokkien, with which it shares some cognates and phonology, though the two are largely mutually unintelligible."

Thus, we find that the umbrella of "Southern Min" encompases, not just different dialects, but a family of different languages. So my student perhaps would have been understood in Taiwan, but maybe not in some adjacent Mainland areas.

The language map for all of the Min languages is from the Chinese book, The Language Atlas of China [1987, 1989]. The language map above distingishes Puxian from the Min languages, but this map, and my other sources, do not. It is surrounded by Southern and Eastern Min.

Fukien Chinese Tones

1st 陰平 4th 陰去

Yīn Level, 55 Yīn Leaving, 212

high level low falling and rising

2nd 陽平 5th 陽去

Yáng Level, 53 Yáng Leaving, 242

high falling middle rising and falling

3rd 上聲

Rising Tone, 33

middle level

6th 陰入 7th 陽入

Yīn Entering, 24 Yáng Entering, 5

middle rising stopped high level stopped

The treatment of the seven tones here is taken from a book, Fujianese Dictonary & Phrasebook [Hippocrene Books, 2014]. The book does not give an author, but it has a credit for "translated by" for Xiao Chu Chen. "Fujian," of course, is also called "Fukien" or "Hokkien," and is a branch of Southern Min, as we see on the map.

Fukien Chinese Tones
1st	陰平	4th	陰去
Yīn Level, 55	Yīn Leaving, 212
high level	low falling and rising
2nd	陽平	5th	陽去
Yáng Level, 53	Yáng Leaving, 242
high falling	middle rising and falling
3rd	上聲
Rising Tone, 33
middle level
6th	陰入	7th	陽入
Yīn Entering, 24	Yáng Entering, 5
middle rising stopped	high level stopped

In the table, I have retained traditional characters; but Fukian uses simplied characters. Thus, 陰, yīn, occurs as 阴; and 陽, yáng, as 阳. The 3rd tone contains the character 聲, shēng; but this occurs in Fujian as 声. The other characters are traditional.

The Wikipedia page on the Fuzhou dialect uses the traditional characters. It does not number the tones and gives them in a slightly different sequence than Fujian. I have used the Wikpedia page for the numerical contours given for the tones. Wikipedia and Fujian use the same "description" for each tone, which frequently are not what one would expect from the traditional Chinese names, e.g the 3rd tone, called "rising" is actually level. If diacritics were to be used for the tones, the hachek, used in Pinyin, would be appropriate for the falling and rising 4th tone, while the circumflex would be appropriate for the rising and falling 5th tone.

Categories of Chinese Characters

Examples of Dialect Differences Between Peking, Shanghai and, Canton

Pronouncing Mandarin Initials

Mandarin Finals and Syllables

The Contrast between Classical and Modern Chinese

At left are examples of the Cantonese tones, using the notation in Teach Yourself Cantonese by R. Bruce [Teach Yourself Books, Hodder and Stoughton, 1970, 1976, pp.12-13]. Different tone symbols are not needed for the 7th, 8th, and 9th tones. Just as the diacritics for the 1st, 3rd, and 6th tones can be used for the 7th, 8th, and 9th, other systems actually use those numbers.

Copyright (c) 2000, 2006, 2007, 2008, 2017, 2021, 2022 Kelley L. Ross, Ph.D. All Rights Reserved

The Dialects of Chinese, Note

The word "Mandarin" has also been explained as derived from Chinese, as , "Manchu great man" [cf. Dah-an Ho, "The Characteristics of Mandarin Dialects," The Sino-Tibetan Languages, p.127]. However, this looks very much like a folk etymology, and an anachronistic one, since the Portuguese had been in China more than a century (since 1518) before the Manchus took over the country (1644). The language of the officials was going to be called something long before any officials were Manchurian.

There is also the problem of the pronunciation: probably was not pronounced with an r in the era in question. The Wade-Giles writing of the syllable, jen, reflects an older pronunciation, which we see reflected as a y in Cantonese and an actual English-like j in Japanese. Indeed, this is probably why "Japan," , is pronounced in English as it is, with an older Chinese pronunciation -- in Japanese itself, the j/y/r can and does turn up here an n.

I have now found some good evidence of the anachronism, as I suggest, of this claim. "Mandarin" was used in reference to Chinese officials as early as 1552 by the Portuguese writer Fernão Lopez de Castanheda in his Historia do descobrimento e conquista da India. The text is cited by Yule & Burnell in their classic A Glossary of Colloquial Anglo-Indian Words and Phrases ["Hobson-Jobson," Curzon Press, 1886, 1985, "Mandarin," p.550-551].

Return to Text

Examples of Dialect Differences Between
Peking, Shanghai and, Canton

Shanghai Peking

p- pu¹ "wave" po¹ [bō]

p'- p'u¹ "slope" p'o¹ [pō]

b- bu² "old woman" p'o² [pó]

t- tong¹ "east" tong¹ [dōng]

t'- t'ong¹ "be open" t'ong¹ [tōng]

d- dong² "be alike" t'ong² [tóng]

k- kuong¹ "light" kuang¹ [guāng]

k'- k'uong¹ "frame" k'uang¹ [kuāng]

g- guong² "mad, wild" k'uang² [kuáng]

Cantonese Peking

-t/0 kat^7a "cough" k'e² (sou⁴) [ké(sòu)]

-t/0 pat^7a "brush" pi³ [bǐ]

-t/0 yüt^7b/8 "moon" yüeh⁴ [yuè], 月

-t/0 yat^8/9 "sun, day" jih⁴ [rì], 日

-k/0 paak^7b/8 "hundred" pai³ [bǎi]

-k/0 sik^7a "color" (yen²)se⁴ [(yán)sè]

-k/0 kwok^7byü⁴
"national language" kou²yü³ [guóyǔ]

-p/0 t'aap^7b "pagoda" t'a³ [tǎ]

-p/0 yap^8/9 "enter" ju⁴ [rù], 入

-p/0 sap^8/9 "ten" shih² [shí], 十

As we have seen above, Cantonese and Shanghai are really separate languages, distinct from the prestige and politically favored Mandarin of Peking. Cantonese, or Yuè, 粤, is the language of Guǎngdōng, 廣東 (Kwang-tung), and Guǎngxī, 廣西 (Kwang-hsi), provinces. Shanghai, or Wú, 吴, is the language of the city of Shànghǎi, 上海, and surrounding provinces.

	Shanghai	Peking
p-	pu¹ "wave"	po¹ [bō]
p'-	p'u¹ "slope"	p'o¹ [pō]
b-	bu² "old woman"	p'o² [pó]
t-	tong¹ "east"	tong¹ [dōng]
t'-	t'ong¹ "be open"	t'ong¹ [tōng]
d-	dong² "be alike"	t'ong² [tóng]
k-	kuong¹ "light"	kuang¹ [guāng]
k'-	k'uong¹ "frame"	k'uang¹ [kuāng]
g-	guong² "mad, wild"	k'uang² [kuáng]
	Cantonese	Peking
-t/0	kat^7a "cough"	k'e² (sou⁴) [ké(sòu)]
-t/0	pat^7a "brush"	pi³ [bǐ]
-t/0	yüt^7b/8 "moon"	yüeh⁴ [yuè], 月
-t/0	yat^8/9 "sun, day"	jih⁴ [rì], 日
-k/0	paak^7b/8 "hundred"	pai³ [bǎi]
-k/0	sik^7a "color"	(yen²)se⁴ [(yán)sè]
-k/0	kwok^7byü⁴ "national language"	kou²yü³ [guóyǔ]
-p/0	t'aap^7b "pagoda"	t'a³ [tǎ]
-p/0	yap^8/9 "enter"	ju⁴ [rù], 入
-p/0	sap^8/9 "ten"	shih² [shí], 十

Cantonese, also, is particularly the language associated wtih the capital of Guǎngdōng Province, namely Canton. That name, of course, derives from the name of the province. The city itself in Chinese is Guǎngzhōu, 廣州. The politically correct practice these days, of course, is to use the unpronounceable local name of a place, rather than the traditional name already familiar in another langugage. Viewers thus may encounter documentaries or news stories that avoid the name "Canton" altogether. However, those untrained in Chinese, or linguistics, will be unable to pronounce any of these Chinese names with the appropriate phonology.

In the table superscript numbers are the tones for Shanghai, Cantonese, and Peking Mandarin in the Wade-Giles system. Brackets contain Pinyin writings, which now on this page have been updated with the proper diacritics in Unicode. While Pinyin and the system from Teach Yourself Cantonese [1976] use diacritics to indicate the tones, other common transcription systems for Cantonese, except the Yale, use superscript numbers. Since many diacritics are now easily represented in Unicode, I don't know why superscript numbers still dominate.

The examples are from The Languages of China, by S. Robert Ramsey [Princeton University Press, 1987]. The Shanghai/Peking examples are from p.91; and the Cantonese/Peking examples from pages 102 & 105. If there are two numbers for the tones, the first is from this book, while the second is what we find, recounted below, with more recent sources.

The Wu () dialect of Shanghai is noteworthy because it retains the distinction between voiced and unvoiced, aspirated and unaspirated stops that existed in T'ang Chinese. In Mandarin and the other languages the voiced stops have disappeared. In these examples, the voiced stops have been assimilated to the aspirated ones.

Cantonese () is noteworthy because it retains from T'ang Chinese a greater variety of finals. In Mandarin, a syllable must end in a vowel or in n or ng. In Cantonese, syllables can also end in p, t, k, or m as well. Words borrowed from Chinese into Korean, Japanese, and Vietnamese often also preserve evidence of the older final consonants. Thus "China," 中國 (Mandarin Zhōngguó, "Middle Country"), in Korean is Chung-guk and in Japanese Chū-koku (Japanese syllables only end in a vowel or "n"). Both of them have an extra consonant in "country" where Mandarin doesn't -- but Cantonese (Jòong-gwōk, Zung¹gwok³) does.

I had a lingustics professor once who said that you could get a kind of "instant Proto-Indo-European" by combining Greek vowels and Sanskrit consonants. Well, we can get a kind of "instant T'ang Chinese" by combining Shanghai initials and Cantonese finals. The evidence is poor for older versions of Chinese. Cantonese also preserves the larger number of tones that T'ang Chinese probably had. Mandarin only has four now, but Cantonese has six, or even nine if the tones of finals that end in stops are counted separately, which they usually are. The problem of the tones in T'ang Chinese is discussed below.

Mandarin Tones

1st 陰平 2nd 陽平

mā Yīn
Level má Yáng
Level

High Level Rising

3rd 上 4th 去

mǎ High mà Leaving

Dipping Falling

Mandarin Tones
1st	陰平	2nd	陽平
mā	Yīn Level	má	Yáng Level
High Level	Rising
3rd	上	4th	去
mǎ	High	mà	Leaving
Dipping	Falling

The most daring theory is that the Chinese of Confucius's day didn't even have tones. Evidence for this is that other members of the Sino-Tibetan language family do not have tones, while the nearby family of the Daic languages (like Thai) all have tones. In another adjacent language family, the Austroasiatic (Mon-Khmer) group, some languages have tones (like Vietnamese) and others do not. It is tempting to see the phenomenon as a South-East Asian Sprach Bund where the Daic tones have influenced some languages in the Sino-Tibetan and Austroasiatic families.

Cantonese Tones

1st High Rising,
Falling sì 司, 詩,
思

2nd Middle Rising sí 史

3rd Middle Level si̅ 試

4th Low Falling sǐ 時

5th Low Rising sĭ 市

6th Low Level si̿ 事, 是

7th High Level sìk 色, 識

8th Middle Level si̅k 錫

9th Low Level si̿k 食

Cantonese Tones
1st	High Rising, Falling	sì	司, 詩, 思
2nd	Middle Rising	sí	史
3rd	Middle Level	si̅	試
4th	Low Falling	sǐ	時
5th	Low Rising	sĭ	市
6th	Low Level	si̿	事, 是
7th	High Level	sìk	色, 識
8th	Middle Level	si̅k	錫
9th	Low Level	si̿k	食

In other treatments, such as my source for the table above, The Languages of China, the 7th and 8th tones are styled 7a and 7b, while the 9th tone becomes the 8th. One case occurs with the character 日, "sun, day." This is rì in Mandarin. It was given in this source for the table as yat⁸.

The same word is transcribed ya̿t in Teach Yourself Cantonese, yed⁹ in A Concise Cantonese-English Dictionary, yaht in the English-Cantonese Dictonary, Cantonese in Yale Romanization [by Kwan Choi Wah, et al., The Chinese University Press, Hong Kong, 1991], and jat⁶ at Wiktionary -- where this matches the transcription in the ABC Cantonese-English Comprehensive Dictionary [Robert Bauer, University of Hawai'i Press, 2021]. Since the Yale notation (with h but no diacritic) indicates the 6th tone, these are all consistent, with either the 6th or 9th tone. We thus recognize that yat⁸ is actually the 9th tone, or elsewhere written, with a diacritic or otherwise, for the 6th.

The 7th, 8th, and 9th tones are all level. This is only a little confusing with the 7th, which writes the tone for the 1st, which itself is a little confusing, often called either rising or falling. But, as we will see below, the 1st tone rises slightly and then falls. In the 7th, the falling part is lost and the rising part flattened.

While this table originally used graphic images for the characters and the diacritics, I have now been able to substitute Unicode writings for all of them. The single and double overlines for two of the diacritics, and the breve for the 5th tone, are derived from the Uncode block of "Combining Diacrtical Marks" (#x0300). The breve with "i" can also be written "ĭ" from the "Latin Extended-A" block (#x0100).

Since the characters are now in Unicode, they can be copied and pasted into search engines, where each can be found with a Wiktionary page that gives information about meaning, pronunciation in different Chinese dialects, etc. The only really novel diacritic here is the double overline, but other transcription systems seem to be innocent even of traditional marks like the breve. Conspicuously missing is the venerable circumflex, already easily rendered in HTML, and which we might think particularly suitable for the 2nd tone. I don't know why it would not be used.

Cantonese Tones

1st 陰平 2nd 陰上 3rd 陰去

Yīn Level Yīn High Yīn Leaving

4th 陽平 5th 陽上 6th 陽去
Yáng Level Yáng High Yáng Leaving

7th 上陰入 8th 下陰入 9th 陽入
High Yīn
Entering Low Yīn
Entering Yáng
Entering

Cantonese Tones
1st	陰平	2nd	陰上	3rd	陰去
Yīn Level	Yīn High	Yīn Leaving
4th	陽平	5th	陽上	6th	陽去
Yáng Level	Yáng High	Yáng Leaving
7th	上陰入	8th	下陰入	9th	陽入
High Yīn Entering	Low Yīn Entering	Yáng Entering

There seems to be a reluctance in many current systems to take full advantage of representing the tones with dedicated diacritics, even though Pinyin made full use of them, and Unicode easily makes many diacritics available.

The Chinese names of the Cantonese tones, at right, are themselves of interest. It is an exercise in the use of Yīn & Yáng. The first three tones are Yīn, 陰, and the second three Yáng, 陽. The 7th and 8th tones refer back to the 1st and 3rd, which are Yīn, while the 9th refers back to the 6th, which is Yáng. Because the 7th and 8th tones are both Yīn ("upper" and "lower"), this may be why some systems have called them "7a" and "7b." The "entering" tones, 入 (Mandarin rù, Cantonese ya̿p), have always gone with syllables closed with "p," "t," and "k," which is why we don't see them in Mandarin, where such syllables are gone.

Below left is a diagram off of Wikipedia showing the sound contours of the tones in pitch and duration, for the tones from the 1st to the 6th [Alexander L. Francis, 2008]. It is easy to match up the brief glosses from the tables with the actual tones. The characters used for the tones include the examples from Teach Yourself Cantonese, and I have added characters from the Cantonese page at Wikipedia. These are not the names of the tones but examples of characters with the given pronunciation.

The character 事, si̿, "affair, matter, undertaking, business," in particular catches my attention because of its importance in Chinese philosophy, correponding, among other things, to Greek πρᾶγμα. In Mandarin 事 is shì, in Japanese ji (kun reading koto), in Vietnamese sự, and in Korean sa, 사 [A Guide to Korean Characters, Reading and Writing Hangŭl and Hanja, by Bruce K. Grant, Hollym International, Elizabeth, NJ, Seoul, 1979, p.95]. Where 事 has an extra meaning as a verb, "to serve," the Vietnamese-English Dictionary, by Nguyễn Ðình-Hoà, has separate entries for the two terms [Tuttle Language Library, 1966, 1991, pp.398-399]. Although sự is spelled in the same way, you would not know from that alone that they were from the same Chinese character.

These example words in the table will look different in A Concise Cantonese-English Dictionary by Yang Mingxin [Guangdong Higher Education Publishing House, 1999]. First of all, the latter uses an adapted Pinyin alphabet, where "x" is used for "s" and "g" for final "k." Second, although Pinyin introduced the use of Greek-like accents to show tones, the Dictionary reverts to the old Wade-Giles way of simply numbering the tones with superscripts. Also, the Dictionary uses simplified forms of some of the characters. I have used the unsimplified characters in Bruce where these are available. The Yale system of Romanization, with discussion of some alternatives (though not the Pinyin) is used in the English-Cantonese Dictionary.

Dictionaries or grammars of Shanghai Chinese in English seem to all be out of print. Grammar, however, can be found in many pages on line.

A nice example of a difference between Mandarin and Cantonese is a surname. This is Wú in the former, Ňg in the latter (and actually O or Oh in Korean). The Cantonese name is one of many words that are simply a syllabic ng. There is also a syllabic ḿ in Cantonese, which is , "not," in Mandarin. That is the only word with that pronunciation in A Concise Cantonese-English Dictionary [pp.260-262]. Although it seems like there ought to be, there is no syllabic n in Cantonese.

There is more than one character used for the Cantonese surname. At right, we see the traditional character first, then a recent simplified one to the right of the pronunciation.

This was also the name of the Kingdom of Wu, one of the states of the Three Kingdoms Period in Chinese history, and of the modern language of Shanghai. At far right is an alternative character used, at least in Cantonese, for the surname. My only question is that the first character (with its simplification) and the second are pronounced differently. In Mandarin, the first has a 2nd tone, the second a 3rd. In Cantonese, the first has a 4th tone, 呉, Ňg, the second a 5th, 伍, N̆g (with the symbols used in Teach Yourself Cantonese). The contours are similar but the tone level a little different.

I originally learned of the two possible characters from a young woman whose name actually was Ng, but I didn't know then to ask about the different tones. Perhaps someone can help me out.

Note that the Cantonese spellings in the table above are from Teach Yourself Cantonese, while, as noted, A Concise Cantonese-English Dictionary uses a form of Pinyin adapted from Mandarin. Thus, words traditionally ending in t/k/p are written d/g/b in the latter.

Cantonese has a large variety of vowels and diphthongs. These are transcribed in confusingly different ways in different systems. Thus, the character 佛, meaning the "Buddha, Buddhism," etc., is fó in Mandarin, butsu in Japanese, phật in Vietnamese, and bul, 불, in Korean [op.cit., p.78].

For Cantonese, 佛 is transcribed fu̿t in Teach Yourself Cantonese, fed⁹ in A Concise Cantonese-English Dictionary, faht in the English-Cantonese Dictonary, Cantonese in Yale Romanization [The Chinese University Press, Hong Kong, 1991], and fat⁶ at Wiktionary.

While we might not be surprised that the 9th tone in one source turned up as the 6th in another, it may be more puzzling why the very same vowel should be represented as "u," "e," and "a" in different sources. The actual vowel may be closer to the "schwa," /ə/, which is a reduced and indefinite vowel in relation to a full "u," "e," or "a." Unless a schwa is actually used, it is hard to say who has the best idea in this. Note that the Yale system tries to indicate the tones by diacritics with slightly different versions of the vowels -- "ah" without a diacritic is the 6th tone on "a."

The Yale system is also now used in the Pocket Cantonese Dictionary, Cantonese-English, English-Cantonese, by Martha Lam and Lee Hoi Ming [Periplus, Hong Kong, 2019], and the Complete Cantonese, by Hugh Baker and Ho Pui-Kei [Teach Yourself, Hackette, 2020]. Complete Cantonese has decided that there are seven tones, apparently by dividing the 1st into "high level tone" and "high falling tone," where the "high level" actually would be confined to what the other sources regard, indeed, as the actual 7th tone. But the tones are not numbered.

The Pocket Cantonese Dictionary confines itself to six tones, with the traditional numbering. The character 佛 there is the Yale faht, which seems to be missing from the vocabulary of Complete Cantonese. The connection of Complete Cantonese to the "Teach Yourself" books is all but buried, and the interesting system of the 1976 book is gone.

Restaurant Name in Cantonese

I have had some difficulty pinning down the number of tones in Shanghai Chinese. In the table above, we see Shanghai credited with seven tones [The Sino-Tibetan Languages, edited by Graham Thurgood and Randy J. LaPolla, Routledge Language Family Series, Routledge, London, 2003, p.127]; but at Wikipedia, I see five tones, as in the table at right. The most interesting feature of the Wikipedia treatment

Shanghai Chinese Tones

the T'ang Tones 平上去入

Level High Leaving Entering

Yīn, 陰 1st, 53 2nd, 34 4th, 5ʔ

Yáng, 陽 3rd, 13 5th, 2ʔ

may be that the Yáng tones occur after voiced initials and the Yīn tones after unvoiced.

Shanghai Chinese Tones
the T'ang Tones	平	上	去	入
Level	High	Leaving	Entering
Yīn, 陰	1st, 53	2nd, 34	4th, 5ʔ
Yáng, 陽	3rd, 13	5th, 2ʔ

Over the years, I have had difficulty finding books in print about Shanghai Chinese. Now, however, I have in hand Shanghainese, Dictionary & Phrasebook, by Richard VanNess Simmons [Hippocrene Books, 2011], and A Chinese-English Dictionary of the Wu Dialect (Featuring the Dialect of the City of Shanghai), by Thomas Creamer [Dunwoody Press, 1991]. Shanghainese features a discussion of the tones, which, while giving five in number, nevertheless breaks them down into a total of eight overall. A Chinese-English Dictionary represents the tones with numerical contours, from 1 to 5. Thus, "53" is a tone that begins high, "5," and falls to mid-level, "3." This is a device commonly used, even where tones may otherwise be distinctively numbered.

While those numbers are the ones used in A Chinese-English Dictionary of the Wu Dialect, they are slightly different from what we find on the Shanghainese page at Wikipedia, where we have "52" for "53," "14" for "13," "44ʔ" for "5ʔ," and "24ʔ" for "2ʔ." These variations in sources are not unusual. What seems a little anomalous is that the "abrupt" Entering tones are represented as having the same duration as the other tones, while the dictionary has them cut off short.

Shanghai Chinese Tones

平上去入

Yīn, 陰 1a, High
Rising 2, Rising
& Falling 3,
Falling 4a, Abrupt
High 5a, Level
Mid

Yáng, 陽 1b, Low
Rising 4b, Abrupt
Low 5b, Level
Low

I have updated the table above in light of the analysis of the tones of Shanghai Chinese in Shanghainese. In three cases, we get a differentiation that occurs between voiced and unvoiced initials. With the 2nd tone, we don't seem to see this difference, but the 2nd tone is said to only occur in "words with three or more syllables" [p.14]. This is very intriguing since it begins to sound like the Greek case of only one tone occurring for a whole polysyllabic word.

Shanghai Chinese Tones
	平	上	去	入
Yīn, 陰	1a, High Rising	2, Rising & Falling	3, Falling	4a, Abrupt High	5a, Level Mid
Yáng, 陽	1b, Low Rising	4b, Abrupt Low	5b, Level Low

This impression is reinforced in the discussion of the 3rd tone, where we see "all words with a falling accent (`) over the first syllable have a falling tone that spreads across the whole word (starting in the first syllable)" [ibid.]. With the "entering" tones, they occur, where we might expect, with "q" finals (glottal stops that indicate where the finals used to be "p," "t," or "k"), but they also occur in a polysyllabic word without "q," where the following syllable will retain its own tone.

I have not given numerical contours for the tones in this table because I have not found simple versions of them. The Shanghainese Wikipedia page has a large table for "tone sandhi" (Sanskrit sandhi for sound changes from one syllable to the next) across multiple syllables; but this has two numbers for each syllable, up to five syllables, which is not the simplicity I'm looking for. And there are anomalies, such as the 1st tone described as "rising," when the contour, "53," shows it falling. I don't have an explanation for that.

This raises all sorts of questions about the Wu/Shanghai language(s). There is even an issue concerning the voiced initials, which may also be aspirated. So it is a shame not to see more books in print about it. This may be because, unlike Cantonese, Shanghai is really just a spoken language. The written language in the place is Mandarin.

Pronouncing Mandarin Initials

Mandarin Finals and Syllables

Mandarin Finals and Syllables

Copyright (c) 2000, 2005, 2006, 2015, 2019, 2020, 2021, 2022 Kelley L. Ross, Ph.D. All Rights Reserved

Pronouncing Mandarin Initials

Chinese has the extraordinary structure that nearly every syllable has a semantic content, even if only a historical one. Each syllable is thus written with a Chinese character, which was originally a separate word.

Each syllable is analyzed into an "intitial" and a "final." The "final" contains the vowel, the tone, and the final consonant, if any. This structure is also applied to

Simple Initials

Pinyin Wade-Giles Pronunication

b p p, unaspirated (spot)

p p' p^h, aspirated (pot)

m m m

f f f

d t t, unaspirated (stop)

t t' t^h, aspirated (top)

n n n

l l l

g k k, unaspirated (skit)

k k' k^h, aspirated (kit)

h h h

Sibilant Initials

Pinyin Wade-Giles Pronunication

z ts ts, unaspirated

c ts t^hs, aspirated (hats)

s s s

Retroflex Initials

Pinyin Wade-Giles Pronunication

zh ch ṭṣ, unaspirated

ch ch' ṭ^hṣ, aspirated

sh sh ṣ

r j r

Palatal Initials

Pinyin Wade-Giles Pronunication

j ch tš, unaspirated

q ch' t^hš, aspirated (church)

x hs š

Korean and Vietnamese, which borrowed Chinese writing and many Chinese words, even though neither language was even related to Chinese.

Simple Initials
Pinyin	Wade-Giles	Pronunication
b	p	p, unaspirated (spot)
p	p'	p^h, aspirated (pot)
m	m	m
f	f	f
d	t	t, unaspirated (stop)
t	t'	t^h, aspirated (top)
n	n	n
l	l	l
g	k	k, unaspirated (skit)
k	k'	k^h, aspirated (kit)
h	h	h
Sibilant Initials
Pinyin	Wade-Giles	Pronunication
z	ts	ts, unaspirated
c	ts	t^hs, aspirated (hats)
s	s	s
Retroflex Initials
Pinyin	Wade-Giles	Pronunication
zh	ch	ṭṣ, unaspirated
ch	ch'	ṭ^hṣ, aspirated
sh	sh	ṣ
r	j	r
Palatal Initials
Pinyin	Wade-Giles	Pronunication
j	ch	tš, unaspirated
q	ch'	t^hš, aspirated (church)
x	hs	š

The "initials," apart from the tones, pose the greatest challenge for foreigners trying to pronounce Chinese. And now we have two common systems for writing Mandarin, the older Wade-Giles and the recent Pinyin. The greatest challenge is that Mandarin does not have voiced stops, like b, d, and g. These existed in T'ang Chinese (and have been preserved in the Shanghai or Wu language), but have been lost in Mandarin. Instead, Mandarin contrasts aspirated stops with unaspirated stops. "Aspirates" have breath coming out, "unaspirates" don't. In Wade-Giles, aspirates were indicated with an apostrophe, as in the name of the T'ang Dynasty. Sometimes it is said that an aspirated t is pronounced like the t in "hot house." This not quite right, since the t there is in a separate syllable, and a separate word, from the "h" aspiration. Instead, it should be noted that English contrasts, in certain environments, an aspirated from an unaspirated t. Thus the t in "top" is aspirated, and the t in "stop" is unaspirated. Holding a hand in front of the mouth can detect the breath expelled in one and not expelled in the other. The Chinese unaspirated t can be duplicated by pronouncing "stop" without the "s." Aspirations are indicated in the "pronunciation" column of the table with a superscript h.

Since there are no voiced stops in Mandarin, the Pinyin system conveniently uses the Latin letters for the voiced stops for unaspriated stops, and the Latin letters for the unvoiced stops for the aspirated stops. The English word "stop" thus could be written in Pinyin as "sdob," which looks very odd, and has a final consonant unallowed by Mandarin, but does use the proper values of the Pinyin consonants.

The "retroflex" initials have the tongue curling up, as in the similar series of sounds in Sanskrit and subsequent languages in India. But other Chinese dialects do not distinguish retroflex from palatal initials. In fact, even in Mandarin, retroflexes and palatals are really just different allophones (sounds) of the same phonemes, i.e. they do not occur in the same environment and so can actually be represented by the same signs (as in Wade-Giles). Retroflexes (and sibilants) occur only with a, o/e, and u finals. Palatals occur only with i and ü finals. The "i" written with sibilants and retroflexes, e.g. "si" and "zhi," does not represent a true i, but a "buzzing" for sibiliants and an r for retroflexes.

The Wade-Giles system represents Chinese more efficiently and familiarly. Pinyin, besides the phonemic redundancy, has the drawback that the sound of a number of letters (like q and x) has nothing to do with how they are pronounced in most Western languages. On the other hand, Pinyin makes a more elegant use of the Latin alphabet.

Pronouncing Mandarin Initials

Copyright (c) 2000, 2006, 2019 Kelley L. Ross, Ph.D. All Rights Reserved

Mandarin Finals and Syllables

Simple Initials and Group-a Finals

Initials Finals

Ø á án áng ái áo

Ø a an ang ai ao

b ba ban bang bai bao

p pa pan pang pai pao

m m ma man mang mai mao

f fa fan fang

d da dan dang dai dao

t ta tan tang tai tao

n na nan nang nai nao

l la lan lang lai lao

g ga gan gang gai gao

k ka kan kang kai kao

h ha han hang hai hao

Retroflex & Sibilant Initials and Group-a Finals

Initials Finals

"i" a an ang ai ao

zh zhi zha zhan zhang zhai zhao

ch chi cha chan chang chai chao

sh shi sha shan shang shai shao

r ri ran rang rao

z zi za zan zang zai zao

c ci ca can cang cai cao

s si sa san sang sai sao

Each syllable in Chinese is analyzed into an "intitial" and a "final." Initials of Mandarin are considered in the section above. The "final" contains the vowel, the tone, and the final consonant, if any. The tables here show nearly all the possible syllables in the Standard form of Mandarin Chinese, i.e. the Mandarin of Peking (Beijing). This is not actually all the syllables because of the "Group-r" finals. Those are added either as "er" or "r" to the syllables shown. After a, o, e, u, and ng, "r" is added. After ai, an, and en, drop the i or n and add "r." After i and ü, add "er." and With "i," in, and un, drop the i or n and add "er."

Simple Initials and Group-a Finals
Initials	Finals
Ø	á	án	áng	ái	áo
Ø		a	an	ang	ai	ao
b	ba	ban	bang	bai	bao
p	pa	pan	pang	pai	pao
m	m	ma	man	mang	mai	mao
f		fa	fan	fang
d	da	dan	dang	dai	dao
t	ta	tan	tang	tai	tao
n	na	nan	nang	nai	nao
l	la	lan	lang	lai	lao
g	ga	gan	gang	gai	gao
k	ka	kan	kang	kai	kao
h	ha	han	hang	hai	hao
Retroflex & Sibilant Initials and Group-a Finals
Initials	Finals
"i"	a	an	ang	ai	ao
zh	zhi	zha	zhan	zhang	zhai	zhao
ch	chi	cha	chan	chang	chai	chao
sh	shi	sha	shan	shang	shai	shao
r	ri		ran	rang		rao
z	zi	za	zan	zang	zai	zao
c	ci	ca	can	cang	cai	cao
s	si	sa	san	sang	sai	sao

The "Group-a" finals go with the simple, the retroflex, and the sibilant initials. The "i" final only occurs with the retrolex and sibilant initials, and represents a vowel with little kinship to an actual i. For the retroflexes, it is more of an r sound, while with the sibilants it is a vowel so reduced and indefinite that it is described as a "buzzing." Indeed, in the Yale system of transcription, the former is rendered with "r" and the latter with "z." Wade-Giles uses "ih" (or "tzu" for Pinyin zi, etc.), thus distinguishing it from the simple "i" used with the palatals. In this way, neither Pinyin nor Wade-Giles give much of a clue from English phonology how to make the sound. Since the "i" is the only letter i that is not used with the "Group-i" finals and the palatal initials, its presence rather confuses the symmetry of the system, although there is no ambiguity (I will not say confusion), since "i" does only occur with the retroflex and sibilant initials. It is a cleaner and more elegant solution than in Wade-Giles. Since Pinyin was willing to pick phonetic values of the Latin alphabet from different languages, the undotted Turkish I might have been considered for the "i" sound, though this is not available in HTML and is, as noted, unnecessary.

Otherwise, the vowels in the table are as they are in Wade-Giles. The syllabic m in included in the table just as a reminder that there is such a thing in Cantonese. In the index row, the tone is written over the vowel to show, where there might be ambiguity, which vowel is used.

Simple Initials and Group-o/e Finals

Initials Finals

ó é én éng éi óu óng

Ø e en eng ou

b bo ben beng bei

p po pe pen peng pei pou

m mo men meng mei mou

f fo fen feng fei fou

d de deng dei dou dong

t te teng tou tong

n nen neng nei nou nong

l le len leng lei lou long

g ge gen geng gei gou gong

k ke ken keng kei kou kong

h ho he hen heng hei hou hong

Retroflex & Sibilant Initials and Group-o/e Finals

Initials Finals

o e en eng ei ou ong

zh zhe zhen zheng zhei zhou zhong

ch che chen cheng chou chong

sh she shen sheng shei shou

r re ren reng rou rong

z ze zen zeng zei zou zong

c ce cen ceng cou cong

s se sen seng sou song

With the "Group o/e" finals a major difference between Pinyin and Wade-Giles is that the latter writes the "ong" final as "ung." Since one may be used to seeing words like "Chung" in English, its absence from Pinyin is conspicuous.

Simple Initials and Group-o/e Finals
Initials	Finals
ó	é	én	éng	éi	óu	óng
Ø		e	en	eng		ou
b	bo		ben	beng	bei
p	po	pe	pen	peng	pei	pou
m	mo		men	meng	mei	mou
f	fo	fen	feng	fei	fou
d		de		deng	dei	dou	dong
t	te	teng		tou	tong
n		nen	neng	nei	nou	nong
l	le	len	leng	lei	lou	long
g	ge	gen	geng	gei	gou	gong
k	ke	ken	keng	kei	kou	kong
h	ho	he	hen	heng	hei	hou	hong
Retroflex & Sibilant Initials and Group-o/e Finals
Initials	Finals
o	e	en	eng	ei	ou	ong
zh		zhe	zhen	zheng	zhei	zhou	zhong
ch	che	chen	cheng		chou	chong
sh	she	shen	sheng	shei	shou
r	re	ren	reng		rou	rong
z	ze	zen	zeng	zei	zou	zong
c	ce	cen	ceng		cou	cong
s	se	sen	seng	sou	song

Of priniciple interest here in the phonetic system is the lack of contrast between the o and e finals. Where the final o is used, e is not; and where e is used, o is not. That this was not always the case is shown with two anomalous syllables against a blue background. Pe and ho used to occur, but they do no longer. The only minimal pairs with o/e are those with contrasting eng and ong finals, though there are a good number of these.

The pe syllable is found in the name "Peking," which now, with the Pinyin Beijing being used, people might just think of as some kind of mistake. It is not a mistake, just a transcription of an older form of pronunication in Mandarin, where pe existed, and where the palatals in the "Group-i" finals had not yet developed from their original stops -- the word is still king in Cantonese and was borrowed as kyô into Japanese.

A difference between Pinyin and Wade-Giles that would also apply to the "Group-a" finals above is the initial r. In Wade-Giles, that is written j, which, pronounced r, must produce for Wade-Giles as much confusion as q and x in Pinyin. Again, this reflects some history. Since the r corresponds to a y in Cantonese (yat for rì), and is often borrowed as (English) j into Japanese (e.g. jin for rén), writing j in Wade-Giles reflects the circumstance that this is pronounced y in German but j in English (the y pronunciation being the original value of j as a modification of Latin i). However, the letter is also borrowed as n into Japanese (e.g. nichi for rì), and r itself does not look much like a natural derivative of either y or j. So there seems to have been something else going on in the original Chinese sound, which may have been more an ñ than a y.

Simple Initials and Group-u Finals

Initials Finals

ú uá uó uái uí uán ún uáng uéng

Ø wu wa wo wai wéi wan wen wang weng

b bu

p pu

m mu

f fu

d du duo dui duan dun

t tu tuo tui tuan tun

n nu nuo nuan

l lu luo luan lun

g gu gua guo guai gui guan gun guang

k ku kua kuo kuai kui kuan kun kuang

h hu hua huo huai hui huan hun huang

Retroflex & Sibilant Initials and Group-u Finals

Initials Finals

u ua uo uai ui uan un uang ueng

zh zhu zhua zhuo zhuai zhui zhuan zhun zhuang

ch chu chuo chuai chui chuan chun chuang

sh shu shua shuo shuai shui shuan shun shuang

r ru ruo rui ruan run

z zu zuo zui zuan zun

c cu cuo cui cuan cun

s su suo sui suan sun

In the Group-u finals, uo often turns up as just o in Wade-Giles. Otherwise, we see a lot of possible syllables that are not used. A curiosity in both systems is that ui is actually pronounced more like ué (with the accent from French). Wei is written more like it is pronounced (with the anomaly that the tone goes on the e). That all this is the case may be because the Mandarin e in isolation has more of the reduced, schwa-like sound that is familar from many occurrences in English (the last a in "banana"), French (le), and German (Töne). We don't get a pure Italian e or French é in Mandarin.

Simple Initials and Group-u Finals
Initials	Finals
ú	uá	uó	uái	uí	uán	ún	uáng	uéng
Ø	wu	wa	wo	wai	wéi	wan	wen	wang	weng
b	bu
p	pu
m	mu
f	fu
d	du		duo		dui	duan	dun
t	tu	tuo	tui	tuan	tun
n	nu	nuo		nuan
l	lu	luo	luan	lun
g	gu	gua	guo	guai	gui	guan	gun	guang
k	ku	kua	kuo	kuai	kui	kuan	kun	kuang
h	hu	hua	huo	huai	hui	huan	hun	huang
Retroflex & Sibilant Initials and Group-u Finals
Initials	Finals
u	ua	uo	uai	ui	uan	un	uang	ueng
zh	zhu	zhua	zhuo	zhuai	zhui	zhuan	zhun	zhuang
ch	chu		chuo	chuai	chui	chuan	chun	chuang
sh	shu	shua	shuo	shuai	shui	shuan	shun	shuang
r	ru		ruo		rui	ruan	run
z	zu	zuo	zui	zuan	zun
c	cu	cuo	cui	cuan	cun
s	su	suo	sui	suan	sun

The "Group-ü" finals feature the vowel ü, written and pronounced like the u-Umlaut in German (also used now in Turkish). This is the sound i with lip-rounding, and so, being a front vowel like i, is found with the palatal initials of the "Group-i" vowels, as given below.

Where Wade-Giles did not distinguish between retroflex and palatal initials with different letters, it did so by the circumstance that the palatals only occurred with "Group-i" and "Group-ü" finals. Thus the ü was always fully written.

Simple and Palatal Initials and Group-ü Finals

Initials Finals

ü üé üán ün

Ø yú yue yuan yún

j ju jue juan jun

q qu que quan qun

x xu xue xuan xun

n nü nüe

l lü lüe

Simple and Palatal Initials and Group-ü Finals
Initials	Finals
ü	üé	üán	ün
Ø	yú	yue	yuan	yún
j	ju	jue	juan	jun
q	qu	que	quan	qun
x	xu	xue	xuan	xun
n	nü	nüe
l	lü	lüe

Since Pinyin does differentiate the initials with different letters, the need for the Umlaut, to separate "Group-u" from "Group-ü" finals, is mostly eliminated. However, some writers do not seem to realize that this is not universally the case. Where the initials are n or l, the Umlaut is still necessary. Thus, lü is sometimes improperly written as lu in Pinyin.

The retention of the Umlaut does create some graphic difficulties, since the tone must be written atop it in nü and lü, something that fonts may not often be called upon to do. Otherwise, its loss is a convenient simplification.

Mandarin Tones

1st 陰平 2nd 陽平

mā Yīn
Level má Yáng
Level

High Level Rising

3rd 上 4th 去

mǎ High mà Leaving

Dipping Falling

Unicode, however, allows us to use the tones even with nü and lü.

Mandarin Tones
1st	陰平	2nd	陽平
mā	Yīn Level	má	Yáng Level
High Level	Rising
3rd	上	4th	去
mǎ	High	mà	Leaving
Dipping	Falling

As we see here, at a convenient point, Mandarin syllables are distinguished by four tones, or five, if we count the "drop tone," where a syllable may be left without a tone. The tone is considered part of the Final, along with the other contents of this page.

Wide-Giles had indicated the tones with numerical superscripts, which was a cumbersome device and and easily lost for most non-linguistic purposes. Pinyin wisely returned to the origin of tone notation, in Classical Greek, and begins to write the tones with the equivalent of the accents that were invented to write Greek. The differences we see are the use of the macron for the high even tone, the 1st, while the 3rd tone, which is falling and rising, flips the Greek circumflex, which had been rising and falling, to show this. The result is the use of the Czech háček, which is applied to a vowel, rather than its customary use on Slavic consonants. Although this is now very convenient, we still rarely see a name like "Běijīng" written with the requisite tones.

Simple Initials and Group-i Finals

Initials Finals

í iá iáo ié iú ián ín iáng íng ióng

Ø yi ya yao ye yóu yan yin yang ying yong

b bi biao bie bian bin bing

p pi piao pie pian pin ping

m mi miao mie miu mian min ming

d di diao die diu dian ding

t ti tiao tie tian ting

n ni niao nie niu nian nin niang ning

l li lia liao lie liu lian lin liang ling

Palatal Initials and Group-i Finals

Initials Finals

i ia iao ie iu ian in iang ing iong

j ji jia jiao jie jiu jian jin jiang jing jiong

q qi qia qiao qie qiu qian qin qiang qing qiong

x xi xia xiao xie xiu xian xin xiang xing xiong

With the "Group-i" finals, we see a number of systematic differences between Pinyin and Wade-Giles. Ian here turns up as ien in Wade-Giles, and iong as iung. Although written ian, the a is a reduced vowel pronounced still more like the e discussed above.

Simple Initials and Group-i Finals
Initials	Finals
í	iá	iáo	ié	iú	ián	ín	iáng	íng	ióng
Ø	yi	ya	yao	ye	yóu	yan	yin	yang	ying	yong
b	bi		biao	bie		bian	bin		bing
p	pi	piao	pie	pian	pin	ping
m	mi	miao	mie	miu	mian	min	ming
d	di	diao	die	diu	dian		ding
t	ti	tiao	tie		tian	ting
n	ni	niao	nie	niu	nian	nin	niang	ning
l	li	lia	liao	lie	liu	lian	lin	liang	ling
Palatal Initials and Group-i Finals
Initials	Finals
i	ia	iao	ie	iu	ian	in	iang	ing	iong
j	ji	jia	jiao	jie	jiu	jian	jin	jiang	jing	jiong
q	qi	qia	qiao	qie	qiu	qian	qin	qiang	qing	qiong
x	xi	xia	xiao	xie	xiu	xian	xin	xiang	xing	xiong

We also see the most unfamiliar use of letters in Pinyin, with q for Wade-giles ch' and x for hs -- which itself was simply an alternative to sh. X actually is used to write sh in some languages (e.g. Basque). I am not aware of q being used anywhere to write any variation of English ch. However, whether intentional or not, this evokes a bit of the history, since q usually is pronounced like k, and q in Pinyin is used with an initial that, although now a ch, was actually an original k. If that was the intention, in the use of q, it was cleverly done.

Categories of Chinese Characters