4.2: The form of characters
- Page ID
- 89619
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\( \newcommand{\dsum}{\displaystyle\sum\limits} \)
\( \newcommand{\dint}{\displaystyle\int\limits} \)
\( \newcommand{\dlim}{\displaystyle\lim\limits} \)
\( \newcommand{\id}{\mathrm{id}}\) \( \newcommand{\Span}{\mathrm{span}}\)
( \newcommand{\kernel}{\mathrm{null}\,}\) \( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\) \( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\) \( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\id}{\mathrm{id}}\)
\( \newcommand{\Span}{\mathrm{span}}\)
\( \newcommand{\kernel}{\mathrm{null}\,}\)
\( \newcommand{\range}{\mathrm{range}\,}\)
\( \newcommand{\RealPart}{\mathrm{Re}}\)
\( \newcommand{\ImaginaryPart}{\mathrm{Im}}\)
\( \newcommand{\Argument}{\mathrm{Arg}}\)
\( \newcommand{\norm}[1]{\| #1 \|}\)
\( \newcommand{\inner}[2]{\langle #1, #2 \rangle}\)
\( \newcommand{\Span}{\mathrm{span}}\) \( \newcommand{\AA}{\unicode[.8,0]{x212B}}\)
\( \newcommand{\vectorA}[1]{\vec{#1}} % arrow\)
\( \newcommand{\vectorAt}[1]{\vec{\text{#1}}} % arrow\)
\( \newcommand{\vectorB}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vectorC}[1]{\textbf{#1}} \)
\( \newcommand{\vectorD}[1]{\overrightarrow{#1}} \)
\( \newcommand{\vectorDt}[1]{\overrightarrow{\text{#1}}} \)
\( \newcommand{\vectE}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash{\mathbf {#1}}}} \)
\( \newcommand{\vecs}[1]{\overset { \scriptstyle \rightharpoonup} {\mathbf{#1}} } \)
\( \newcommand{\vecd}[1]{\overset{-\!-\!\rightharpoonup}{\vphantom{a}\smash {#1}}} \)
\(\newcommand{\avec}{\mathbf a}\) \(\newcommand{\bvec}{\mathbf b}\) \(\newcommand{\cvec}{\mathbf c}\) \(\newcommand{\dvec}{\mathbf d}\) \(\newcommand{\dtil}{\widetilde{\mathbf d}}\) \(\newcommand{\evec}{\mathbf e}\) \(\newcommand{\fvec}{\mathbf f}\) \(\newcommand{\nvec}{\mathbf n}\) \(\newcommand{\pvec}{\mathbf p}\) \(\newcommand{\qvec}{\mathbf q}\) \(\newcommand{\svec}{\mathbf s}\) \(\newcommand{\tvec}{\mathbf t}\) \(\newcommand{\uvec}{\mathbf u}\) \(\newcommand{\vvec}{\mathbf v}\) \(\newcommand{\wvec}{\mathbf w}\) \(\newcommand{\xvec}{\mathbf x}\) \(\newcommand{\yvec}{\mathbf y}\) \(\newcommand{\zvec}{\mathbf z}\) \(\newcommand{\rvec}{\mathbf r}\) \(\newcommand{\mvec}{\mathbf m}\) \(\newcommand{\zerovec}{\mathbf 0}\) \(\newcommand{\onevec}{\mathbf 1}\) \(\newcommand{\real}{\mathbb R}\) \(\newcommand{\twovec}[2]{\left[\begin{array}{r}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\ctwovec}[2]{\left[\begin{array}{c}#1 \\ #2 \end{array}\right]}\) \(\newcommand{\threevec}[3]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\cthreevec}[3]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \end{array}\right]}\) \(\newcommand{\fourvec}[4]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\cfourvec}[4]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \end{array}\right]}\) \(\newcommand{\fivevec}[5]{\left[\begin{array}{r}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\cfivevec}[5]{\left[\begin{array}{c}#1 \\ #2 \\ #3 \\ #4 \\ #5 \\ \end{array}\right]}\) \(\newcommand{\mattwo}[4]{\left[\begin{array}{rr}#1 \amp #2 \\ #3 \amp #4 \\ \end{array}\right]}\) \(\newcommand{\laspan}[1]{\text{Span}\{#1\}}\) \(\newcommand{\bcal}{\cal B}\) \(\newcommand{\ccal}{\cal C}\) \(\newcommand{\scal}{\cal S}\) \(\newcommand{\wcal}{\cal W}\) \(\newcommand{\ecal}{\cal E}\) \(\newcommand{\coords}[2]{\left\{#1\right\}_{#2}}\) \(\newcommand{\gray}[1]{\color{gray}{#1}}\) \(\newcommand{\lgray}[1]{\color{lightgray}{#1}}\) \(\newcommand{\rank}{\operatorname{rank}}\) \(\newcommand{\row}{\text{Row}}\) \(\newcommand{\col}{\text{Col}}\) \(\renewcommand{\row}{\text{Row}}\) \(\newcommand{\nul}{\text{Nul}}\) \(\newcommand{\var}{\text{Var}}\) \(\newcommand{\corr}{\text{corr}}\) \(\newcommand{\len}[1]{\left|#1\right|}\) \(\newcommand{\bbar}{\overline{\bvec}}\) \(\newcommand{\bhat}{\widehat{\bvec}}\) \(\newcommand{\bperp}{\bvec^\perp}\) \(\newcommand{\xhat}{\widehat{\xvec}}\) \(\newcommand{\vhat}{\widehat{\vvec}}\) \(\newcommand{\uhat}{\widehat{\uvec}}\) \(\newcommand{\what}{\widehat{\wvec}}\) \(\newcommand{\Sighat}{\widehat{\Sigma}}\) \(\newcommand{\lt}{<}\) \(\newcommand{\gt}{>}\) \(\newcommand{\amp}{&}\) \(\definecolor{fillinmathshade}{gray}{0.9}\)Characters are the primary unit for writing Chinese. Just as English letters may have several forms (eg g /g, a/a) and styles (eg italic), so Chinese characters also have various realizations. Some styles that developed in early historical periods survive to this day in special functions. Seals, for example, are still often inscribed in the ‘seal script’, first developed during the Qin dynasty (3rd C. BCE). Other impressionistic, running scripts, developed by calligraphers, are still used in handwriting and art. Advertisements and shop signs may stretch or contort graphs for their own design purposes. Manga style comics animate onomatopoeic characters – characters that represent sound – in idiosyncratic ways. Putting such variants aside, it is estimated that the number of characters appearing in modern texts is about 6-7000 (cf. Hannas 1997, pp 130-33, and particularly table 3). Though it is far fewer than the number cited in the largest historical dictionaries, which include characters from all historical periods, it is still a disturbingly large number.
Radicals and phonetics
There are ameliorating factors that make the Chinese writing system more learnable than it might otherwise be. One of the most significant is the fact that characters have elements in common; not just a selection of strokes, but also larger constituents. Between 2/3 and ¾ of common characters (cf. DeFrancis 1984, p. 110 and passim) consist of two elements, both of which can also stand alone as characters in their own right. Historically, these elements are either roots, in which case they are called ‘phonetics’, or classifiers, in which case they are called (paradoxically) ‘radicals’. Thus, 忘 wàng ‘forget’ contains 亡 as phonetic and 心 as classifier; 語 yǔ ‘language’ has 吾 and 言. The significance of the terms phonetic and classifier will be discussed in a later unit. For now, it is enough to know that the basic graphs are components of a large number of compound graphs: 亡 appears in 忙 and 氓, for example; 心 in 志 and 忠; 言 in 謝 and 說; 吾 in 悟 and 晤. Even this set of component graphs numbers in the high hundreds, but familiarity with them allows many characters to be learned as a pairing of higher order constituents rather than a composite of strokes.
Simplified characters
Chinese policy makers have also tried to make the writing system more learnable by introducing the Chinese equivalent of spelling reform, which takes the form of reducing the number of strokes in complicated characters: 國 becomes 国; 邊 becomes 边. The two sets are usually called ‘traditional’ and ‘simplified’ in English, fántǐzì (‘complicatedbody-characters’) and jiǎntǐzì (‘simple-body-characters’) in Chinese.
For almost 2000 years in China, serious genres of writing were written in the kǎishū script (‘model writing’) that first appeared in the early centuries of the first millennium. In the 1950s, the Mainland government, seeking to increase literacy, formalized a set of simplified characters to replace many of the more complicated of the traditional forms. Many of these simplified characters were based on calligraphic and other styles in earlier use; but others were novel graphs that followed traditional patterns of character creation.
For the learner, this simplification is a mixed blessing – and possibly no blessing at all. For while it ostensibly makes writing characters simpler, it also made them less redundant for reading: 樂 and 東 (used to write the words for ‘music’ and ‘east’, respectively) are quite distinct in the traditional set; but their simplified versions, 乐 and 东, are easy to confuse. Moreover, Chinese communities did not all agree on the new reforms. The simplified set, along with horizontal writing, was officially adopted by the PRC in the late 1950s and (for most purposes) by Singapore in the 1960s. But Taiwan, most overseas Chinese communities and, until its return to the PRC, Hong Kong, retained the traditional set of characters as their standard, along with vertical writing.
Jiǎntǐzì and fántǐzì should not be thought of as two writing systems, for not only are there many characters with only one form (也 yě, 很 hěn, 好 hǎo, etc), but of those that have two forms, the vast majority exhibit only minor, regular differences, eg: 说/說, 饭/飯. What remain are perhaps 3 dozen relatively common characters with distinctively divergent forms, such as: 这/這, 买/買. Careful inspection reveals that even they often have elements in common. For native Chinese readers, the two systems represent only a minor inconvenience, rather like the difference between capital and small letters in the Roman alphabet, though on a larger scale. Learners generally focus on one system for writing, but soon get used to reading in both.

