Variant Chinese character
Variant Chinese characters (Hepburn: itaiji) are Chinese characters that are homophones and synonyms of one another but differ in written form. Almost all variants are allographs in most circumstances, such as casual handwriting. Some contexts, such as textbook editing, require the use of particular variants.
Regional standards
Variant Chinese characters exist within and across all regions where Chinese characters are used, whether Chinese-speaking (mainland China, Hong Kong, Macau, Taiwan, Singapore), Japanese-speaking (Japan), or Korean-speaking (North Korea, South Korea). Some of the governments of these regions have made efforts to standardize the use of variants by establishing certain variants as standard. The choice of which variants to use has resulted in some divergence in the forms of Chinese characters used in mainland China, Hong Kong, Taiwan, Japan, and Korea. This effect compounds the sometimes drastic divergence in the standard Chinese character sets of these regions that results from the character simplifications pursued by mainland China and by Japan (shinjitai).
The standard character forms of each region are described in:
- The List of Commonly Used Characters in Modern Chinese for mainland China
- The List of Forms of Frequently Used Characters for Hong Kong
- The Standard Form of National Characters for Taiwan
- The list of Jōyō kanji for Japan
- The Kangxi Dictionary (de facto) for Korea
Orthodox and vulgar variants
Character forms that are regarded as most orthodox are known as orthodox characters or Kangxi Dictionary form, as the forms found in the Kangxi Dictionary are usually orthodox. Other variants are known as vulgar characters (Hepburn: zokuji).
Usage in computing
Unicode deals with variant characters in a complex manner, as a result of the process of Han unification. In Han unification, some variants that are nearly identical between Chinese-, Japanese-, and Korean-speaking regions are encoded in the same code point, and can only be distinguished using different typefaces. Other variants that are more divergent are encoded in different code points. On web pages, displaying the correct variants for the intended language depends on the typefaces installed on the computer, the configuration of the web browser, and the language tags of the web pages. Systems that are ready to display the correct variants are rare because many computer users do not have standard typefaces installed and the most popular web browsers are not configured to display the correct variants by default.
The following are some examples of variant forms of Chinese characters with different code points and language tags.
Different code points, Mainland China language tag | Different code points, Taiwan language tag | Different code points, Hong Kong language tag | Different code points, Japanese language tag |
---|---|---|---|
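
As a rough illustration of variants that are encoded in different code points, the following Python sketch prints the code points of two variant pairs mentioned in this article (搜/蒐 and 以/㠯) and checks whether Unicode normalization treats the members of each pair as the same text. The helper names `describe` and `same_after_normalization` are hypothetical and chosen only for this example; the check uses NFKC normalization from Python's standard `unicodedata` module, which is not expected to merge separately encoded variants such as these.

```python
import unicodedata

# Variant pairs taken from the article; each member is a separately encoded character.
VARIANT_PAIRS = [("搜", "蒐"), ("以", "㠯")]


def describe(char: str) -> str:
    """Return the character followed by its code point in U+XXXX notation."""
    return f"{char} U+{ord(char):04X}"


def same_after_normalization(a: str, b: str) -> bool:
    """Report whether NFKC normalization maps both strings to the same text."""
    return unicodedata.normalize("NFKC", a) == unicodedata.normalize("NFKC", b)


for first, second in VARIANT_PAIRS:
    # Print both code points and whether normalization treats them as equal.
    print(describe(first), describe(second), same_after_normalization(first, second))
```

Because such variants are distinct characters at the encoding level, software that should treat them as equivalent (for example in search) would need its own mapping between them.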
The following are some examples of variant forms of Chinese characters with the same code points and different language tags.
Same code point, different language tags | |||
---|---|---|---|
Mainland China language tag | Taiwan language tag | Hong Kong language tag | Japanese language tag |
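
For variants that share a code point, the language tag is what tells a browser which regional form to prefer. The following Python sketch generates HTML `span` elements that wrap the same character in four different `lang` attributes, one per column of the table above. The tag values (`zh-CN`, `zh-TW`, `zh-HK`, `ja`), the function name `tagged_spans`, and the choice of 骨 as a sample character are assumptions made for illustration; whether the four spans actually look different depends on the installed typefaces and the browser configuration, as described earlier.

```python
from html import escape

# Assumed language tags for the four columns of the table above.
LANGUAGE_TAGS = {
    "Mainland China": "zh-CN",
    "Taiwan": "zh-TW",
    "Hong Kong": "zh-HK",
    "Japanese": "ja",
}


def tagged_spans(char: str) -> list[str]:
    """Wrap the same character in one <span> per language tag."""
    return [
        f'<span lang="{tag}" title="{escape(label)}">{escape(char)}</span>'
        for label, tag in LANGUAGE_TAGS.items()
    ]


# 骨 is an assumed sample character; every span contains the identical code point.
for line in tagged_spans("骨"):
    print(line)
```

Only the `lang` attribute differs between the generated elements, so any visible difference comes from the typeface the browser selects for each language.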
Graphemic variants
Some variants are not allographic. For a set of variants to be allographs, someone who can read one should be able to read the others, but some variants cannot be read by someone who knows only one of them. An example is 搜 and 蒐, where someone who is able to read 搜 might not be able to read 蒐. Another example is 㠯, which is a variant of 以, but some people who can read 以 might not be able to read 㠯.