Unicode typefaces
Encyclopedia
A Unicode font is a computer font
that contains a wide range of characters
, letters
, digit
s, glyphs, symbols, ideograms, logograms, etc., which are collectively mapped into the standard Universal Character Set
, derived from many different languages and scripts
from around the world. Unlike most conventional computer fonts, which are specific to a particular language or legacy character set and contain only a small subset of the UCS characters, these fonts attempt to include many thousands of possible glyphs, so that they can be used as a single typeface across multi-lingual documents.
standard does not specify or create the font (typeface
), a collection of graphical shapes called glyphs, itself. Rather, it defines the abstract characters as a specific number (known as a codepoint) and also defines the required changes of shape depending on the context the glyph is used in (e.g., Combining character
s, precomposed character
s and letter
-diacritic
combinations). The choice of font, which governs how the abstract UCS characters are converted into a bitmap or vector output that can be viewed on a screen or printed, is left up to the user. If a font is chosen which does not contain a glyph for a codepoint used in the document, typically a question mark ("?"), a box, or some other Substitute character
is displayed.
Computer font
s use various techniques to display characters or glyphs. A Bitmap font contains a grid of dots known as pixel
s forming an image of each glyph in each face and size. Outline fonts (also known as Vector fonts) use drawing instructions or mathematical formulæ to describe each glyph. Stroke fonts use a series of specified lines (for the glyph's border) and additional information to define the profile, or size and shape of the line in a specific face and size, which together describe the appearance of the glyph.
Many fonts have kerning
pairs which implements better spacing in between the characters. Fonts also includes embedded special orthographic
rules to output certain combinations of letterforms (an alternative symbols for the same letter) be combined into special ligature forms
(mixed characters). Operating System
or Web-Browser (aka, User Agent
) or both, which uses a font to display text on the screen or print media, can be programmed to use those embedded rules, or use external script-shaping technologies (also known as Rendering Technology or Smartfont Engine), and they can also be programmed to use either a large unicode font, or use multiple different fonts for different characters or languages.
No single "Unicode font" includes all the characters defined in the present revision of ISO 10646 (Unicode) standard, as it is continually adding more & more languages and characters. As a result, font developers and foundries are also incorporating those new characters in newer version or revision of a font, and correcting their previous errors if there were any.
The UCS has over 1.1 million code points, but only the first 65,536 (the Plane 0: Basic Multilingual Plane, or BMP) had entered into common use before 2000. (See the Mapping of Unicode characters
article for more information on other planes, including Plane 1: SMP, Plane 2: SIP, Plane 14: SSP, Plane 15 and 16: reserved for PUA.)
The first Unicode fonts (with very large character set, and supporting many Unicode blocks) were Lucida Sans Unicode
(released March 1993), Unihan font
(1993), and Everson Mono
(1995).
characters (seen in Chinese, Japanese, and Korean) will be typographically different in different regions. For example, Unicode point U+9AA8 (骨) is typographically different between simplified Chinese and traditional Chinese. This has implications for the idea that a single typeface can satisfy the needs of all locales.
The design of Unicode ensures that such differences do not create semantic ambiguity, but the use of incorrect forms is often considered inappropriate to native readers of East Asian languages.
is now the base character set for many new standards and protocols, and is built into the architecture of operating systems (Microsoft Windows
, Apple
Mac OS X
, and many versions of Unix
), programming languages (Ada
, Perl
, Python, Java
, Common LISP
, APL
), and libraries (IBM International Components for Unicode
(ICU) along with the Pango
, Graphite
, Scribe
, Uniscribe
, and ATSUI
rendering engines), font formats (TrueType
and OpenType
) and so on. Many other standards are also getting upgraded to Unicode compliance, day by day.
. More Unicode fonts can be found in the (List of typefaces) article's "Unicode fonts" section.
Note
. The cells then show the number of those codes which are covered by each font.
Computer font
A computer font is an electronic data file containing a set of glyphs, characters, or symbols such as dingbats. Although the term font first referred to a set of metal type sorts in one style and size, since the 1990s it is generally used to refer to a scalable set of digital shapes that may be...
that contains a wide range of characters
Character (computing)
In computer and machine-based telecommunications terminology, a character is a unit of information that roughly corresponds to a grapheme, grapheme-like unit, or symbol, such as in an alphabet or syllabary in the written form of a natural language....
, letters
Letter (alphabet)
A letter is a grapheme in an alphabetic system of writing, such as the Greek alphabet and its descendants. Letters compose phonemes and each phoneme represents a phone in the spoken form of the language....
, digit
Digit (unit)
The digit or finger is an ancient and obsolete non-SI unit of measurement of length. It was originally based on the breadth of a human finger...
s, glyphs, symbols, ideograms, logograms, etc., which are collectively mapped into the standard Universal Character Set
Universal Character Set
The Universal Character Set , defined by the International Standard ISO/IEC 10646, Information technology — Universal multiple-octet coded character set , is a standard set of characters upon which many character encodings are based...
, derived from many different languages and scripts
Writing system
A writing system is a symbolic system used to represent elements or statements expressible in language.-General properties:Writing systems are distinguished from other possible symbolic communication systems in that the reader must usually understand something of the associated spoken language to...
from around the world. Unlike most conventional computer fonts, which are specific to a particular language or legacy character set and contain only a small subset of the UCS characters, these fonts attempt to include many thousands of possible glyphs, so that they can be used as a single typeface across multi-lingual documents.
Background
The UnicodeUnicode
Unicode is a computing industry standard for the consistent encoding, representation and handling of text expressed in most of the world's writing systems...
standard does not specify or create the font (typeface
Typeface
In typography, a typeface is the artistic representation or interpretation of characters; it is the way the type looks. Each type is designed and there are thousands of different typefaces in existence, with new ones being developed constantly....
), a collection of graphical shapes called glyphs, itself. Rather, it defines the abstract characters as a specific number (known as a codepoint) and also defines the required changes of shape depending on the context the glyph is used in (e.g., Combining character
Combining character
In digital typography, combining characters are characters that are intended to modify other characters. The most common combining characters in the Latin script are the combining diacritical marks ....
s, precomposed character
Precomposed character
A precomposed character is a Unicode entity that can be defined as a combination of two or more other characters. A precomposed character may typically represent a letter with a diacritical mark, such as é...
s and letter
Letter (alphabet)
A letter is a grapheme in an alphabetic system of writing, such as the Greek alphabet and its descendants. Letters compose phonemes and each phoneme represents a phone in the spoken form of the language....
-diacritic
Diacritic
A diacritic is a glyph added to a letter, or basic glyph. The term derives from the Greek διακριτικός . Diacritic is both an adjective and a noun, whereas diacritical is only an adjective. Some diacritical marks, such as the acute and grave are often called accents...
combinations). The choice of font, which governs how the abstract UCS characters are converted into a bitmap or vector output that can be viewed on a screen or printed, is left up to the user. If a font is chosen which does not contain a glyph for a codepoint used in the document, typically a question mark ("?"), a box, or some other Substitute character
Substitute character
A substitute character is a control character that is used in the place of a character that is recognized to be invalid or in error or that cannot be represented on a given device....
is displayed.
Computer font
Computer font
A computer font is an electronic data file containing a set of glyphs, characters, or symbols such as dingbats. Although the term font first referred to a set of metal type sorts in one style and size, since the 1990s it is generally used to refer to a scalable set of digital shapes that may be...
s use various techniques to display characters or glyphs. A Bitmap font contains a grid of dots known as pixel
Pixel
In digital imaging, a pixel, or pel, is a single point in a raster image, or the smallest addressable screen element in a display device; it is the smallest unit of picture that can be represented or controlled....
s forming an image of each glyph in each face and size. Outline fonts (also known as Vector fonts) use drawing instructions or mathematical formulæ to describe each glyph. Stroke fonts use a series of specified lines (for the glyph's border) and additional information to define the profile, or size and shape of the line in a specific face and size, which together describe the appearance of the glyph.
Many fonts have kerning
Kerning
In typography, kerning is the process of adjusting the spacing between characters in a proportional font, usually to achieve a visually pleasing result. Kerning is the adjustment of the space between individual letter forms vs. tracking which is the uniform adjustment of spacing applied over a...
pairs which implements better spacing in between the characters. Fonts also includes embedded special orthographic
Orthographic
Orthographic may refer to:* Orthographic projection** Orthographic projection ** Orthographic projection * Orthography...
rules to output certain combinations of letterforms (an alternative symbols for the same letter) be combined into special ligature forms
Ligature (typography)
In writing and typography, a ligature occurs where two or more graphemes are joined as a single glyph. Ligatures usually replace consecutive characters sharing common components and are part of a more general class of glyphs called "contextual forms", where the specific shape of a letter depends on...
(mixed characters). Operating System
Operating system
An operating system is a set of programs that manage computer hardware resources and provide common services for application software. The operating system is the most important type of system software in a computer system...
or Web-Browser (aka, User Agent
User agent
In computing, a user agent is a client application implementing a network protocol used in communications within a client–server distributed computing system...
) or both, which uses a font to display text on the screen or print media, can be programmed to use those embedded rules, or use external script-shaping technologies (also known as Rendering Technology or Smartfont Engine), and they can also be programmed to use either a large unicode font, or use multiple different fonts for different characters or languages.
No single "Unicode font" includes all the characters defined in the present revision of ISO 10646 (Unicode) standard, as it is continually adding more & more languages and characters. As a result, font developers and foundries are also incorporating those new characters in newer version or revision of a font, and correcting their previous errors if there were any.
The UCS has over 1.1 million code points, but only the first 65,536 (the Plane 0: Basic Multilingual Plane, or BMP) had entered into common use before 2000. (See the Mapping of Unicode characters
Mapping of Unicode characters
Unicode’s Universal Character Set has a potential capacity to support over 1 million characters. Each UCS character is mapped to a code point which is an integer between 0 and 1,114,111 used to represent each character within the internal logic of text processing software .As of Unicode 5.2.0,...
article for more information on other planes, including Plane 1: SMP, Plane 2: SIP, Plane 14: SSP, Plane 15 and 16: reserved for PUA.)
The first Unicode fonts (with very large character set, and supporting many Unicode blocks) were Lucida Sans Unicode
Lucida Sans Unicode
In digital typography, Lucida Sans Unicode OpenType font from the design studio of Bigelow & Holmes is designed to support the most commonly used characters defined in version 2.0 of the Unicode standard...
(released March 1993), Unihan font
Unihan font
Unihan font was developed by Ross Paterson in 1993.Unihan font had two variations, 16x16 and 24x24 pixel fonts. These covered most of theCJK Auxiliary and UniHan portions of Unicode. Font files were in HBF format...
(1993), and Everson Mono
Everson Mono
Everson Mono is a monospaced humanist sans serif Unicode font whose development by Michael Everson began in 1995. At first, Everson Mono was a collection of 8-bit fonts containing glyphs for tables in ISO/IEC 10646; at that time, it was not easy to edit cmaps to have true Unicode indices, and there...
(1995).
Issues
There are typographical ambiguities in Unicode, so that some of the unified HanHan unification
Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the so-called CJK languages into a single set of unified characters. Han characters are a common feature of written Chinese , Japanese , Korean , and—at least historically—other...
characters (seen in Chinese, Japanese, and Korean) will be typographically different in different regions. For example, Unicode point U+9AA8 (骨) is typographically different between simplified Chinese and traditional Chinese. This has implications for the idea that a single typeface can satisfy the needs of all locales.
The design of Unicode ensures that such differences do not create semantic ambiguity, but the use of incorrect forms is often considered inappropriate to native readers of East Asian languages.
Application of Unicode fonts
Despite all the issues, UnicodeUnicode
Unicode is a computing industry standard for the consistent encoding, representation and handling of text expressed in most of the world's writing systems...
is now the base character set for many new standards and protocols, and is built into the architecture of operating systems (Microsoft Windows
Microsoft Windows
Microsoft Windows is a series of operating systems produced by Microsoft.Microsoft introduced an operating environment named Windows on November 20, 1985 as an add-on to MS-DOS in response to the growing interest in graphical user interfaces . Microsoft Windows came to dominate the world's personal...
, Apple
Apple Computer
Apple Inc. is an American multinational corporation that designs and markets consumer electronics, computer software, and personal computers. The company's best-known hardware products include the Macintosh line of computers, the iPod, the iPhone and the iPad...
Mac OS X
Mac OS X
Mac OS X is a series of Unix-based operating systems and graphical user interfaces developed, marketed, and sold by Apple Inc. Since 2002, has been included with all new Macintosh computer systems...
, and many versions of Unix
Unix
Unix is a multitasking, multi-user computer operating system originally developed in 1969 by a group of AT&T employees at Bell Labs, including Ken Thompson, Dennis Ritchie, Brian Kernighan, Douglas McIlroy, and Joe Ossanna...
), programming languages (Ada
Ada (programming language)
Ada is a structured, statically typed, imperative, wide-spectrum, and object-oriented high-level computer programming language, extended from Pascal and other languages...
, Perl
Perl
Perl is a high-level, general-purpose, interpreted, dynamic programming language. Perl was originally developed by Larry Wall in 1987 as a general-purpose Unix scripting language to make report processing easier. Since then, it has undergone many changes and revisions and become widely popular...
, Python, Java
Java (programming language)
Java is a programming language originally developed by James Gosling at Sun Microsystems and released in 1995 as a core component of Sun Microsystems' Java platform. The language derives much of its syntax from C and C++ but has a simpler object model and fewer low-level facilities...
, Common LISP
Common Lisp
Common Lisp, commonly abbreviated CL, is a dialect of the Lisp programming language, published in ANSI standard document ANSI INCITS 226-1994 , . From the ANSI Common Lisp standard the Common Lisp HyperSpec has been derived for use with web browsers...
, APL
APL programming language
APL is an interactive array-oriented language and integrated development environment, which is available from a number of commercial and noncommercial vendors and for most computer platforms. It is based on a mathematical notation developed by Kenneth E...
), and libraries (IBM International Components for Unicode
International Components for Unicode
International Components for Unicode is an open source project of mature C/C++ and Java libraries for Unicode support, software internationalization and software globalization. ICU is widely portable to many operating systems and environments. It gives applications the same results on all...
(ICU) along with the Pango
Pango
Pango is an LGPL licensed open source computing library used by software developers for laying out and rendering text in high quality, emphasising support for multilingual text...
, Graphite
Graphite (SIL)
Graphite is a programmable Unicode-compliant smart-font technology and rendering system developed by SIL International. It is free software, distributed under the terms of the GNU Lesser General Public License and the Common Public License....
, Scribe
Qt (toolkit)
Qt is a cross-platform application framework that is widely used for developing application software with a graphical user interface , and also used for developing non-GUI programs such as command-line tools and consoles for servers...
, Uniscribe
Uniscribe
Uniscribe is the Microsoft Windows set of services for rendering Unicode-encoded text, especially complex text layout. They are implemented in the DLL USP10.DLL. USP10.dll became available to the public with Windows 2000 and Internet Explorer 5.0...
, and ATSUI
Apple Type Services for Unicode Imaging
The Apple Type Services for Unicode Imaging is the set of services for rendering Unicode-encoded text starting with Mac OS 8.5 and in Mac OS X.It replaced the WorldScript engine for legacy encodings....
rendering engines), font formats (TrueType
TrueType
TrueType is an outline font standard originally developed by Apple Computer in the late 1980s as a competitor to Adobe's Type 1 fonts used in PostScript...
and OpenType
OpenType
OpenType is a format for scalable computer fonts. It was built on its predecessor TrueType, retaining TrueType's basic structure and adding many intricate data structures for prescribing typographic behavior...
) and so on. Many other standards are also getting upgraded to Unicode compliance, day by day.
Utility software
Utility software can be used to see exactly which characters are included inside a font file:- Character MapCharacter MapCharacter Map is a utility included with Microsoft Windows operating systems and is used to view the characters in any installed font, to check what keyboard input is used to enter those characters, and to copy characters to the clipboard in lieu of typing them. The tool is usually useful for...
applet included with Windows - Font BookFont BookApple's Font Book is a font manager that comes with Mac OS X since version 10.3 .The program enables users to:* Preview and install font files, avoiding multiple instances of the same font....
application included with Mac OS X - BabelMap application, for Windows
- FontForgeFontForgeFontForge is a typeface editor program developed by George Williams. FontForge is free software and is distributed under the BSD license. FontForge is available for several operating systems and is localized in several languages.- Features :Fontforge supports many font formats, including...
application (Font Creator), for Linux, Unix, Windows
List of Unicode fonts
Of the many Unicode fonts available, the few ones listed below are the most commonly used by a majority of users around the world on mainstream computing platformsPlatform (computing)
A computing platform includes some sort of hardware architecture and a software framework , where the combination allows software, particularly application software, to run...
. More Unicode fonts can be found in the (List of typefaces) article's "Unicode fonts" section.
Font | Char(s) | Glyphs | Kernpairs (Standard) |
Point (Range) : Hinting/ Smoothing Behavior | Version & Date | Filename, Size |
Font Family | Font Weight, style |
Font type |
Serif style |
License | Creator / Author (Copyright) | Notes |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Arial Arial Arial, sometimes marketed or displayed in software as Arial MT, is a sans-serif typeface and set of computer fonts. Fonts from the Arial family are packaged with Microsoft Windows, some other Microsoft software applications, Apple Mac OS X and many PostScript 3 computer printers... |
3,415 | 3,415 | 909 | v5.06 2009-01-20 |
arial.ttf (756 KB) |
Arial | Medium (Normal), Regular | OTF+TTO | Normal Sans | Proprietary | (2008) The Monotype Corporation. | ||
Arial Unicode MS Arial Unicode MS In digital typography, the TrueType font Arial Unicode MS is an extended version of the font Arial. Compared to Arial, it includes higher line height, omits kerning pairs and adds enough glyphs to cover a large subset of Unicode 2.1—thus supporting most Microsoft code pages, but also requiring much... |
38,917 | 50,377 | 0 | v1.01 sfnt rev 1 2002-11-18 |
ARIALUNI.TTF (22.1 MB) |
Arial Unicode MS | Medium (Normal), Regular | OTF+TTO | Normal Sans | Proprietary | Agfa Monotype Corporation. Monotype Corporation. |
||
Bitstream Cyberbit Bitstream Cyberbit Bitstream Cyberbit is a commercial Unicode font designed by Bitstream Inc. It is freeware for non-commercial uses. It was historically one of the first widely available fonts with support for a large proportion of the Unicode repertoire.... |
32,961 | 29,934 | 935 | v2.0 beta 1998-03-22 |
Cyberbit.ttf (12.7 MB) |
Bitstream Cyberbit | Medium (Normal), Roman | TTF | Cove | Freeware | Bitstream Inc. | ||
BitstreamCyberCJK | 30,275 | 28,686 | 935 | v2.0 beta 1998-03-17 |
Cyberbit.ttf (12.4 MB) |
Bitstream CyberCJK | Medium (Normal), Book, Roman | TTF | Cove | Freeware | Bitstream Inc. | ||
Cardo | 2,879 | 2,882 | 216 | v0.099 sfnt rev 0.098999 2010-05-23 |
Cardo99s.ttf (706 KB) |
Cardo | Medium (Normal), Book, Regular | TTF | Cove | Freeware. SIL Open Font License v1.1. | David J. Perry | ||
Caslon Roman Caslon Roman Caslon Roman is a serif style Caslon family TrueType Unicode font, developed by George Williams. It is available free, under BSD-like license or SIL Open Font License.-Character Coverage:... |
3,683 | 3,686 | 0 | v001.000 2003-10-23 |
(3.70 MB) |
Caslon | Roman | TTF | BSD-like license | George Williams | |||
Code2000 Code2000 Code2000 is a pan-Unicode digital font, which includes characters and symbols from a very large range of writing systems. As of the current final version 1.171 released in 2008, Code2000 is designed and implemented by James Kass to include as much of the Unicode 5.2 standard as practical , and to... |
53,068 | 63,546 | 239 | v1.171 sfnt rev 1.1709 2008-06-12 |
CODE2000.TTF (7.98 MB) |
Code2000 | Medium (Normal), Book, Regular | TTF | Any | Shareware (unrestricted) |
James Kass | ||
Charis SIL Charis SIL Charis SIL is a glyphic serif typeface developed by SIL International. It is based on Bitstream Charter, one of the first fonts designed for laser printers... |
2,172 | 4,661 | 0 | v4.106 sfnt rev 4.106 2009-03-18 |
CharisSILR.ttf (1.57 MB) |
Charis SIL | Medium (Normal), Regular | TTF | Any | OFL SIL Open Font License The SIL Open Font License is a free and open source license designed for fonts by SIL International for use with some of their Unicode fonts. The license is considered free by the Free Software Foundation, which states that a simple hello world program is enough to satisfy the license's requirement... |
SIL International | ||
Chryſanþi Unicode (Chrysanthi Unicode) | 4,818 | 4,383 | 0 | v3.1 2001-07-15 |
chrysuni.ttf (737 KB) |
Chrysanthi Unicode | Medium (Normal), Regular | TTF | Any | Freeware | Dolgthrasir Mioethdrauci. Every Witch Way. |
||
ClearlyU ClearlyU ClearlyU is a bitmap unicode font developed by Mark Leisher.It contains 9,538 glyphs in v1.9. Includes characters from following unicode ranges: Basic Latin, Latin-1 Supplement, Latin Extended-B, IPA Extensions, Spacing Modifier Letters, Combining Diacritical Marks, Greek, Cyrillic, Armenian,... |
9,538 |
0 | v1.9 2002-11-22 |
cu12.bdf (1.13 MB) |
ClearlyU | Medium | Freeware | Mark Leisher. | |||||
DejaVu Sans DejaVu fonts The DejaVu fonts are modifications of the Bitstream Vera fonts designed for greater coverage of Unicode, as well as providing more styles. The Bitstream Vera family was limited mainly to the characters in the Basic Latin and Latin-1 Supplement portions of Unicode but was released with a license... |
5,467 | 5,762 | 2,637 | v2.32 sfnt rev 2.31999 2010-08-22 |
DejaVuSans.ttf (667 KB) |
DejaVu Sans | Medium (Normal), Book | OTF+TTO | Normal Sans | Bitstream Vera Bitstream Vera Vera is a group typeface with a liberal license.It was designed by Jim Lyles from Bitstream, and it is closely based on Bitstream's Prima, for which Lyles was also responsible. It is a TrueType font with full hinting instructions, which improve its rendering quality on low-resolution devices such... license and public domain for additions |
Bitstream, Inc. Tavmjong Bah. public domain. |
||
Doulos SIL Doulos SIL Doulos SIL is a serif typeface developed by SIL International, very similar to Times or Times New Roman. Unlike Times New Roman, Doulos only has a single face, Regular... |
2,172 | 4,661 | 0 | v4.106 2009-03-18 |
DoulosSILR.ttf (1.64 MB) |
Doulos SIL | Medium (Normal), Book, Regular | TTF | Any | OFL SIL Open Font License The SIL Open Font License is a free and open source license designed for fonts by SIL International for use with some of their Unicode fonts. The license is considered free by the Free Software Foundation, which states that a simple hello world program is enough to satisfy the license's requirement... |
SIL International | ||
Everson Mono Everson Mono Everson Mono is a monospaced humanist sans serif Unicode font whose development by Michael Everson began in 1995. At first, Everson Mono was a collection of 8-bit fonts containing glyphs for tables in ISO/IEC 10646; at that time, it was not easy to edit cmaps to have true Unicode indices, and there... |
6,391 | 6,399 | 0 | v5.203 sfnt rev v5.203 2010-07-03 |
Everson Mono.ttf (1.91 MB) |
Everson Mono | Medium (Normal), Regular | TTF | Any | Shareware (unrestricted) |
Michael Everson | ||
FreeSerif | 7,203 | 8,995 | 36,302 | v1.548 sfnt rev 1.54799 2010-09-19 |
FreeSerif.ttf (1.60 MB) |
FreeSerif | Medium (Normal) | TTF | Cove | GPL | Free Software Foundation. | ||
Gentium Regular Gentium Gentium is a Unicode serif typeface designed by Victor Gaultney. Gentium fonts are free and open source software, and are released under the SIL Open Font License , which permits modification and redistribution. Gentium has wide support for languages using the Latin, Greek, and Cyrillic alphabets,... |
1,469 | 1,699 | 2,857 | v1.02; 2005; sfnt rev 1.02 2005-11-20 |
GenR102.TTF (354 KB) |
Gentium | Book, Regular | TTF | Any | OFL SIL Open Font License The SIL Open Font License is a free and open source license designed for fonts by SIL International for use with some of their Unicode fonts. The license is considered free by the Free Software Foundation, which states that a simple hello world program is enough to satisfy the license's requirement... |
J. Victor Gaultney. SIL International | ||
Gentium Plus | 2,513 | 5,586 | 0 | v1.502 sfnt rev 1.502 2010-8-19 |
GentiumPlus-R.ttf (1.54 MB) |
Gentium Plus | Medium (Normal), Book, Regular | TTF | Any | OFL SIL Open Font License The SIL Open Font License is a free and open source license designed for fonts by SIL International for use with some of their Unicode fonts. The license is considered free by the Free Software Foundation, which states that a simple hello world program is enough to satisfy the license's requirement... |
SIL International | ||
GNU Unifont GNU Unifont The GNU Unifont by Roman Czyborra is a free bitmap font that covers the Unicode Basic Multilingual Plane , using an intermediate bitmapped font format.... (Unifont) |
63,446 | 63,449 | 0 | v1.00 sfnt rev 1 (5.1.20080914) 2008-09-07 |
(15.5 MB) |
unifont | Medium (Normal) | Bitmap, TTF | Any | GPL | Roman Czyborra | ||
HAN NOM A, HAN NOM B |
32,328 45,020 |
34,147 45,175 |
0 0 |
v2.0; 2005 sfnt rev 2 2005-02-05 |
(20.3 MB) (32.2 MB) |
HAN NOM A HAN NOM B |
Medium (Normal), Book, Regular | TTF | No Fit | Freeware | Chan Nguyen, Tv Vien Chieu, To Minh Tam. | ||
Junicode Junicode Junicode is a free old-style serif typeface developed by Peter S. Baker of the University of Virginia. The design is based on a 17th century typeface used in Oxford, England.... |
3,022 | 3,096 | v0.6.17 sfnt rev 0.599991 2009-07-11 |
(752 KB) |
Junicode | Medium (Normal), Book, Regular | TTF | Any | GPL | Peter S. Baker | |||
Linux Libertine Linux Libertine Linux Libertine is a digital typeface created by the Libertine Open Fonts Project, which aims to create free and open alternatives to Proprietary software typefaces such Times Roman... |
2,549 | 2,552 | 0 | v4.7.5 sfnt rev 4.7 2010-06-15 |
(893 KB) |
Linux Libertine | Medium (Normal), Book, Regular | OTF+TTO | Any | GPL, OFL SIL Open Font License The SIL Open Font License is a free and open source license designed for fonts by SIL International for use with some of their Unicode fonts. The license is considered free by the Free Software Foundation, which states that a simple hello world program is enough to satisfy the license's requirement... |
Philipp H. Poll | ||
Lucida Grande Lucida Grande Lucida Grande is a humanist sans-serif typeface. It is a member of the Lucida family of typefaces designed by Charles Bigelow and Kris Holmes. It has been used throughout Mac OS X user interface since 1999, as well as in Safari for Windows up to the browser's version 3.2.3 released on May 12,... |
2,245 | 2,826 | 0 | v5.0d8e1 sfnt rev 1.2 2003-03-07 |
Lucida Grande.ttf (1.07 MB) |
Lucida Grande | Medium (Normal), Regular | OTF | Normal Sans | Proprietary | Bigelow & Holmes Inc | ||
Lucida Sans Unicode Lucida Sans Unicode In digital typography, Lucida Sans Unicode OpenType font from the design studio of Bigelow & Holmes is designed to support the most commonly used characters defined in version 2.0 of the Unicode standard... |
1,765 | 1,776 | 0 | v2.00 1998-08-12 |
l_10646.ttf (316 KB) |
Lucida Sans | Medium (Normal), Regular | OTF+TTO | Normal Sans | Proprietary | Bigelow & Holmes Inc | ||
Microsoft JhengHei | 28,958 | 29,220 | 583 | v6.02 sfnt rev 6.02 2009-02-25 |
msjh.ttf (20.6 MB) |
Microsoft JhengHei | Medium (Normal), Book, Regular | OTF+TTO | Normal Sans | Proprietary | Microsoft Corporation | ||
Microsoft Sans Serif | 2,788 | 3,077 | 0 | v5.02 sfnt rev 5.02 2009-02-12 |
micross.ttf (637 KB) |
Microsoft Sans Serif | Medium (Normal), Book, Regular | OTF+TTO | Normal Sans | Proprietary | Microsoft Corporation | ||
New Gulim New Gulim New Gulim is a sans-serif type unicode font designed especially for the Korean-language script, designed by HanYang System Co., Limited . It is an expanded version of Hanyang Gulrim .... |
46,567 | 49,284 | 0 | v3.10 sfnt rev 1 2002-10-29 |
nGulim.ttf (24.5 MB) |
New Gulim | Medium (Normal), Regular | TTF | Normal Sans. Obtuse Cove | Proprietary | HanYang System Co.,LTD. | ||
Quivira | 7,882 | 7,398 | 0 | v3.3 sfnt rev 3.3 2010-03-28 |
Quivira.ttf (936 KB) |
Quivira | Medium (Normal), Book, Regular | TTF | Cove | Freeware | GrinningBit. | ||
STIX | 3,290 | 3,292 | 1,016 | v1.0.0 sfnt rev 1.0 2010-04-29 |
STIXGeneral.otf (404 KB) |
STIXGeneral | Medium (Normal), Book, Regular | OTF | Any | Freeware | |||
Sun-ExtA, Sun-ExtB |
50,112 47,564 |
50,016 47,660 |
0 0 |
(A) v5.4 sfnt rev 5 2009-01-03 (B) v5.4 sfnt rev 5.39999 2009-02-02 |
Sun-ExtA.ttf (21.9 MB) Sun-ExtB.ttf (16.8 MB) |
Sun-ExtA Sun-ExtB |
Medium (Normal), Book, Regular | TTF | No Fit | Both Freeware. Proprietary |
okuc. Beijing ZhongYi Electronics Co. |
||
Tahoma Tahoma (typeface) Tahoma is a humanist sans-serif typeface designed by Matthew Carter for the Microsoft Corporation in 1994 with initial distribution along with Verdana for Windows 95.... |
1,912 | 3,412 | 674 | v5.06 2009-02-12 |
tahoma.ttf (681 KB) |
Tahoma | Medium (Normal), Regular | OTF+TTO | Normal Sans | Proprietary | Microsoft Corporation | ||
Times New Roman | 2,790 | 3,414 | 867 | v5.05 2009-02-06 |
times.ttf (816 KB) |
Times New Roman | Medium (Normal), Regular | OTF+TTO | Cove | Proprietary; | The Monotype Corporation. | ||
TITUS Cyberbit Basic | 9,209 | 10,044 | 0 | v3.0 (2000) sfnt rev 4 2005-10-08 |
TITUSCBZ.TTF (1.91 MB) |
TITUS Cyberbit Basic | Medium (Normal), Regular | TTF | Cove | Freeware | Bitstream Inc / TITUS project. | ||
WenQuanYi WenQuanYi WenQuanYi is an open-source project of Chinese computer fonts licensed under GNU General Public License.-General:WenQuanYi project was started by Qianqian Fang , a Chinese biomedical imaging researcher at the Massachusetts General Hospital, in October, 2004.Now the fonts of WenQuanYi project are... Bitmap Song |
41,295 | 154,997 | 0 | 1.0.0-RC1 | WenQuanYi Bitmap Song | Regular | Multi-strike Bitmap Font | Song(Serif) Style for Chinese | GPL | ||||
WenQuanYi Zen Hei | 42,285 | 43,643 | 0 | v0.9.45 sfnt rev 0.89994 2010-03-11 |
wqy-zenhei.ttc (16.0 MB) |
WenQuanYi Zen Hei, WenQuanYi Zen Hei Mono and WenQuanYi Zen Hei Sharp | Medium, Regular | TTC | Hei(Sans) Style for Chinese | GPL | Qianqian Fang and WenQuanYi Board of Trustees. | ||
WenQuanYi Micro Hei | 34,707 | 48,755 | 0 | 0.2.0-beta | WenQuanYi Micro Hei and WenQuanYi Micro Hei Mono | Regular, Light | TTC | Hei(Sans) Style for Chinese | GPL | ||||
Y.OzFontN | 21,957 | 57,621 | 0 | v13.00 sfnt rev 5 Pen-Ji 2010-08-24 |
YOzRN.TTC (13.5 MB) |
YOzFontN | Regular | TTC | Any | Freeware | Y.OzVox | ||
XITS | 3,290 | 3,292 | 1,016 | v1.0.10 2011-04-25 |
xits-regular.otf (247 KB) |
XITS | Medium (Normal), Book, Regular | OTF | Any | OFL OFL OFL may stand for :* Off-line* Ontario Federation of Labour, a Canadian trade union federation.* Open Font Licence by SIL International* Overfishing level* Overflow... |
|||
Font | Char(s) | Glyphs | Kernpairs (Standard) |
Point (Range) : Hinting/ Smoothing Behavior | Version & Date | Filename, Size |
Font Family | Font weight, style |
Font type |
Serif style |
License | Creator / Author (Copyright) | Notes |
Note
- OTF+TTO: OpenTypeOpenTypeOpenType is a format for scalable computer fonts. It was built on its predecessor TrueType, retaining TrueType's basic structure and adding many intricate data structures for prescribing typographic behavior...
font with TrueTypeTrueTypeTrueType is an outline font standard originally developed by Apple Computer in the late 1980s as a competitor to Adobe's Type 1 fonts used in PostScript...
outlines. - OpenType fonts sometimes don't contain a one-by-one Kernpair table but a kern-by-classes table where groups of similar characters are seen as one kern group. For instance, V and W have nearly the same left and right geometry. So “0” doesn't mean that no kerning is supported!
Comparison of fonts
Number of characters included by the above version of fonts, for different Unicode blocks (or, ranges), are listed below. Basic Latin (128: 0000–007F) means that in the range called 'Basic Latin', there are 128 assigned codes, numbered 0 to 7FHexadecimal
In mathematics and computer science, hexadecimal is a positional numeral system with a radix, or base, of 16. It uses sixteen distinct symbols, most often the symbols 0–9 to represent values zero to nine, and A, B, C, D, E, F to represent values ten to fifteen...
. The cells then show the number of those codes which are covered by each font.
- Cells shaded green indicate complete coverage.
- Cells shaded blue are not complete, but are the most complete of the fonts listed.
- Empty cells indicate that no character exists in that block.
0000-077F
BMP (Plane 0) | ||||||||||||||||||||||||||||
Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Non-Unicode Glyphs | 703 | 23 | 1 | 295 | 8 | 4 | 76 | 3 | 589 | 263 | 443 | 624 | 704 | 866 | 954 | |||||||||||||
Unassigned Code Points (0000-11FFFF) | 261 | 2 | 1 | 10 | 2 | 1 | 7 | 1 | 1 | 2 | 1 | 1 | 32 | 67 | 1 | 37 | 118 | |||||||||||
Total Glyphs in BMP (60699: 0000–FFFF) | ||||||||||||||||||||||||||||
C0 Control Character (0000–001F) | 31 | 1 | 1 | 1 | 31 | 1 | 1 | 31 | 31 | 32 | ||||||||||||||||||
Basic Latin Latin alphabet The Latin alphabet, also called the Roman alphabet, is the most recognized alphabet used in the world today. It evolved from a western variety of the Greek alphabet called the Cumaean alphabet, which was adopted and modified by the Etruscans who ruled early Rome... (95: 0020–007E) |
||||||||||||||||||||||||||||
Delete Character (007F-007F) | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 1 | ||||
C1 Control Character (0080-009F) | 32 | 32 | 32 | |||||||||||||||||||||||||
Latin-1 Supplement (96: 00A0–00FF) | 94 | 95 | 95 | 94 | 94 | 94 | 95 | |||||||||||||||||||||
Latin Extended-A (128: 0100–017F) | 127 | 8 | 29 | |||||||||||||||||||||||||
Latin Extended-B (208: 0180–024F) | 148 | 52 | 178 | 188 | 188 | 178 | 188 | 182 | 194 | 183 | 119 | 1 | 7 | 183 | 8 | 28 | ||||||||||||
IPA Extensions International Phonetic Alphabet The International Phonetic Alphabet "The acronym 'IPA' strictly refers [...] to the 'International Phonetic Association'. But it is now such a common practice to use the acronym also to refer to the alphabet itself that resistance seems pedantic... (96: 0250–02AF) |
89 | 94 | 94 | 94 | 94 | 89 | 55 | |||||||||||||||||||||
Spacing Modifier Letters (80: 02B0–02FF) | 57 | 79 | 63 | 75 | 63 | 62 | 63 | 75 | 75 | 63 | 57 | 12 | 10 | 11 | 14 | |||||||||||||
Combining Diacritical Marks Diacritic A diacritic is a glyph added to a letter, or basic glyph. The term derives from the Greek διακριτικός . Diacritic is both an adjective and a noun, whereas diacritical is only an adjective. Some diacritical marks, such as the acute and grave are often called accents... (112: 0300–036F) |
72 | 82 | 103 | 82 | 82 | 93 | 103 | 107 | 109 | 111 | 106 | 68 | 1 | 109 | 110 | 106 | 32 | |||||||||||
Greek and Coptic Greek alphabet The Greek alphabet is the script that has been used to write the Greek language since at least 730 BC . The alphabet in its classical and modern form consists of 24 letters ordered in sequence from alpha to omega... (134: 0370–03FF) |
126 | 105 | 124 | 110 | 17 | 76 | 110 | 17 | 120 | 128 | 120 | 80 | 110 | 106 | 91 | 117 | 126 | 73 | 127 | 123 | 126 | 117 | 72 | 76 | ||||
Cyrillic Cyrillic alphabet The Cyrillic script or azbuka is an alphabetic writing system developed in the First Bulgarian Empire during the 10th century AD at the Preslav Literary School... (256: 0400–04FF) |
254 | 226 | 253 | 2 | 238 | 214 | 238 | 244 | 214 | 247 | 214 | 230 | 244 | 153 | 254 | 94 | 253 | 254 | 254 | 246 | 92 | 66 | ||||||
Cyrillic Supplement (38: 0500–052F) | 24 | 16 | 36 | 34 | 16 | 36 | 34 | 36 | 34 | 16 | 24 | 20 | 24 | 24 | 15 | |||||||||||||
Armenian Armenian alphabet The Armenian alphabet is an alphabet that has been used to write the Armenian language since the year 405 or 406. It was devised by Saint Mesrop Mashtots, an Armenian linguist and ecclesiastical leader, and contained originally 36 letters. Two more letters, օ and ֆ, were added in the Middle Ages... (86: 0530–058F) |
85 | 85 | 84 | |||||||||||||||||||||||||
Hebrew Hebrew alphabet The Hebrew alphabet , known variously by scholars as the Jewish script, square script, block script, or more historically, the Assyrian script, is used in the writing of the Hebrew language, as well as other Jewish languages, most notably Yiddish, Ladino, and Judeo-Arabic. There have been two... (87: 0590–05FF) |
82 | 46 | 86 | 82 | 86 | 60 | 82 | 54 | 53 | 82 | 51 | 83 | ||||||||||||||||
Arabic Arabic alphabet The Arabic alphabet or Arabic abjad is the Arabic script as it is codified for writing the Arabic language. It is written from right to left, in a cursive style, and includes 28 letters. Because letters usually stand for consonants, it is classified as an abjad.-Consonants:The Arabic alphabet has... (250: 0600–06FF) |
225 | 194 | 60 | 10 | 185 | 69 | 201 | 161 | 67 | 232 | 225 | 225 | 225 | 183 | ||||||||||||||
Syriac Syriac alphabet The Syriac alphabet is a writing system primarily used to write the Syriac language from around the 2nd century BC . It is one of the Semitic abjads directly descending from the Aramaic alphabet and shares similarities with the Phoenician, Hebrew, Arabic, and the traditional Mongolian alphabets.-... (77: 0700–074F) |
50 | |||||||||||||||||||||||||||
Arabic Supplement (48: 0750–077F) | 30 | 30 | 30 | 30 | ||||||||||||||||||||||||
Range Font Range |
0780-139F
Range Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Thaana (50: 0780–07BF) | 49 | 49 | ||||||||||||||||||||||||||
N'ko N'Ko N'Ko is both a script devised by Solomana Kante in 1949 as a writing system for the Mande languages of West Africa, and the name of the literary language itself written in the script. The term N'Ko means 'I say' in all Manding languages.... (59: 07C0–07FA) |
54 | |||||||||||||||||||||||||||
Samaritan (46: 0800–082D) | ||||||||||||||||||||||||||||
Aramaic (29: 0820–083F) | ||||||||||||||||||||||||||||
Samaritan, Punctuation (15: 0830–083E) | ||||||||||||||||||||||||||||
Pahlavi (64: 0840–087F) | ||||||||||||||||||||||||||||
Tifinagh (48: 0880–08AF) | ||||||||||||||||||||||||||||
Mandaic (48: 08D0–08FF) | ||||||||||||||||||||||||||||
Devanagari Devanagari Devanagari |deva]]" and "nāgarī" ), also called Nagari , is an abugida alphabet of India and Nepal... (117: 0900–097F) |
104 | 112 | 104 | 103 | 105 | 106 | ||||||||||||||||||||||
Bengali Bengali script The Bengali alphabet is the writing system for the Bengali language. The script with variations is used for Assamese and is basis for Meitei, Bishnupriya Manipuri, Kokborok, Garo and Mundari alphabets. All these languages are spoken in the eastern region of South Asia. Historically, the script has... (92: 0980–09FF) |
89 | 91 | 89 | 91 | 90 | |||||||||||||||||||||||
Gurmukhi Gurmukhi script Gurmukhi is the most common script used for writing the Punjabi language. An abugida derived from the Laṇḍā script and ultimately descended from Brahmi, Gurmukhi was standardized by the second Sikh guru, Guru Angad Dev Ji, in the 16th century. The whole of the Sri Guru Granth Sahib Ji's 1430... (79: 0A00–0A7F) |
75 | 77 | ||||||||||||||||||||||||||
Gujarati Gujarati script The Gujarati script , which like all Nāgarī writing systems is strictly speaking an abugida rather than an alphabet, is used to write the Gujarati and Kutchi languages... (83: 0A80–0AFF) |
78 | 78 | 78 | |||||||||||||||||||||||||
Oriya Oriya script The Oriya script or Utkala Lipi or Utkalakshara is used to write the Oriya language, and can be used for several other Indian languages, for example, Sanskrit.- History :... (84: 0B00–0B7F) |
79 | 79 | ||||||||||||||||||||||||||
Tamil Tamil script The Tamil script is a script that is used to write the Tamil language as well as other minority languages such as Badaga, Irulas, and Paniya... (72: 0B80–0BFF) |
61 | 71 | 70 | |||||||||||||||||||||||||
Telugu Telugu script Telugu script, an abugida from the Brahmic family of scripts, is used to write the Telugu language, a language found in the South-Central Indian state of Andhra Pradesh as well as several other neighboring states. The Telugu script is derived from the Bhattiprolu script... (93: 0C00–0C7F) |
80 | 42 | 80 | |||||||||||||||||||||||||
Kannada Kannada script The Kannada script is an alphasyllabary of the Brahmic family, used primarily to write the Kannada language, one of the Dravidian languages of southern India and also Sanskrit in the past. The Telugu script is derived from Old Kannada, and resembles Kannada script... (86: 0C80–0CFF) |
80 | |||||||||||||||||||||||||||
Malayalam Malayalam script The Malayalam script is a Brahmic script used commonly to write the Malayalam language—which is the principal language of the Indian state of Kerala, spoken by 36 million people in the world. Like many other Indic scripts, it is an abugida, or a writing system that is partially “alphabetic” and... (95: 0D00–0D7F) |
78 | 78 | ||||||||||||||||||||||||||
Sinhala (80: 0D80–0DFF) | ||||||||||||||||||||||||||||
Thai Thai alphabet Thai script , is used to write the Thai language and other, minority, languages in Thailand. It has forty-four consonants , fifteen vowel symbols that combine into at least twenty-eight vowel forms, and four tone marks .... (87: 0E00–0E7F) |
86 | 1 | ||||||||||||||||||||||||||
Lao Lao alphabet The Lao alphabet, Aksone Lao , is the main script used to write the Lao language and other minority languages in Laos. It is ultimately of Indic origin, the alphabet includes 27 consonants , 7 consonantal ligatures , 33 vowels , and 4 tone marks... (65: 0E80–0EFF) |
||||||||||||||||||||||||||||
Tibetan Tibetan script The Tibetan alphabet is an abugida of Indic origin used to write the Tibetan language as well as the Dzongkha language, Denzongkha, Ladakhi language and sometimes the Balti language. The printed form of the alphabet is called uchen script while the hand-written cursive form used in everyday... (205: 0F00–0FFF) |
168 | 168 | 55 | 193 | ||||||||||||||||||||||||
Burmese Burmese alphabet The Burmese script is an abugida in the Brahmic family used for writing Burmese. Furthermore, various other scripts share some aspect and letters of the Burmese script, though they should not be considered strictly Burmese, including Mon, Shan, S'gaw Karen, Eastern and Western Pwo Karen and Geba... (Mayanmar) (160: 1000–109F) |
156 | 78 | ||||||||||||||||||||||||||
Georgian Georgian alphabet The Georgian alphabet is the writing system used to write the Georgian language and other Kartvelian languages , and occasionally other languages of the Caucasus such as Ossetic and Abkhaz during the 1940s... (83: 10A0–10FF) |
78 | 1 | 78 | 1 | 78 | |||||||||||||||||||||||
Hangul Hangul Hangul,Pronounced or ; Korean: 한글 Hangeul/Han'gŭl or 조선글 Chosŏn'gŭl/Joseongeul the Korean alphabet, is the native alphabet of the Korean language. It is a separate script from Hanja, the logographic Chinese characters which are also sometimes used to write Korean... Jamo, Choseong (96: 1100-115F) |
91 | 91 | 91 | 95 | 47 | |||||||||||||||||||||||
Hangul Hangul Hangul,Pronounced or ; Korean: 한글 Hangeul/Han'gŭl or 조선글 Chosŏn'gŭl/Joseongeul the Korean alphabet, is the native alphabet of the Korean language. It is a separate script from Hanja, the logographic Chinese characters which are also sometimes used to write Korean... Jamo, Jungseong (72: 1160-11A7) |
67 | 67 | 71 | 37 | ||||||||||||||||||||||||
Hangul Hangul Hangul,Pronounced or ; Korean: 한글 Hangeul/Han'gŭl or 조선글 Chosŏn'gŭl/Joseongeul the Korean alphabet, is the native alphabet of the Korean language. It is a separate script from Hanja, the logographic Chinese characters which are also sometimes used to write Korean... Jamo, Jongseong (88: 11A8-11FF) |
82 | 82 | 82 | 82 | 62 | |||||||||||||||||||||||
Ethiopic (Ge'ez/Ethiopic) (356: 1200–137F) | 345 | 345 | 345 | 346 | ||||||||||||||||||||||||
Ethiopic Supplement (26: 1380–139F) | ||||||||||||||||||||||||||||
Range Font Range |
13A0-1DBF
Range Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Cherokee Cherokee language Cherokee is an Iroquoian language spoken by the Cherokee people which uses a unique syllabary writing system. It is the only Southern Iroquoian language that remains spoken. Cherokee is a polysynthetic language.-North American etymology:... (85: 13A0–13FF) |
85 | |||||||||||||||||||||||||||
Unified Canadian Aboriginal Syllabics (669: 1400–167F) | 29 | 630 | 433 | 659 | 29 | |||||||||||||||||||||||
Ogham Ogham Ogham is an Early Medieval alphabet used primarily to write the Old Irish language, and occasionally the Brythonic language. Ogham is sometimes called the "Celtic Tree Alphabet", based on a High Medieval Bríatharogam tradition ascribing names of trees to the individual letters.There are roughly... (29: 1680–169F) |
||||||||||||||||||||||||||||
Runic (81: 16A0–16FF) | 1 | |||||||||||||||||||||||||||
Tagalog Baybayin Baybayin , is a pre-Spanish Philippine writing system. It is a member of the Brahmic family and is recorded as being in use in the 16th century... (Baybayin) (20: 1700–171F) |
||||||||||||||||||||||||||||
Hanunoo (23: 1720–173F) | 2 | 2 | ||||||||||||||||||||||||||
Buhid Buhid script Buhid, is an indigenous Brahmic script of the Philippines, closely related to Baybayin, and is used today by the Mangyans to write their language, Buhid.- Unicode :Buhid script was added to the Unicode Standard in March, 2002 with the release of version 3.2.... (20: 1740–175F) |
||||||||||||||||||||||||||||
Tagbanwa Tagbanwa Tagbanwa, also known as Apurahuano, is one of the indigenous writing systems of the Philippines. The Tagbanwa language, an Austronesian language, with about 8, 000 speakers in the central and northern regions of Palawan, is dying out as the younger generations of Tagbanua are learning Cuyonon... (18: 1760–177F) |
||||||||||||||||||||||||||||
Khmer (114: 1780–17FF) | 103 | 1 | ||||||||||||||||||||||||||
Mongolian Mongolian alphabet Many alphabets have been devised for the Mongolian language over the centuries, and from a variety of scripts. The oldest, called simply the Mongolian script, has been the predominant script during most of Mongolian history, and is still in active use today in the Inner Mongolia region of China... (156: 1800–18AF) |
155 | 155 | ||||||||||||||||||||||||||
Limbu Limbu script The Limbu script is used to write the Limbu language. The Limbu script is an abugida derived from the Tibetan script.-History:According to traditional histories, the Limbu script was first invented in the late 9th century by King Sirijonga Haang, then fell out of use, to be reintroduced in the 18th... (66: 1900–194F) |
||||||||||||||||||||||||||||
Tai Le Tai Le script Tai Le is the name of Tai Nüa script, the script used for the Tai Nüa language, given by Microsoft.-Unicode:Tai Le script was added to the Unicode Standard in April, 2003 with the release of version 4.0.... (35: 1950–197F) |
||||||||||||||||||||||||||||
Tai Lue Tai Lü language Tai Lü is a language spoken by about 670,000 people in South East Asia. This includes 250,000 people in China, 200,000 in Burma, 134,000 in Thailand, and 5,000 in Vietnam... (New Tai Lu) (83: 1980–19DF) |
||||||||||||||||||||||||||||
Khmer Symbols Khmer script The Khmer script is an alphasyllabary script used to write the Khmer language . It is also used to write Pali among the Buddhist liturgy of Cambodia and Thailand.... (32: 19E0–19FF) |
||||||||||||||||||||||||||||
Buginese Buginese language Buginese is the language spoken by about four million people mainly in the southern part of Sulawesi, Indonesia.-History:The word Buginese derives from the word Bahasa Bugis in Malay. In Buginese, it is called while the Bugis people are called... (30: 1A00–1A1F) |
||||||||||||||||||||||||||||
Tai Tham (127: 1A20–1AAD) | ||||||||||||||||||||||||||||
Balinese (121: 1B00–1B7F) | ||||||||||||||||||||||||||||
Sundanese (55: 1B80–1BBF) | ||||||||||||||||||||||||||||
Lepcha (74: 1C00–1C4F) | ||||||||||||||||||||||||||||
Ol Chiki (48: 1C50–1C7F) | ||||||||||||||||||||||||||||
Phonetic Extensions (128: 1D00–1D7F) | 17 | 109 | 106 | 109 | 110 | 109 | 22 | 108 | 108 | |||||||||||||||||||
Phonetic Extensions Supplement (64: 1D80–1DBF) | 38 | 3 | ||||||||||||||||||||||||||
Range Font Range |
1DC0-257F
Range Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Combining Diacritical Marks Supplement (42: 1DC0–1DFF) | 13 | 2 | 13 | 5 | 6 | 5 | 5 | 26 | 13 | 13 | 13 | 13 | ||||||||||||||||
Latin Extended Additional (256: 1E00–1EFF) | 247 | 246 | 96 | 88 | 246 | 246 | 246 | 252 | 246 | 8 | 247 | 8 | 246 | 247 | 247 | 247 | 8 | |||||||||||
Greek Extended (233: 1F00–1FFF) | 232 | 4 | ||||||||||||||||||||||||||
General Punctuation Punctuation Punctuation marks are symbols that indicate the structure and organization of written language, as well as intonation and pauses to be observed when reading aloud.In written English, punctuation is vital to disambiguate the meaning of sentences... (107: 2000–206F) |
54 | 63 | 96 | 65 | 69 | 106 | 74 | 56 | 77 | 74 | 101 | 74 | 77 | 59 | 85 | 67 | 27 | 45 | 25 | 97 | 44 | 54 | 62 | 20 | 91 | |||
Superscripts and Subscripts Unicode subscripts and superscripts Unicode has subscripted and superscripted versions of a number of characters including a full set of arabic numerals. These characters allow any polynomial, chemical and certain other equations to be represented in plain text without using any form of markup like HTML or TeX.The World Wide Web... (34: 2070–209F) |
6 | 28 | 9 | 28 | 29 | 29 | 28 | 29 | 29 | 28 | 1 | 6 | 6 | 6 | 6 | 29 | 6 | 34 | ||||||||||
Currency Symbols (25: 20A0-20CF) | 22 | 13 | 6 | 16 | 22 | 22 | 15 | 16 | 23 | 22 | 24 | 22 | 2 | 9 | 18 | 12 | 17 | 22 | 4 | 22 | 22 | 22 | 12 | 2 | 18 | |||
Combining Diacritical Marks for Symbols (33: 20D0–20FF) | 1 | 18 | 20 | 28 | 5 | 20 | 7 | 5 | 5 | 1 | 2 | 1 | 27 | 1 | 1 | 27 | ||||||||||||
Letterlike Symbols Letterlike Symbols Letterlike Symbols are graphemes which are constructed mainly from the glyphs of one or more letters.In Unicode, Letterlike Symbols are placed in the block U+2100–214F, as in the following table.-See also:*Mapping of Unicode characters... (80: 2100–214F) |
9 | 57 | 75 | 13 | 59 | 5 | 59 | 59 | 75 | 5 | 71 | 5 | 10 | 31 | 32 | 57 | 4 | 8 | 10 | 77 | 8 | 8 | 10 | 9 | 75 | |||
Number Forms Number Forms Number Forms are Unicode characters which have specific meaning as numbers, but are constructed from other characters. They consist primarily of vulgar fractions and roman numerals. They are placed in the Unicode codepoint range 0x2150 through 0x218F , except for three fractions in ISO-8859-1... (58: 2150–218F) |
7 | 48 | 4 | 49 | 54 | 54 | 49 | 49 | 55 | 54 | 54 | 50 | 50 | 49 | 4 | 10 | 5 | 26 | 50 | 5 | 7 | 28 | 30 | 53 | ||||
Arrows Arrow (symbol) An arrow is a graphical symbol such as → or ←, used to point or indicate direction, being in its simplest form a line segment with a triangle affixed to one end, and in more complex forms a representation of an actual arrow... (112: 2190–21FF) |
7 | 91 | 14 | 100 | 19 | 92 | 100 | 19 | 19 | 34 | 20 | 91 | 8 | 13 | 7 | 20 | 12 | |||||||||||
Mathematical Operators Unicode Mathematical Operators Unicode ranges mathematical operators and symbols in multiple blocks.* Mathematical Operators * Miscellaneous Mathematical Symbols-A * Miscellaneous Mathematical Symbols-B... (256: 2200–22FF) |
16 | 242 | 246 | 24 | 242 | 21 | 242 | 242 | 21 | 21 | 16 | 82 | 18 | 242 | 26 | 13 | 43 | 14 | 16 | 52 | 51 | |||||||
Miscellaneous Technical Symbols Miscellaneous Technical (Unicode) Miscellaneous Technical is the name of a a Unicode block ranging from U+2300 to U+23FF, which contains various common symbols which are related to and used in the various technical, programming language and academic professions.... (233: 2300–23FF) |
4 | 123 | 36 | 57 | 228 | 27 | 4 | 154 | 65 | 27 | 219 | 105 | 27 | 6 | 14 | 14 | 10 | 1 | 5 | 211 | 4 | 7 | 1 | 209 | ||||
Control Pictures (39: 2400–243F) | 37 | 1 | 2 | 1 | 36 | 1 | 1 | 1 | 37 | 1 | 4 | |||||||||||||||||
Optical Character Recognition Optical character recognition Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of scanned images of handwritten, typewritten or printed text into machine-encoded text. It is widely used to convert books and documents into electronic files, to computerize a record-keeping... (11: 2440–245F) |
10 | |||||||||||||||||||||||||||
Enclosed Alphanumerics (160: 2460–24FF) | 139 | 73 | 139 | 10 | 10 | 114 | 1 | 82 | 112 | 139 | ||||||||||||||||||
Box Drawing Box drawing characters Box drawing characters, also known as line drawing characters, or pseudographics, are widely used in text user interfaces to draw various frames and boxes... (128: 2500–257F) |
40 | 115 | 1 | 116 | 53 | 97 | 40 | 99 | 117 | |||||||||||||||||||
Range Font Range |
2580-2DFF
Range Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Block Elements (32: 2580–259F) | 8 | 22 | 20 | 22 | 22 | 21 | 18 | 8 | 8 | 10 | 20 | |||||||||||||||||
Geometric Shapes Unicode Geometric Shapes Geometric Shapes is a Unicode block of 96 symbols at codepoint range U+25A0-25FF.-U+25A0-U+25CF:-U+25D0-U+25FF:-Font coverage:Only two font sets—Code2000 and the DejaVu family—include coverage for each of the glyphs in the Geometric Shapes range, Unifont also contains all the glyphs... (96: 25A0–25FF) |
16 | 80 | 92 | 8 | 88 | 2 | 95 | 88 | 2 | 2 | 1 | 26 | 18 | 79 | 16 | 7 | 34 | 7 | 16 | 38 | 29 | |||||||
Miscellaneous Symbols Miscellaneous Symbols The Miscellaneous Symbols Unicode block contains various glyphs representing things from a variety of categories: Astrological, Astronomical, Chess, Dice, Ideological symbols, Musical notation, Political symbols, Recycling, Religious symbols, Trigrams, Warning signs and Weather.-Tables:Note: These... (128: 2600–267F) |
12 | 106 | 118 | 31 | 108 | 127 | 128 | 4 | 45 | 14 | 6 | 2 | 24 | 125 | 1 | 12 | 25 | 33 | ||||||||||
Dingbats (Zapf Dingbats) (175: 2700–27BF) | 160 | 6 | 2 | 174 | 2 | 170 | 1 | 174 | 2 | 174 | 174 | 2 | 12 | 14 | 3 | 174 | 174 | |||||||||||
Miscellaneous Mathematical Symbols-A (28: 27C0–27EF) | 3 | 2 | 2 | 2 | 15 | 6 | 1 | 16 | 16 | |||||||||||||||||||
Supplemental Arrows-A (32: 27F0–27FF) | 8 | 2 | 23 | 2 | 28 | 22 | 2 | 4 | 2 | 12 | 28 | |||||||||||||||||
Braille Patterns Braille The Braille system is a method that is widely used by blind people to read and write, and was the first digital form of writing.Braille was devised in 1825 by Louis Braille, a blind Frenchman. Each Braille character, or cell, is made up of six dot positions, arranged in a rectangle containing two... (256: 2800–28FF) |
||||||||||||||||||||||||||||
Supplemental Arrows-B (128: 2900–297F) | 6 | 6 | 119 | |||||||||||||||||||||||||
Miscellaneous Mathematical Symbols-B (128: 2980–29FF) | 2 | 13 | 1 | |||||||||||||||||||||||||
Supplemental Mathematical Operators (256: 2A00–2AFF) | 10 | 72 | 194 | 10 | 1 | |||||||||||||||||||||||
Miscellaneous Symbols and Arrows (Supplemental Symbols and Arrows) (87: 2B00–2BFF) | 82 | 35 | 49 | 15 | ||||||||||||||||||||||||
Reserved | ||||||||||||||||||||||||||||
Glagolitic Glagolitic alphabet The Glagolitic alphabet , also known as Glagolitsa, is the oldest known Slavic alphabet. The name was not coined until many centuries after its creation, and comes from the Old Slavic glagolъ "utterance" . The verb glagoliti means "to speak"... (94: 2C00–2C5F) |
86 | |||||||||||||||||||||||||||
Latin Extended-C (32: 2C60-2C7F) | 21 | 29 | 12 | 31 | 12 | 29 | 12 | 5 | 17 | 21 | 17 | 21 | 21 | |||||||||||||||
Coptic Coptic alphabet The Coptic alphabet is the script used for writing the Coptic language. The repertoire of glyphs is based on the Greek alphabet augmented by letters borrowed from the Demotic and is the first alphabetic script used for the Egyptian language... (121: 2C80–2CFF) |
114 | 66 | 114 | |||||||||||||||||||||||||
Georgian Supplement (38: 2D00–2D2F) | ||||||||||||||||||||||||||||
Tifinagh Tifinagh Tifinagh is a series of abjad and alphabetic scripts used by some Berber peoples, notably the Tuareg, to write their language.A modern derivate of the traditional script, known as Neo-Tifinagh, was introduced in the 20th century... (55: 2D30–2D7F) |
||||||||||||||||||||||||||||
Ethiopic Extended (79: 2D80–2DDF) | ||||||||||||||||||||||||||||
Cyrillic Extended-A (16: 2DE0–2DFF) | ||||||||||||||||||||||||||||
Range Font Range |
2E00-4DBF
Range Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Supplemental Punctuation (50: 2E00–2E7F) | 1 | 24 | 49 | 6 | 1 | 22 | 9 | 1 | 1 | 1 | 1 | |||||||||||||||||
CJK Radicals (Supplement) Radical (Chinese character) A Chinese radical is a component of a Chinese character. The term may variously refer to the original semantic element of a character, or to any semantic element, or, loosely, to any element whatever its origin or purpose... (115: 2E80–2EFF) |
||||||||||||||||||||||||||||
Kangxi Radicals (214: 2F00–2FDF) | 214 | |||||||||||||||||||||||||||
Ideographic Description Characters (16: 2FE0–2FEF) | 12 | 12 | ||||||||||||||||||||||||||
CJK Symbols and Punctuation (64: 3000–303F) | 57 | 12 | 9 | 40 | 18 | 17 | 31 | 46 | 45 | |||||||||||||||||||
Hiragana Hiragana is a Japanese syllabary, one basic component of the Japanese writing system, along with katakana, kanji, and the Latin alphabet . Hiragana and katakana are both kana systems, in which each character represents one mora... (93: 3040–309F) |
90 | 90 | 90 | 86 | 83 | 90 | 93 | |||||||||||||||||||||
Katakana Katakana is a Japanese syllabary, one component of the Japanese writing system along with hiragana, kanji, and in some cases the Latin alphabet . The word katakana means "fragmentary kana", as the katakana scripts are derived from components of more complex kanji. Each kana represents one mora... (96: 30A0–30FF) |
94 | 94 | 94 | 92 | 86 | 94 | 95 | |||||||||||||||||||||
Bopomofo Bopomofo Zhuyin fuhao , often abbreviated as zhuyin and colloquially called bopomofo, was introduced in the 1910s as the first official phonetic system for transcribing Chinese, especially Mandarin.... (41: 3100–312F) |
40 | 37 | 39 | 40 | 37 | |||||||||||||||||||||||
Hangul Compatibility Jamo (94: 3130–318F) | 93 | 1 | ||||||||||||||||||||||||||
Kanbun Kanbun The Japanese word originally meant "Classical Chinese writings, Chinese classic texts, Classical Chinese literature". This evolved into a Japanese method of reading annotated Classical Chinese in translation . Much Japanese literature was written in literary Chinese using this annotated style... (16: 3190–319F) |
14 | 16 | 14 | |||||||||||||||||||||||||
Bopomofo Extended Bopomofo Zhuyin fuhao , often abbreviated as zhuyin and colloquially called bopomofo, was introduced in the 1910s as the first official phonetic system for transcribing Chinese, especially Mandarin.... (24: 31A0–31BF) |
4 | |||||||||||||||||||||||||||
CJK Strokes (36: 31C0–31EF) | ||||||||||||||||||||||||||||
Katakana Phonetic Extensions (16: 31F0–31FF) | ||||||||||||||||||||||||||||
Enclosed CJK Letters and Months (254: 3200–32FF) | 202 | 242 | 58 | 1 | 58 | 232 | 206 | 179 | ||||||||||||||||||||
CJK Compatibility (256: 3300–33FF) | 249 | 105 | 11 | 80 | 249 | 154 | ||||||||||||||||||||||
CJK Unified Ideographs Extension A (6646: 3400–4DBF) | 1 | 64 | 180 | |||||||||||||||||||||||||
Range Font Range |
4DC0-FAFF
Range Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Yijing Hexagram Symbols (70: 4DC0–4DFF) | 64 | 6 | 6 | |||||||||||||||||||||||||
CJK Unified Ideographs CJK Unified Ideographs The Chinese, Japanese and Korean scripts share a common background. In the process called Han unification the common characters were identified, and named "CJK Unified Ideographs"... (Han Unification Han unification Han unification is an effort by the authors of Unicode and the Universal Character Set to map multiple character sets of the so-called CJK languages into a single set of unified characters. Han characters are a common feature of written Chinese , Japanese , Korean , and—at least historically—other... ) (20940: 4E00–9FFF) |
79 | |||||||||||||||||||||||||||
Yi Syllables Yi script The Yi script, also historically known as Cuan Wen or Wei Shu , is used to write the Yi languages.-Classical Yi:Classical Yi is a syllabic logographic system that was reputedly devised during the Tang dynasty by someone called Aki... (1024: A000–A3FF) |
||||||||||||||||||||||||||||
Yi Radicals (32: A490–A4AF) | 0 | |||||||||||||||||||||||||||
Lisu Lisu The Lisu people are a Tibeto-Burman ethnic group who inhabit the mountainous regions of Burma , Southwest China, Thailand, and the Indian state of Arunachal Pradesh.... (48: A4D0–A4FF) |
||||||||||||||||||||||||||||
Vai Vai Vai has several possible meanings:* Vaï, Moroccan-French Canadian rapper* Vai people** Vai language** Vai syllabary* Vai * Văi, a village in Lupşa Commune, Alba County, Romania* Steve Vai, guitarist* Steve Vai's band Vai - see Steve Vai... (300: A500–A63F) |
||||||||||||||||||||||||||||
Cyrillic Extended-B (78: A640–A69F) | 31 | |||||||||||||||||||||||||||
Modifier Tone Letters (32: A700–A71F) | 9 | 1 | 20 | 1 | 1 | 1 | 9 | 4 | 9 | 9 | ||||||||||||||||||
Latin Extended D (114: A720-A7FF) | 7 | 28 | 57 | 28 | 81 | 28 | 97 | 3 | 7 | 2 | 7 | 7 | ||||||||||||||||
Syloti Nagri (44: A800–A82F) | ||||||||||||||||||||||||||||
Phags-pa (56: A840–A87F) | ||||||||||||||||||||||||||||
Saurashtra (81: A880–A8DF) | ||||||||||||||||||||||||||||
Kayah Li (48: A900–A92F) | ||||||||||||||||||||||||||||
Rejang Rejang script The Rejang script, sometimes spelt Redjang and locally known as Surat Ulu , is an abugida of the Brahmic family, and is related to other scripts of the region, like Batak, Buginese, and others. Rejang is a member of the closely related group of Surat Ulu scripts that include the script variants of... (37: A930–A95F) |
||||||||||||||||||||||||||||
Cham (83: AA00–AA5F) | ||||||||||||||||||||||||||||
Mayanmar Extended (28: AA60–AA7B) | ||||||||||||||||||||||||||||
Tai Viet (72: AA80–AADF) | ||||||||||||||||||||||||||||
Meetei Mayek (56: ABC0–ABF9) | ||||||||||||||||||||||||||||
Hangul Syllables Hangul Hangul,Pronounced or ; Korean: 한글 Hangeul/Han'gŭl or 조선글 Chosŏn'gŭl/Joseongeul the Korean alphabet, is the native alphabet of the Korean language. It is a separate script from Hanja, the logographic Chinese characters which are also sometimes used to write Korean... (11172: AC00–D7AF) |
6 | |||||||||||||||||||||||||||
High Surrogates (D800–DB7F) | ||||||||||||||||||||||||||||
High Private Use Surrogates (DB80–DBFF) | ||||||||||||||||||||||||||||
Low Surrogates (DC00–DFFF) | ||||||||||||||||||||||||||||
Private Use Area (Private/Corporate Use) (6400: E000–F8FF) | 55 | 43 | 20 | 565 | 148 | 352 | 788 | 372 | 29 | 229 | 40 | 229 | 961 | 276 | 6 | 16 | 47 | 84 | 55 | |||||||||
CJK Compatibility Ideographs Ideograph Ideograph is a term coined by rhetorical scholar and critic Michael Calvin McGee describing the use of particular words and phrases as political language in a way that captures particular ideological positions... (470: F900–FAFF) |
302 | 302 | 437 | 302 | 16 | 466 | 1 | 455 | 363 | |||||||||||||||||||
Range Font Range |
FB00-FFFF
Range Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Alphabetic Presentation Forms (Latin Lig + Armenian Lig + Hebrew Lig) (58: FB00–FB4F) | 48 | 57 | 26 | 53 | 42 | 5 | 52 | 5 | 50 | 5 | 7 | 7 | 32 | 3 | 48 | 2 | 48 | 48 | 53 | 7 | ||||||||
Latin Ligatures (7: FB00–FB06) | 2 | 2 | 5 | 5 | 5 | 5 | 5 | 2 | 2 | 2 | 2 | 2 | ||||||||||||||||
Armenian Ligatures (5: FB13–FB17) | 1 | |||||||||||||||||||||||||||
Hebrew Ligatures / Pointed Letters (46: FB1D–FB4F) | 46 | 45 | 2 | 34 | 42 | 38 | 27 | |||||||||||||||||||||
Arabic Presentation Forms-A (595: FB50–FDFF) | 90 | 593 | 62 | 155 | 22 | 128 | 98 | 20 | 44 | 89 | 89 | 187 | 25 | |||||||||||||||
Variation Selectors (16: FE00–FE0F) | ||||||||||||||||||||||||||||
Vertical Forms (10: FE10–FE1F) | 2 | |||||||||||||||||||||||||||
Combining Half Marks (7: FE20–FE2F) | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 | ||||||||||||||||
CJK Compatibility Forms (32: FE30–FE4F) | 28 | 28 | 28 | 27 | ||||||||||||||||||||||||
Small Form Variants (26: FE50–FE6F) | 18 | 2 | 25 | 25 | ||||||||||||||||||||||||
Arabic Presentation Forms-B (140: FE70–FEFE) | 85 | 139 | 120 | 139 | 1 | 125 | 1 | 131 | 1 | 1 | 53 | 87 | 85 | 139 | ||||||||||||||
Byte Order Mark (1: FEFF) | ||||||||||||||||||||||||||||
Halfwidth and Fullwidth Forms Halfwidth and Fullwidth Forms In CJK computing, graphic characters are traditionally classed into fullwidth and halfwidth characters... (225: FF00–FFEF) |
223 | 166 | 171 | 186 | 152 | 209 | 100 | 224 | 157 | 101 | 172 | |||||||||||||||||
Latin Full Width Forms (94: FF01-FF5E) | 94 | |||||||||||||||||||||||||||
KataKana Half Width Forms (63: FF61-FF9F) | 63 | |||||||||||||||||||||||||||
Hangul Jamo Half Width Forms (52: FFA0-FFDC) | 15 | 51 | ||||||||||||||||||||||||||
Specials (5: FFF0–FFFF) | 1 | 2 | 1 | 1 | 3 | 1 | 1 | 1 | 1 | 2 | 1 | 2 | 2 | 1 | ||||||||||||||
Range Font |
||||||||||||||||||||||||||||
BMP (Plane 0) |
List of SMP Unicode fonts
Of the many Unicode fonts rich with vast amount of SMP plane characters, few ones listed below are the most commonly used by a majority of users around the world on mainstream computing platforms. You must also see the first top font list, as some of them also contains vast amount of SMP plane characters. More Unicode fonts can be found in the (List of typefaces) article's "Unicode fonts" section.Font | Char(s) | Glyphs | Kernpairs (Standard) |
Range(Point): Hinting/ Smoothing Behavior |
Version & Date | Filename, Size |
Font Family | Font Weight, style |
Font type |
Serif style |
License | Creator / Author (Copyright) | Notes |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Code2001 | 2,985 | 3,135 | 0 | v0.919 sfnt rev 0.918945 2008-04-04 |
CODE2001.TTF (485 KB) |
Code2001 | Medium (Normal), Book, Regular | TTF | Any | Freeware | James Kass | ||
New Athena Unicode | 2,985 | 2,033 | 0 | v3.710 sfnt rev 3.70999 2010-05-02 |
newathu.ttf (769 KB) |
New Athena Unicode | Medium (Normal), Book, Regular | TTF | Any | Freeware. SIL OFL. | American Philological Association http://www.apaclassics.org/. | ||
MPH 2B Damase | 2,743 | 2,895 | 192 | v002.010 sfnt rev 1 2005-10-18 |
damase_v.2.ttf (879 KB) |
MPH 2B Damase | Medium (Normal), Book, Regular | TTF | Any | Freeware. | Public Domain 2005. Mark Williamson. |
10000-1D7FF
SMP (Plane 1) | |||||||||||||||||||||||||||||||
Font Range |
|||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Total Glyphs in SMP (5258: 10000–1FFFF) | 222 | 2 | 304 | 2 | 745 | 2 | 649 | 203 | 994 | 87 | 106 | 210 | |||||||||||||||||||
Linear B Syllabary Linear B Linear B is a syllabic script that was used for writing Mycenaean Greek, an early form of Greek. It pre-dated the Greek alphabet by several centuries and seems to have died out with the fall of Mycenaean civilization... (88: 10000–1007F) |
74 | ||||||||||||||||||||||||||||||
Linear B Ideograms (123: 10080–100FF) | 73 | ||||||||||||||||||||||||||||||
Aegean Numbers (64: 10100–1013F) | 57 | ||||||||||||||||||||||||||||||
Ancient Greek Numbers (75: 10140–1018F) | 5 | ||||||||||||||||||||||||||||||
Ancient Symbols (12: 10190–101CF) | |||||||||||||||||||||||||||||||
Phaistos Disc Phaistos Disc The Phaistos Disc is a disk of fired clay from the Minoan palace of Phaistos on the Greek island of Crete, possibly dating to the middle or late Minoan Bronze Age . It is about 15 cm in diameter and covered on both sides with a spiral of stamped symbols... (46: 101D0–101FF) |
|||||||||||||||||||||||||||||||
Lycian (29: 10280–1029F) | |||||||||||||||||||||||||||||||
Alphabetic and Syllabic LTR scripts (311: 10300–107FF) | 62 | 27 | 307 | 35 | |||||||||||||||||||||||||||
Old Italic Old Italic alphabet Old Italic refers to several now extinct alphabet systems used on the Italian Peninsula in ancient times for various Indo-European languages and non-Indo-European languages... (35: 10300–1032F) |
|||||||||||||||||||||||||||||||
Gothic Gothic language Gothic is an extinct Germanic language that was spoken by the Goths. It is known primarily from the Codex Argenteus, a 6th-century copy of a 4th-century Bible translation, and is the only East Germanic language with a sizable Text corpus... (27: 10330–1034F) |
|||||||||||||||||||||||||||||||
Ugaritic (31: 10380–1039F) | |||||||||||||||||||||||||||||||
Old Persian Old Persian language The Old Persian language is one of the two directly attested Old Iranian languages . Old Persian appears primarily in the inscriptions, clay tablets, and seals of the Achaemenid era... (50: 103A0–103DF) |
|||||||||||||||||||||||||||||||
Deseret Deseret alphabet The Deseret alphabet is a phonemic English spelling reform developed in the mid-19th century by the board of regents of the University of Deseret under the direction of Brigham Young, second president of The Church of Jesus Christ of Latter-day Saints.In public statements, Young claimed the... (80: 10400–1044F) |
|||||||||||||||||||||||||||||||
Shavian (48: 10450–1047F) | |||||||||||||||||||||||||||||||
Osmanya (40: 10480–104AF) | |||||||||||||||||||||||||||||||
Cypriot syllabary Cypriot syllabary The Cypriot syllabary is a syllabic script used in Iron Age Cyprus, from ca. the 11th to the 4th centuries BCE, when it was replaced by the Greek alphabet. A pioneer of that change was king Evagoras of Salamis... (55: 10800–1083F) |
|||||||||||||||||||||||||||||||
Phoenician Phoenician alphabet The Phoenician alphabet, called by convention the Proto-Canaanite alphabet for inscriptions older than around 1050 BC, was a non-pictographic consonantal alphabet, or abjad. It was used for the writing of Phoenician, a Northern Semitic language, used by the civilization of Phoenicia... (29: 10900-1091F) |
27 | 27 | |||||||||||||||||||||||||||||
Kharosthi (65: 10A00–10A5F) | |||||||||||||||||||||||||||||||
Cuneiform Cuneiform script Cuneiform script )) is one of the earliest known forms of written expression. Emerging in Sumer around the 30th century BC, with predecessors reaching into the late 4th millennium , cuneiform writing began as a system of pictographs... (879: 12000-123FF) |
6 | ||||||||||||||||||||||||||||||
Sumero-Akkadian Cuneiform (879: 12000-123FF) | 6 | ||||||||||||||||||||||||||||||
Byzantine Musical Symbols (246: 1D000–1D0FF) | 33 | ||||||||||||||||||||||||||||||
Musical Symbols (220: 1D100–1D1FF) | 44 | ||||||||||||||||||||||||||||||
Ancient Greek Musical Notation (70: 1D200–1D24F) | |||||||||||||||||||||||||||||||
Tai Xuan Jing Symbols (87: 1D300–1D35F) | |||||||||||||||||||||||||||||||
Chinese Counting Rod Numerals (18: 1D360–1D371) | |||||||||||||||||||||||||||||||
Mathematical Alphanumeric Symbols Mathematical alphanumeric symbols Mathematical Alphanumeric Symbols is a Unicode block of Latin and Greek letters and decimal digits that enable mathematicians to denote different notions with different letter styles .Unicode now includes many such symbols Mathematical Alphanumeric Symbols is a Unicode block of Latin and Greek... (994: 1D400–1D7FF) |
13 | 2 | 2 | 2 | |||||||||||||||||||||||||||
Range Font |
|||||||||||||||||||||||||||||||
SMP (Plane 1) |
List of SIP Unicode fonts
Of the many Unicode fonts rich with vast amount of SIP plane characters, few ones listed below are the most commonly used by a majority of users around the world on mainstream computing platforms. You must also see the first top font list (BMP) and SMP fonts, as some of them also contains vast amount of SIP plane characters. More Unicode fonts can be found in the (List of typefaces) article's "Unicode fonts" section.Font | Char(s) | Glyphs | Kernpairs (Standard) |
Range(Point): Hinting/ Smoothing Behavior |
Version & Date | Filename, Size |
Font Family | Font Weight, style |
Font type |
Serif style |
License | Creator / Author (Copyright) | Notes |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Code2002 | 20,419 | 30,469 | 0 | v0.917 sfnt rev 0.916992 2005-04-04 |
CODE2002.TTF (4.09 MB) |
Code2002 | Medium (Normal), Book, Regular | TTF | Normal Sans | Freeware | James Kass |
20000-2FFFF
SIP (Plane 2) | |||||||||||||||||||||||||||||
Font Range |
|||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Total Glyphs in SIP (47624: 20000–2FFFF) | 12 | 543 | 387 | ||||||||||||||||||||||||||
CJK Unified Ideographs Extension B (42711: 20000-2A6D6) | 12 | 1 | 342 | ||||||||||||||||||||||||||
CJK Unified Ideographs Extension C (4149: 2A700-2B734) | |||||||||||||||||||||||||||||
CJK Unified Ideographs Extension D (222: 2B740-2B81D) | |||||||||||||||||||||||||||||
CJK Compatibility Ideographs Supplement (542: 2F800-2FA1F) | 44 | ||||||||||||||||||||||||||||
SIP (Plane 2) |
E0000-EFFFF
SSP (Plane 14; 0Ehexadecimal) | ||||||||||||||||||||||||||||
Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Total Glyphs in SSP (337: E0000–EFFFF) | 12 | 16 | ||||||||||||||||||||||||||
Tag Characters (97: E0000–E007F) | 12 | |||||||||||||||||||||||||||
SSP (Plane 14; 0Ehexadecimal) |
F0000-FFFFF
PUA-A (Plane 15; 0Fhexadecimal) | ||||||||||||||||||||||||||||
Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Total Glyphs in PUA-A (65534: F0000–FFFFF) | 462 | 292 | 73 | |||||||||||||||||||||||||
PUA-A (Plane 15; 0Fhexadecimal) |
100000-10FFFF
PUA-B (Plane 16; 10hexadecimal) | ||||||||||||||||||||||||||||
Font Range |
||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Total Glyphs in PUA-B (65534: 100000–10FFFF) | ||||||||||||||||||||||||||||
PUA-B (Plane 16; 10hexadecimal) | ||||||||||||||||||||||||||||
Unicode Fonts |
See also
- Free software Unicode typefacesFree software Unicode typefacesA few projects exist to provide free and open-source Unicode typefaces, i.e. Unicode typefaces which are open-source and designed to contain glyphs of all Unicode characters. However there are also numerous projects aimed at providing only a certain script, such as the Arabeyes Arabic font...
- Unicode fallback font
- Comparison of Unicode encodingsComparison of Unicode encodingsThis article compares Unicode encodings. Two situations are considered: 8-bit-clean environments and environments that forbid use of byte values that have the high bit set. Originally such prohibitions were to allow for links that used only seven data bits, but they remain in the standards and so...
- List of typefaces
- Typographic unitTypographic unitTypographic units are the units of measurement used in typography or typesetting. The traditional units are different from common metric units, as they were established earlier. Even though these units are all very small, across a line of print they add up quickly...
- FoundriesType foundryA type foundry is a company that designs or distributes typefaces. Originally, type foundries manufactured and sold metal and wood typefaces and matrices for line-casting machines like the Linotype and Monotype machines designed to be printed on letterpress printers...
- HTMLHTMLHyperText Markup Language is the predominant markup language for web pages. HTML elements are the basic building-blocks of webpages....
- CalligraphyCalligraphyCalligraphy is a type of visual art. It is often called the art of fancy lettering . A contemporary definition of calligraphic practice is "the art of giving form to signs in an expressive, harmonious and skillful manner"...
- Font-management programFont-management programFont management software is utility software that computer users use to browse and preview fonts and typically to install and uninstall fonts. In addition, depending on the individual font management program's capabilities, users can also...
- List of CJK fonts
- Unicode ConsortiumUnicode ConsortiumThe Unicode Consortium is a non-profit organization that coordinates the development of the Unicode standard. Its stated goal is to eventually replace existing character encoding schemes with Unicode and its standard Unicode Transformation Format schemes, claiming that many of the existing...
- Alt codesAlt codesOn personal computers running the Microsoft Windows or DOS operating systems, additional characters to those available in the current keyboard layout can be typed using an Alt code: pressing and holding the Alt key while entering a character code with the keyboard's numeric keypad...
- Input method editorInput method editorAn input method is an operating system component or program that allows any data, such as keyboard strokes or mouse movements, to be received as input. In this way users can enter characters and symbols not found on their input devices...
(IME) - Chinese input methods for computersChinese input methods for computersHundreds of Chinese input methods are available for entry of Chinese characters into computers, but most keyboard-based methods rely on either pinyin phonetic readings or root shapes in Chinese characters...
- Korean language and computersKorean language and computersThis article addresses how computers are used to read and write Korean, using Hangul.-Character encodings:In RFC 1557, a method known as ISO-2022-KR for a 7-bit encoding of Korean characters in email was described. Where 8 bits are allowed, the EUC-KR encoding is preferred. These two...
- Japanese input methodsJapanese input methodsJapanese input methods are the methods used to input Japanese characters on a computer.There are two main methods of inputting Japanese on computers. One is via a romanized version of Japanese called rōmaji , and the other is via keyboard keys corresponding to the Japanese kana...
- List of input methods for UNIX platforms
- Enabling complex text support for Indic scripts
- Keyboard layoutKeyboard layoutA keyboard layout is any specific mechanical, visual, or functional arrangement of the keys, legends, or key–meaning associations of a computer, typewriter, or other typographic keyboard....
- Code pageCode pageCode page is another term for character encoding. It consists of a table of values that describes the character set for a particular language. The term code page originated from IBM's EBCDIC-based mainframe systems, but many vendors use this term including Microsoft, SAP, and Oracle Corporation...
External links
- ISO/IEC JTC1/SC2/WG2, the working group in charge of ISO 10646
- Fonts and Keyboards at Unicode.org
- Unicode Font Guide For Free/Libre Open Source Operating Systems - A huge index of high quality free fonts.
- Alan Wood's Unicode Resources - Unicode fonts for Windows computers - Index of free and commercial Unicode fonts.
- Character sets - Ken Fowles, Microsoft, 1997. - Enable Unicode for applications.
- Microsoft Typography - Fonts and Products - Reference for determining which fonts are supplied with Microsoft products.