当前位置:主页>翻译技术>本地化技术一览
本地化技术一览
来源:作者:本站
=Unicode=

As a universal character set that includes all characters of the world, Unicode assigns code points to its characters by 16-bit integers, which means that up to 65,536 characters can be encoded. However, due to the huge set of CJK characters, this has become insufficient, and Unicode 3.0 has extended the index to 21 bits, which will support up to 1,114,112 characters.

Unicode 是一个包括了世界上所有字符的字符集,用16位整数来编码字符指针,也就是可以编码最多65,536个字符。但是,由于 CJK 字符集的庞大规模,连这个容量也不够使用,因此 Unicode 3.0 把索引字长扩展到21位,支持多达1,114,112个字符。

Planes
=平面=

Unicode code point is a numeric value between 0 and 10FFFF, divided into planes of 64K characters. In Unicode 4.0, allocated planes are Plane 0, 1, 2 and 14.

Unicode 编码指针是一个在0和10FFFF之间的数值,分成64K个字符组成的平面。在 Unicode 4.0 里,分配的平面是平面0,1,2和14。

Plane 0, ranging from 0000 to FFFF, is called Basic Multilingual Plane (BMP), which is the set of characters assigned by the previous 16-bit scheme.

平面0,从0000到FFFF,叫做基本多语言平面(Basic Multilingual Plane, BMP),由过去的16位编码系统下的字符集组成。

Plane 1, ranging from 10000 to 1FFFF and called Supplementary Multilingual Plane (SMP), is dedicated to lesser used historic scripts, special-purpose invented scripts and special notations. These include Gothic, Shavian and musical symbols. Many more historic scripts may be encoded in this plane in the future.

平面1,从10000到1FFFF,叫做辅助多语言平面(Supplementary Multilingual Plane, SMP),用于较少使用的古文字,特殊用途的文字和特殊符号。这些文字包括哥特文字,Shavian 文字和乐谱符号。今后可能会有更多的古文字被编码到这个平面中。

Plane 2, ranging from 20000 to 2FFFF and called Supplementary Ideographic Plane (SIP), is the spillover allocation area for those CJK characters that cannot fit into the blocks for common CJK characters in the BMP. Plane 14, ranging from E0000 to EFFFF and called Supplementary Special-purpose Plane (SSP), is for some control characters that do not fit into the small areas allocated in the BMP.

平面2,从20000到2FFFF,称为辅助表意文字平面(Supplementary Ideographic Plane, SIP),用于容纳 BMP 中一般 CJK 字符容纳不下的字符的区域。平面14,从E0000到EFFFF,称为辅助特殊用途平面(Supplementary Special-purpose Plane, SSP),是为 BMP 中有限的小区域无法容纳的控制字符准备的。

There are two more reserved planes Plane 15 and Plane 16, for private use, where no code point is assigned.

还有两个保留平面,平面15和平面16,用于个别用途,没有分配编码指针。
上一页12 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 下一页
上一篇:搜索技巧
下一篇:本地化关键概念