Several encodings are used for Unicode: the two most popular are UTF-8 and UTF-16.

1、

Compressed UTF-8 will likely be close in size to compressed UTF-16, regardless of the initial size difference.

压缩后，UTF-8和UTF-16的大小差不多，不论原始大小相差多少。

互联网摘选

2、

Characters in the ASCII range occupy only half the space in UTF-8 that they do in some other encodings of Unicode, particularly UTF-16.

与其他Unicode编码特别是UTF-16相比，在UTF-8中ASCII字符占用的空间只有一半。

互联网摘选

3、

This method completely ignores all the encoding information available, and the returned string is always encoded in UTF-16.

这个方法完全忽略所有可用的编码信息，所返回的字符串总是用UTF-16编码的。

互联网摘选

4、

The character-based functions may need to convert the input data string to an intermediate UNICODE code page, like UTF-16 or UTF-32, before its processing can be done.

基于字符的函数可能需要将输入数据字符串转换为一个中间的UNICODE代码页，比如UTF-16或UTF-32，然后才能对它进行处理。

互联网摘选

5、

UTF-8 is less likely than UTF-16 or other Unicode encodings to cause problems for systems that are unaware of Unicode and XML.

与UTF-16或其他Unicode编码相比，对于不支持Unicode和XML的系统，UTF-8更不容易造成问题。

互联网摘选

6、

UTF-8 was chosen as the default format for character data columns, with UTF-16 for graphic data columns.

UTF-8被选择为字符数据列的默认格式，其中UTF-16用于图形数据列。

互联网摘选

7、

The character is specified as one or two UTF-16 code units in hexadecimal notation.

这个字符指定为十六进制形式的一个或两个UTF-16编码单元。

互联网摘选

8、

The third subclause gives fixed point pseudo-code for the remaining modules of the coder. GB Support of XML Parser Using "Fake UTF-16" Coding Algorithm

第3小节给出用于编码器的保持系数的定点伪码。一个解决XML解析器对中文数据处理的伪UTF-16编码算法

互联网摘选

9、

There are others ( UTF-16 and UTF-32, for example) defined by the Unicode consortium, but UTF-8 is the best supported encoding for international character sets.

Unicode协会还规定了其他一些编码方式（如UTF-16，UTF-32），但UTF-8是国际字符集支持得最好的一种。

互联网摘选

10、

For instance, the Java ™ language's internal representation of strings is based on UTF-16, which makes indexing into the string much faster.

比如，Java™语言中字符串的内部表示采用UTF-16，因此对字符串的索引更快。

互联网摘选

11、

Thus the encoding attribute of the manually transcoded XML string is still "UTF-16" instead of "Big5".

因此这个以手工方式转码的XML字符串的编码属性仍然是“UTF-16”而不是“Big5”。

互联网摘选

12、

The DOMString type is explicitly specified to consist of wide UTF-16 characters.

DOMString类型被显式指定包括宽UTF-16字符。

互联网摘选

13、

Unicode defines character encodings in three distinct sizes UTF-8, UTF-16, and UTF-32 while the traditional character type is8 bits.

Unicode用三种不同的大小定义字符编码UTF-8、UTF-16和UTF-32而传统的字符类型是8位的。

互联网摘选

14、

Omit the XML declaration, and use the UTF-8 encoding, or use a UTF-16 Unicode Byte Order Mark ( BOM) at the beginning of your document.

在文档的开头部分，省略XML声明，并使用UTF-8编码，或者使用UTF-16Unicode字节顺序标记（Byte Order Mark，BOM）。

互联网摘选

15、

Google doesn't even allow alternate encodings of Unicode such as UTF-16, much less non-Unicode encodings like ISO-8859-1.

Google甚至不允许其他Unicode编码（如UTF-16），更不用说ISO-8859-1这样的非Unicode编码了。

互联网摘选

16、

The input file should be encoded in UTF-8 or UTF-16 format.

应该将输入文件编码为UTF-8或UTF-16格式。

互联网摘选

17、

But even when you're encoding CJK XML in UTF-8, the actual size gain compared to UTF-16 probably isn't so large.

但即使用UTF-8编码CJK XML，实际的大小可能也比UTF-16小。

互联网摘选

18、

IDENTITY_16BIT collator implements CESU-8 ( An8-bit compatibility encoding scheme for UTF-16).

IDENTITY16BIT排序器实现CESU-8（一种8位的兼容UTF-16的编码方案）。

互联网摘选

19、

In addition, there is another encoding scheme called UTF-16 that can also be used to represent supplementary characters.

另外，还有一种称为UTF-16的编码方案，它也可以用来表示补充字符。

互联网摘选

20、

UTF-16 is a variable-width character encoding, once surrogate pairs are taken into account.

如果考虑到替换对，UTF-16是一种变长字符编码。

互联网摘选