UTF-16

Tony Johansson

Hello!

Here is some text, and somewhere in the middle it says that "a
code point is encoded into a sequence of one or more 16-bit values".

I thought that if you use UTF-16, a code point is encoded as a single
16-bit value, not, as the text says, as a sequence of one or more
16-bit values?

The Unicode Standard identifies each Unicode character with a unique 21-bit
scalar number called a code point, and defines the UTF-16 encoding form that
specifies how a code point is encoded into a sequence of one or more 16-bit
values. Each 16-bit value ranges from hexadecimal 0x0000 through 0xFFFF and
is stored in a Char structure. The value of a Char object is its 16-bit
numeric (ordinal) value.
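
For reference, here is a minimal C# sketch of my own (not part of the quoted
documentation; the class name is just illustrative) that reads the 16-bit
ordinal value the text is talking about:

using System;

class CharOrdinalDemo
{
    static void Main()
    {
        char c = 'A';              // the BMP character U+0041
        ushort ordinal = c;        // char converts implicitly to its 16-bit value
        Console.WriteLine($"U+{ordinal:X4}");   // prints "U+0041"
    }
}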

//Tony
 
Arne Vajhøj

> I thought that if you use UTF-16, a code point is encoded as a single
> 16-bit value, not, as the text says, as a sequence of one or more
> 16-bit values?


When Unicode and UTF-16 were designed, there were fewer than
65,536 code points, so every code point always fit in a
single 16-bit value.
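
As an illustration (my own sketch, not from the original post), such a code
point is stored as a single char in .NET:

using System;

class SingleUnitDemo
{
    static void Main()
    {
        string s = "é";                          // U+00E9, well below 65,536
        Console.WriteLine(s.Length);             // 1 -- one 16-bit value
        Console.WriteLine($"U+{(int)s[0]:X4}");  // prints "U+00E9"
    }
}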

When Unicode grew past 65,536 code points, they had to use
two 16-bit values for certain code points.
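
Here is a sketch of that case (again my own example; U+1D11E, the musical
G clef symbol, is just one arbitrary choice of such a code point):

using System;

class SurrogatePairDemo
{
    static void Main()
    {
        string s = "\U0001D11E";                        // a code point above U+FFFF
        Console.WriteLine(s.Length);                    // 2 -- two 16-bit values
        Console.WriteLine(char.IsHighSurrogate(s[0]));  // True
        Console.WriteLine(char.IsLowSurrogate(s[1]));   // True
        int codePoint = char.ConvertToUtf32(s, 0);      // recombine the pair
        Console.WriteLine($"U+{codePoint:X}");          // prints "U+1D11E"
    }
}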

You may work as a software developer in Sweden for 40 years
and never see a code point that requires two 16-bit values
in real life.

Arne
 
