What is UTF-8?

J

Just Me

I can save a file as ANSI (I think I know what that is) or Unicode (I think
I know what that is) or UTF-8, (I've no idea what that is).
Can someone give me a brief into to UTF-8?

I get the impression I should be using it for clipboard but maybe not for
files.


Thanks



For example:

ITextDocument::Save Method
....snip...
CodePage
[in] The specified code page. Common values are CP_ACP (zero: system ANSI
code page), 1200 (Unicode), and 1208 (Unicode transformation format
(UTF)-8).
 
H

Herfried K. Wagner [MVP]

Just Me said:
I can save a file as ANSI (I think I know what that is) or Unicode (I think
I know what that is) or UTF-8, (I've no idea what that is).
Can someone give me a brief into to UTF-8?

UTF-8 is a text encoding:

UTF-8 and Unicode Standards
<URL:http://www.utf-8.com/>

UTF-8 is often used for encoding XML/XHTML files, for example, but you can
encode other text formats in UTF-8 too if the libraries used to read the
file can deal with this encoding.
 
J

Just Me

Great article. Makes me wonder if when running the English version of
Windows the system admits to surrogate space which I believe would always be
non existant and slow searches,

Thanks
 
C

Cor Ligthert

Just Me,

UTF is a format on an external device.

In memory it is for characters (strings) forever Unicode.

I hope this helps?

Cor
 
J

Just Me

If I understand, some characters take more that 16 bits in UTF. How do they
look in memory? More than 2 bytes??

Thanks, I think the doc will make more sense now. I kept reading Unicode and
wondered which UTF they meant - now I see they meant Unicode because it is
in memory.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Top