What is UTF-8?

  • Thread starter Thread starter Just Me
  • Start date Start date
J

Just Me

I can save a file as ANSI (I think I know what that is) or Unicode (I think
I know what that is) or UTF-8, (I've no idea what that is).
Can someone give me a brief into to UTF-8?

I get the impression I should be using it for clipboard but maybe not for
files.


Thanks



For example:

ITextDocument::Save Method
....snip...
CodePage
[in] The specified code page. Common values are CP_ACP (zero: system ANSI
code page), 1200 (Unicode), and 1208 (Unicode transformation format
(UTF)-8).
 
Just Me said:
I can save a file as ANSI (I think I know what that is) or Unicode (I think
I know what that is) or UTF-8, (I've no idea what that is).
Can someone give me a brief into to UTF-8?

UTF-8 is a text encoding:

UTF-8 and Unicode Standards
<URL:http://www.utf-8.com/>

UTF-8 is often used for encoding XML/XHTML files, for example, but you can
encode other text formats in UTF-8 too if the libraries used to read the
file can deal with this encoding.
 
Great article. Makes me wonder if when running the English version of
Windows the system admits to surrogate space which I believe would always be
non existant and slow searches,

Thanks
 
Just Me,

UTF is a format on an external device.

In memory it is for characters (strings) forever Unicode.

I hope this helps?

Cor
 
If I understand, some characters take more that 16 bits in UTF. How do they
look in memory? More than 2 bytes??

Thanks, I think the doc will make more sense now. I kept reading Unicode and
wondered which UTF they meant - now I see they meant Unicode because it is
in memory.
 

Ask a Question

Want to reply to this thread or ask your own question?

You'll need to choose a username for the site, which only take a couple of moments. After that, you can post your question and our members will help you out.

Ask a Question

Back
Top