Extended ASCII Encoding in .NET

Guest · Dec 1, 2005

Does anyone know how to encode a string into extended ASCII bytes?

For example, "ä" is 228 in the extended ASCII character set.

ASCIIEncoding supports only 7-bit ASCII, so every character in the extended
set is encoded as "?".

UnicodeEncoding, UTF7Encoding, UTF8Encoding, and UTF32Encoding do not give
the correct results. I was thinking of writing a new encoder class, but before
that I would like to know whether some class in .NET can already do the
encoding.

Gerrit H · Dec 1, 2005

Reading a specific encoding is supported in .NET. Try code similar to the
following:

// WindowsCodePage is an int; GetEncoding turns it into the Encoding that StreamReader expects
_inputReader = new StreamReader(inputStream,
    System.Text.Encoding.GetEncoding(System.Text.Encoding.ASCII.WindowsCodePage));
_line = _inputReader.ReadLine();

Replace System.Text.Encoding.ASCII.WindowsCodePage with the code page of the
encoding you need, and the StreamReader will do the rest.

G
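
For readers who want to try the StreamReader approach in isolation, here is a minimal sketch. The class and method names (ReaderDemo, ReadFirstLine) are invented for illustration, and the RegisterProvider line assumes .NET Core / .NET 5+, where legacy code pages are not available by default; on .NET Framework that line is unnecessary and can be removed.

```csharp
using System.IO;
using System.Text;

public static class ReaderDemo
{
    // Reads the first line of a byte stream, decoding it with the given code page.
    // (Hypothetical helper, not part of .NET.)
    public static string ReadFirstLine(Stream input, int codePage)
    {
        // Assumption: running on .NET Core / .NET 5+, where code pages such as 437
        // must be registered first. Registering the same provider twice is harmless.
        Encoding.RegisterProvider(CodePagesEncodingProvider.Instance);

        using (var reader = new StreamReader(input, Encoding.GetEncoding(codePage)))
        {
            return reader.ReadLine();
        }
    }
}
```

For example, ReaderDemo.ReadFirstLine(new MemoryStream(new byte[] { 0x84 }), 437) should yield "ä", because byte 0x84 (132) is "ä" in code page 437.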

Jon Skeet [C# MVP] · Dec 1, 2005

JoeUser said:
Does anyone know how to encode a string into extended ASCII bytes?

There is no one "extended ASCII" encoding.

For example, "?" is 228 in the extended ASCII character set.

In *which* "extended ASCII" character set? There are lots of code pages
which might all be called "extended ASCII".

You need to find out which code page you really mean, then ask for the
encoding for that code page.

See http://www.pobox.com/~skeet/csharp/unicode.html for more
information.
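
To make the "ask for the encoding for that code page" step concrete, here is a small sketch. CodePageDemo and EncodeWith are made-up names, and the RegisterProvider call assumes .NET Core / .NET 5+; on .NET Framework it is not needed and can be removed.

```csharp
using System.Text;

public static class CodePageDemo
{
    // Encodes text using the encoding registered for the given code page number.
    // (Hypothetical helper, not part of .NET.)
    public static byte[] EncodeWith(int codePage, string text)
    {
        // Assumption: .NET Core / .NET 5+, where legacy code pages such as 437
        // are only available after the code pages provider is registered.
        Encoding.RegisterProvider(CodePagesEncodingProvider.Instance);

        return Encoding.GetEncoding(codePage).GetBytes(text);
    }
}
```

The same character gives different bytes under different code pages: EncodeWith(28591, "\u00E4") produces the single byte 228, while EncodeWith(437, "\u00E4") produces 132. That is why "extended ASCII" alone is not enough to pick an encoding.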

Guest · Dec 2, 2005

Hello Jon,

Thank you for your comments and help. The problem still exists, but I found a
workaround.

By "Extended ASCII set" I meant the characters with codes 128 ... 255; this
set is also called the "IBM character set" or 8-bit ASCII. In the
following, I refer to this set of characters.

The closest code page in my application is 28591.

Gerrit's reply gave me the idea to try the Encoding class as follows:

Encoding enc = Encoding.GetEncoding(28591);
byte[] encodedBytes = enc.GetBytes(myString);

However, this does not produce the extended ASCII character set. For example:
encodedBytes = enc.GetBytes("ä");
// encodedBytes[0] = 228, encodedBytes.Length = 1
// "ä" is character number 132 in the Extended ASCII set

However, an ASCII conversion is not absolutely necessary as long as
the conversion is unique, that is, each character in the Extended set gets a
unique value between 128 ... 255. A conversion to 8-bit ASCII would have
been the best option, but this other option seems to provide a workable
solution.
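
One way to check the uniqueness requirement described above is to round-trip every byte value through the encoding. This is only a sketch; RoundTripDemo and IsLossless are names invented here, not anything from .NET.

```csharp
using System.Text;

public static class RoundTripDemo
{
    // Returns true if every byte value 0..255 survives decoding to a string
    // and encoding back to bytes, i.e. the byte <-> character mapping is unique.
    // (Hypothetical helper, not part of .NET.)
    public static bool IsLossless(Encoding enc)
    {
        var all = new byte[256];
        for (int i = 0; i < 256; i++)
        {
            all[i] = (byte)i;
        }

        byte[] back = enc.GetBytes(enc.GetString(all));
        if (back.Length != all.Length)
        {
            return false;
        }

        for (int i = 0; i < 256; i++)
        {
            if (back[i] != all[i])
            {
                return false;
            }
        }
        return true;
    }
}
```

IsLossless(Encoding.GetEncoding(28591)) should return true, since code page 28591 (ISO-8859-1) maps each byte to the Unicode character with the same number. Encoding.ASCII fails the check because bytes above 127 are not preserved.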

Stefan Simek · Dec 2, 2005

Hi,

The following encodings seem to fulfill your 'ä' = 132 request:

437 - IBM437
775 - ibm775
850 - ibm850
852 - ibm852
857 - ibm857
858 - IBM00858
861 - ibm861
865 - IBM865
29001 - x-Europa

But I expect you need one of the IBM 850/852 encodings, which are (or were)
widely used. I've never heard any of them referred to as
"Extended ASCII", though.

HTH,
Stefan
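
A list like the one above can be reproduced by testing candidate code pages one by one. The following is a sketch under assumptions: EncodingSearch and FindCodePages are hypothetical names, the caller supplies the candidate code pages, and the RegisterProvider line assumes .NET Core / .NET 5+ (remove it on .NET Framework).

```csharp
using System;
using System.Collections.Generic;
using System.Text;

public static class EncodingSearch
{
    // Returns the candidate code pages that encode ch as exactly one byte
    // with the expected value. (Hypothetical helper, not part of .NET.)
    public static List<int> FindCodePages(IEnumerable<int> candidates, char ch, byte expected)
    {
        // Assumption: .NET Core / .NET 5+, where legacy code pages need this provider.
        Encoding.RegisterProvider(CodePagesEncodingProvider.Instance);

        var matches = new List<int>();
        foreach (int codePage in candidates)
        {
            Encoding enc;
            try
            {
                enc = Encoding.GetEncoding(codePage);
            }
            catch (Exception e) when (e is ArgumentException || e is NotSupportedException)
            {
                continue; // code page not available on this platform
            }

            byte[] bytes = enc.GetBytes(ch.ToString());
            if (bytes.Length == 1 && bytes[0] == expected)
            {
                matches.Add(codePage);
            }
        }
        return matches;
    }
}
```

With the candidates 28591, 437, 850 and 1252, searching for '\u00E4' and 132 should return 437 and 850. Candidates could also be gathered from Encoding.GetEncodings() via EncodingInfo.CodePage, although which encodings that call reports depends on the runtime.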

Jon Skeet [C# MVP] · Dec 2, 2005

JoeUser said:
Thank you for your comments and help. The problem still exists, but I found a
workaround.

By "Extended ASCII set" I meant the characters with codes 128 ... 255; this
set is also called the "IBM character set" or 8-bit ASCII.

That doesn't describe a single set of characters. What Unicode
character do you want byte 128 to mean, for instance? What Unicode
character do you want byte 129 to mean?

In the following, I refer to this set of characters.

The closest code page in my application is 28591.

Gerrit's reply gave me the idea to try the Encoding class as follows:

Encoding enc = Encoding.GetEncoding(28591);
byte[] encodedBytes = enc.GetBytes(myString);

However, this does not produce the extended ASCII character set. For example:
encodedBytes = enc.GetBytes("?");
// encodedBytes[0] = 228, encodedBytes.Length = 1
// "?" is character number 132 in the Extended ASCII set

What actual character is it?

However, an ASCII conversion is not absolutely necessary as long as
the conversion is unique, that is, each character in the Extended set gets a
unique value between 128 ... 255. A conversion to 8-bit ASCII would have
been the best option, but this other option seems to provide a workable
solution.

If you could tell us which Unicode character you expect to get from
each byte, we could probably work out which encoding you actually mean.
Did you read the link I referenced, by the way?
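
One way to answer the "which Unicode character for each byte" question is to print the mapping for a candidate code page and compare it with what the application expects. This is a sketch only; ByteTable and DescribeUpperHalf are invented names, and the RegisterProvider line assumes .NET Core / .NET 5+ (remove it on .NET Framework).

```csharp
using System.Text;

public static class ByteTable
{
    // Returns one line per byte value 128..255, showing the Unicode code point
    // that the given code page decodes it to. (Hypothetical helper, not part of .NET.)
    public static string[] DescribeUpperHalf(int codePage)
    {
        // Assumption: .NET Core / .NET 5+, where legacy code pages need this provider.
        Encoding.RegisterProvider(CodePagesEncodingProvider.Instance);

        Encoding enc = Encoding.GetEncoding(codePage);
        var lines = new string[128];
        for (int b = 128; b < 256; b++)
        {
            string decoded = enc.GetString(new[] { (byte)b });
            lines[b - 128] = decoded.Length == 0
                ? string.Format("{0} -> (nothing)", b)
                : string.Format("{0} -> U+{1:X4}", b, (int)decoded[0]);
        }
        return lines;
    }
}
```

For code page 437 the entry for byte 132 reads "132 -> U+00E4" (ä), while for code page 28591 the same character shows up at byte 228.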
