ASCII Character Conversion

Phanidhar · Apr 10, 2008

Hi,
I'm developing a Winform application in C#( .net 2.0). I've a dialog box
where user can input text and that text would be sent across to other machine
using sockets.

When the user enters ASCII character which are non-printable like ASCII 20
( using ALT+20), this character is converted to ASCII value 194( or something
like that). What should be done to preserve the original ASCII value of 20?

Thanking you in advance.
Phani

Jon Skeet [C# MVP] · Apr 10, 2008

I'm developing a Winform application in C#( .net 2.0). I've a dialog box
where user can input text and that text would be sent across to other machine
using sockets.

When the user enters ASCII character which are non-printable like ASCII20
( using ALT+20), this character is converted to ASCII value 194( or something
like that). What should be done to preserve the original ASCII value of 20?

I think your use of the term "ASCII" is pretty loose here. What actual
character are you talking about? ASCII 20 (well, pseudo-ASCII - I
believe true, pedantic, standard ASCII only starts at 32) is a control
code. All .NET characters are stored as Unicode - what's the Unicode
code point for the character you're trying to represent?

If, fundamentally, you're trying to send non-text data you shouldn't
try to pretend it's text. Separate out the text data from the binary
data and be very careful about how you use each of them.

Jon

Jeroen Mostert · Apr 10, 2008

Jon said:
ASCII 20 (well, pseudo-ASCII - I believe true, pedantic, standard ASCII
only starts at 32)

No, the control codes are part of the standard; 20 is DLE (Data Link
Escape), though good luck quizzing people on what that was used for. The
*printable* ASCII characters are in the range 32-126, 127 is the control
code DEL, and the remainder is, popular misconceptions to the contrary, not
part of ASCII (and there's no single "extended ASCII" character set, let
alone a standard).

Jeroen Mostert · Apr 10, 2008

Jeroen said:
No, the control codes are part of the standard; 20 is DLE (Data Link
Escape), though good luck quizzing people on what that was used for.

Hum. 20 is DLE in octal. In decimal, 16 is DLE; 20 is Device Control 4,
which is obviously *quite* different. I mean, imagine sending DLE when the
other side expects DC4, or vice versa! It doesn't take a scientist to
realize the potential for disaster, or something.

Anyway, moving swiftly on...

Jon Skeet [C# MVP] · Apr 10, 2008

No, the control codes are part of the standard; 20 is DLE (Data Link
Escape), though good luck quizzing people on what that was used for. The
*printable* ASCII characters are in the range 32-126, 127 is the control
code DEL

For some reason I had the impression that the "full" ISO standard for
ASCII didn't include either 0-31 or 127 itself. However, I can't
remember any source for that, and certainly it's not the commonly used
idea of ASCII.

and the remainder is, popular misconceptions to the contrary, not
part of ASCII (and there's no single "extended ASCII" character set, let
alone a standard).

Heartily agreed

Jon

Alain Boss · Apr 14, 2008

Jon said:
For some reason I had the impression that the "full" ISO standard for
ASCII didn't include either 0-31 or 127 itself. However, I can't
remember any source for that, and certainly it's not the commonly used
idea of ASCII.

0x20 = 32 decimal = 'space' in ASCII

regards
Alain

Jon Skeet [C# MVP] · Apr 14, 2008

0x20 = 32 decimal = 'space' in ASCII

True, but I don't see the relevance. I don't believe the OP was
talking about space, for example.

Jon

Arne Vajhøj · Apr 15, 2008

Jon said:
For some reason I had the impression that the "full" ISO standard for
ASCII didn't include either 0-31 or 127 itself. However, I can't
remember any source for that, and certainly it's not the commonly used
idea of ASCII.

If http://en.wikipedia.org/wiki/ASCII is correct then the
non printable are part of ASCII.

And lots of non printable characters are or were widely used
in both files and communication.

CR and LF are still used.

XON, XOFF were used a lot in terminals (the real ones, not
terminal emulators with a buffer of 2000 lines).

Arne

Jon Skeet [C# MVP] · Apr 15, 2008

Ifhttp://en.wikipedia.org/wiki/ASCIIis correct then the
non printable are part of ASCII.

And lots of non printable characters are or were widely used
in both files and communication.

CR and LF are still used.

Oh absolutely - that's why I was surprised when I was first told that
they weren't part of "official" ASCII.

I just wish I could remember where I heard it from. I suspect it was
in a discussion which included a debate about whether ISO-8859-1 has a
"hole" between 128 and 159, or whether it includes other control
characters. (I argued from the Unicode documentation which states - or
at least stated - that the first 256 characters of Unicode were the
same as in ISO-8859-1; others argued from other sources.)

I'm happy to just assume I'm wrong on this one though - certainly
everyone realistically includes 0-31 as part of ASCII.

Jon

Arne Vajhøj · Apr 16, 2008

I just wish I could remember where I heard it from. I suspect it was
in a discussion which included a debate about whether ISO-8859-1 has a
"hole" between 128 and 159, or whether it includes other control
characters. (I argued from the Unicode documentation which states - or
at least stated - that the first 256 characters of Unicode were the
same as in ISO-8859-1; others argued from other sources.)

Unfortunately ISO standards are not freely (beer not speach)
available.

They are in Unicode Basic Latin
http://www.unicode.org/charts/PDF/U0000.pdf and
Latin1 http://www.unicode.org/charts/PDF/U0080.pdf !

I know that they are defined in DECMCS the Predecessor
of ISO-8859-1.

Considering that they are in DECMCS and in Unicode
under the Latin1 name (which is a known synonym
for ISO-8859-1), then there are very strong
indications that they are in ISO-8859-1.

Arne

Non-ascii characters in VS.NET service	10	Feb 9, 2007
Converting a single ASCII character to an int	6	Aug 31, 2006
Reading an Ascii string	18	Jul 8, 2006
Removing non-ascii characters from a string	13	Aug 29, 2008
Unicode character conversation	4	Mar 21, 2006
Replacing Printable Ascii Codes	1	Feb 2, 2010
Convert Ascii Character to decimal	3	Mar 4, 2005
byte[] to enter key	8	Dec 21, 2006

ASCII Character Conversion

Phanidhar

Jon Skeet [C# MVP]

Jeroen Mostert

Jeroen Mostert

Jon Skeet [C# MVP]

Alain Boss

Jon Skeet [C# MVP]

Arne Vajhøj

Jon Skeet [C# MVP]

Arne Vajhøj

Ask a Question

Similar Threads