decompression problem

Frederik Vanderhaegen · Oct 1, 2006

Hey,

I've written a http sniffer that monitors the traffic on port 80.
This application works like it should be but there is one small problem.
Suppose I run the application and I surf to www.google.be then I retrieve
all the HTTP messages (Response and Request)
but the content of the message is gzip encoded (http compression).

Here you see the original message:

HTTP/1.1 200 OK
Cache-Control: private
Content-Type: text/html; charset=UTF-8
Content-Encoding: gzip
Server: GWS/2.1
Content-Length: 1763
Date: Sun, 01 Oct 2006 09:22:42 GMT

< ÿWërÛ¸þï§@èF×'(ÛqÖµDf'l.înÓNsíL[·<$±
-{=~zþècì<õ? %1-´òX"Às¿|8~¦äÑ¼ sFó%.1Õ-5»
½D
ÂOÍMiW¡gàÚ-qF',*
&üùÓÛñTÍ
3¢wRææ[Íµ¹±?OÆã½X¦7#"Zè¨MSÛEZ3Z2~sN£|¤©Ðc
Sew{f¿Âùñ´ºÆåm"¹TçûÓir·7GóÀ?ß>ëD±ÊàfU"Õ"1L
¢³¡>Ê¤.ÑöI6YN2\è¡?»Û%|1¬I6Jh:JÌ(IG:÷oY6\ó²'æ ýÛ+ª¬~Håj"')üüñâµ,+)'¼
;6Ï è"V0³üuèyÍfdY+ªhÙQýffeÄså[ò>"Þ!Ø?,SÓ?Áþä»`äy¾¿Ùº<ò'÷ôø.ç7$ó·´$k¤ànóõùÛJøÜØ3+raÝú'ÐPûMÃOÞa'ópÍ}^ÜÆ1ß.ZI©XãXøLæÙGúýïÿ¾þñmýþÃÉûàê|v§ÀÔJ£jÀÄÁÔîan»o®Dmý8oj
ÜÏs±õbAØáL,Ü"IÈ.[zÑ³~P·Ê²?PSY$újëfY.TåL"'Ä=ÀòÂ"'Ñ<ÁäÂÒ¦1K.,
§$ÎuE&òvUÑ4u«KMM§O'Í²¦Vw.Be.!WSVÑÜ7±ÅZ¢9¶z,,ôlêó XV"¼i¤I"Ë?&?¬.ÑÁOÍ{a>'?BKÞ§Z!<Ü<^RQOAÌSbâIí-µK'ÖÏgÖÈhÎÊ-ÅzÁÑ6_ÐÄ¢Iß÷~\ºpíOÃ.[ÄÒY:å¶È\YE5äX¢!§g)ßY9Êxa(ûkòZ¶äN¿÷?:Þôüz]^?ÄÄ"SÃb-ûX{X×Hß3
yuOûÉN7NZr¥túÛskÛçyfÿHfYáizW×Ö¢WÀsöÛoa_-NW±ÂæÌ¤* m
44P.DÐ.F*÷Ay©??°vk¨.Ñ¿¹®È`ð0®"CfYØfis?Ë{¤Ãâc
j°x7¬a,ÒáúOÙÐ[?zÿ[Ï¿mÅ.ø¶fQ<|Î$Kø
~÷Ïï<dT-(ë¸?-FÄ7÷jv×aÙÑ
ûvy¶®ÝpGïÁ<MOMÒ@ÄºsÝÿFdqé_¶`IÕ<#hP¸bÂákÎ'EèµÖcSLÁ´?CÉb@A«|³¢>r%ëJoÁS®ª¾þüQý?0'¶f?,
úÿlÀ|n[ VºoxÔ,ê.~To¼Að? ÃÁYÙj×·¶)*TË§è²-3?ÌµØe8oÖ.â0áøôikæ
t'Um^êÂ,¥x:¸f-8icÒÑ"ôsfÈÐé³3W|§§Z|ÙRãYÑÌy "_%,@x
6lëÒu\2Óqõ?ÈØ^w°\,H* oËÅ?çb'ä¹¶ÙSÉvG_M"¹?¦WT

When I copy the encoded content into notepad and save it as a gz file.
If I try to decompress it, I receive "CRC is invalid".
I've tried several things but nothing seems to work.

Has anybody how I can decompress the content of this message so I retrieve
the html code?

Thx in advance

Frederik

Chris Fulstow · Oct 1, 2006

You can use the GZipStream class to decompress the response stream:
http://msdn2.microsoft.com/en-us/library/system.io.compression.gzipstream.aspx

Or take a look at Fiddler:
http://www.fiddlertool.com/fiddler/

This is an HTTP sniffer with built-in support for gzip decompression.

Jon Slaughter · Oct 1, 2006

Frederik Vanderhaegen said:
Hey,

I've written a http sniffer that monitors the traffic on port 80.
This application works like it should be but there is one small problem.
Suppose I run the application and I surf to www.google.be then I retrieve
all the HTTP messages (Response and Request)
but the content of the message is gzip encoded (http compression).

Here you see the original message:

HTTP/1.1 200 OK
Cache-Control: private
Content-Type: text/html; charset=UTF-8
Content-Encoding: gzip
Server: GWS/2.1
Content-Length: 1763
Date: Sun, 01 Oct 2006 09:22:42 GMT

< ÿWërÛ¸þï§@èF×'(ÛqÖµDf'l.înÓNsíL[·<$±
-{=~zþècì<õ? %1-´òX"Às¿|8~¦äÑ¼ sFó %.1Õ-5»
½D
ÂOÍMiW¡gàÚ-qF',*
&üùÓÛñTÍ
3¢wRææ[Íµ¹±?OÆã½X¦7#"Zè¨MSÛ EZ3Z2~sN£|¤©Ðc
Sew{f¿Âùñ´ºÆåm"¹TçûÓir·7GóÀ?ß>ëD±ÊàfU"Õ"1L
¢³¡>Ê¤.ÑöI6YN2\è¡?»Û%|1¬I6Jh:JÌ(IG:÷oY6\ó²'æ
ýÛ+ª¬~Håj"')üüñâµ,+)'¼ ;6Ï
è"V0³üuèyÍfdY+ªhÙQýffeÄså[ò>"Þ!Ø?,SÓ?Áþä»`äy¾¿Ùº< ò'÷ôø.ç7$ó·´
$k¤ànóõùÛJøÜØ3+raÝú'Ð Pû
MÃOÞa'ópÍ}^ÜÆ1ß.ZI©XãXøLæÙGúýïÿ¾þñmýþÃÉûàê|v§ÀÔJ£jÀÄÁÔîan»o®Dmý8oj
ÜÏs±õbAØáL,Ü"IÈ.[zÑ³~P·Ê²?PSY$újëfY.TåL"'Ä=ÀòÂ"'Ñ<ÁäÂÒ¦1K.,
§$ÎuE&òvUÑ4u«KMM§O'Í²¦Vw.Be.!WSVÑÜ7±ÅZ¢9¶z,,ôlêó
XV"¼i¤I"Ë?&?¬.ÑÁOÍ{a>'?BKÞ§Z!<Ü<^RQOAÌSbâIí-µK'ÖÏgÖÈhÎÊ-ÅzÁÑ6_ÐÄ¢
Iß÷~\ºpíOÃ.[ÄÒY:å¶È\YE5äX¢!§g)ßY9Êxa(ûkòZ¶äN¿÷?:Þôüz]^?ÄÄ"SÃb-ûX{X×Hß3
yuOûÉN7NZr¥túÛskÛçyfÿHfYáizW×Ö¢WÀsöÛoa_-NW±ÂæÌ¤* m 4
4P.DÐ.F*÷Ay©??°vk¨.Ñ¿¹®È`ð0®"CfYØfis?Ë{¤Ãâc
j°x7¬a,ÒáúOÙÐ[?zÿ[Ï¿mÅ.ø¶fQ<|Î$Kø
~÷Ïï<dT-(ë¸?-FÄ7÷jv×aÙÑ ûvy¶®ÝpGïÁ
<MOMÒ@ÄºsÝÿFdqé_¶`IÕ<#hP¸bÂákÎ'EèµÖcSLÁ´?CÉb@A«|³¢>r%ëJoÁS®ª¾þüQý?0'¶f?,
úÿlÀ|n[ VºoxÔ, ê.~To¼Að? ÃÁYÙj×·¶)*TË§è²-3?ÌµØe8oÖ.â0áøôikæ
t'Um^êÂ,¥x:¸f-8icÒÑ"ôsfÈÐé³3W|§§Z|ÙRãYÑÌy "_%,@x
6lëÒu\2Óqõ?ÈØ^w°\,H* oËÅ?çb'ä¹¶ÙSÉvG_M"¹?¦WT

When I copy the encoded content into notepad and save it as a gz file.
If I try to decompress it, I receive "CRC is invalid".
I've tried several things but nothing seems to work.

Because when you cut and paste you are not copying all the characters. There
are many control characters that won't show up such as tabs(0x10 or
something), line feeds(0x9 I guess), etc...

If you stored the msg as a binary then you can use some hex editor to remove
the text header and it might work. (or just try to save the body as a binary
file with .zip extension and then open it and it should work)

Joerg Jooss · Oct 5, 2006

Thus wrote Chris,

You can use the GZipStream class to decompress the response stream:
http://msdn2.microsoft.com/en-us/library/system.io.compression.gzipstr
eam.aspx

Or take a look at Fiddler:
http://www.fiddlertool.com/fiddler/
This is an HTTP sniffer with built-in support for gzip decompression.

In .NET 2.0, HttpWebResponse can decompress HTTP messages automatically,
if you set HttpWebRequest.AutomaticDecompression to an appropriate value
(i.e. anything other than DecompressionMethods.None).

Cheers,

http compression/decompression	2	Aug 23, 2006
Scrabble Value calculation for Welsh words	0	Oct 19, 2021
Chinese antique	1	Oct 10, 2008
Question About Strange Temp Files	1	Mar 29, 2007
dynamic class loading	4	Mar 26, 2005
Cyrstal Report fail problem	2	Dec 6, 2006
what is this? i get it in my inbox every day	2	May 4, 2008
my win2k3 dhcp server problem	1	Sep 3, 2005

decompression problem

Frederik Vanderhaegen

Chris Fulstow

Jon Slaughter

Joerg Jooss

Ask a Question

Similar Threads