Decompressing using GZipStream

Big Daddy

There's an example in the .NET documentation of using GZipStream to
decompress a byte array:

http://msdn.microsoft.com/en-us/library/as1ff51s.aspx

In the example, the data is read out of the stream 100 bytes at a time
in a while loop until it's finished:

private const int buffer_size = 100;

public static int ReadAllBytesFromStream(Stream stream, byte[] buffer)
{
    // Use this method to read all bytes from a stream.
    int offset = 0;
    int totalCount = 0;
    while (true)
    {
        int bytesRead = stream.Read(buffer, offset, buffer_size);
        if (bytesRead == 0)
        {
            break;
        }
        offset += bytesRead;
        totalCount += bytesRead;
    }
    return totalCount;
}

int totalCount = GZipTest.ReadAllBytesFromStream(zipStream, decompressedBuffer);

Why isn't it read out in one fell swoop like this:

int bytesRead = stream.Read(buffer, offset, buffer.Length);

I have tried it and it seems to work. The code would be cleaner and
would run faster, but I'm worried that there's a good reason the
documentation reads it out 100 bytes at a time.
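
To be concrete, here's roughly what I tried (a sketch, not the MSDN
code; the file name and buffer size are made up for the example):

using System.IO;
using System.IO.Compression;

// Sketch of the all-at-once version. "data.gz" and the 100,000-byte
// buffer are placeholders.
byte[] compressedData = File.ReadAllBytes("data.gz");
using (MemoryStream ms = new MemoryStream(compressedData))
using (GZipStream zipStream = new GZipStream(ms, CompressionMode.Decompress))
{
    byte[] decompressedBuffer = new byte[100000];
    // One call instead of the 100-byte loop; this assumes a single
    // Read returns everything.
    int bytesRead = zipStream.Read(decompressedBuffer, 0, decompressedBuffer.Length);
}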

Any ideas?
thanks in advance,
John
 
Cowboy (Gregory A. Beamer)

In essence, you are not using a buffer, but a dump. This is fine for small
objects, but can be nasty when you get to very large compressed bits. That
is why you normally see a buffer of n characters rather than a dump of the
entire contents.
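
In other words, with a real buffer you push each chunk along as you
go, something like this rough sketch (the input/output stream names
are just placeholders):

// Rough sketch: stream from input to output with a small fixed
// buffer, so memory use stays constant however big the data gets.
byte[] buffer = new byte[4096];
int bytesRead;
while ((bytesRead = input.Read(buffer, 0, buffer.Length)) > 0)
{
    output.Write(buffer, 0, bytesRead);
}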

--
Gregory A. Beamer
MVP; MCP: +I, Se, SD, DBA

Blog:
http://feeds.feedburner.com/GregoryBeamer

*************************************************
| Think outside the box! |
*************************************************
 
Big Daddy

> In essence, you are not using a buffer, but a dump. This is fine for small
> objects, but can be nasty when you get to very large compressed bits. That
> is why you normally see a buffer of n characters rather than a dump of the
> entire contents.

I don't understand what you mean. What's the difference between a
buffer and a dump? Do you mean that if you read it out 100 bytes at a
time (a buffer), you can process it 100 bytes at a time and never need
the entire contents in a byte array at once, in case it's really big?
If the output might be over a gig in size, you wouldn't want to put
the whole thing into an array at once. But my output won't be bigger
than tens of thousands of bytes, and I have to have the whole thing in
a byte array at once anyway. So would it be OK for me to just use the
simpler method (all at once) rather than 100 bytes at a time?
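
Do you mean something like this? (My guess at the chunk-at-a-time
idea, with made-up names; summing bytes just stands in for real
per-chunk processing.)

// Guess at the buffered approach: handle each chunk as it arrives
// instead of holding the whole payload in memory at once.
byte[] buffer = new byte[100];
int bytesRead;
long checksum = 0;
while ((bytesRead = zipStream.Read(buffer, 0, buffer.Length)) > 0)
{
    for (int i = 0; i < bytesRead; i++)
    {
        checksum += buffer[i];
    }
}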

thanks
 
Moe Sisko

> Why isn't it read out in one fell swoop like this:
>
> int bytesRead = stream.Read(buffer, offset, buffer.Length);
>
> I have tried it and it seems to work. The code would be cleaner and
> would run faster, but I'm worried that there's a good reason the
> documentation reads it out 100 bytes at a time.

Well, it will probably work, but the documentation for Stream.Read
doesn't guarantee it: Read may return fewer bytes than requested even
when more data is still to come.

More info : http://www.yoda.arachsys.com/csharp/readbinary.html

FWIW, here's a slightly cleaner (IMO) version of the looping method:

int offset = 0;
int totalCount = 0;
int bytesRead;
while ((bytesRead = stream.Read(buffer, offset, buffer_size)) > 0)
{
    offset += bytesRead;
    totalCount += bytesRead;
}
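
And if you need to be sure the buffer gets completely filled, the
linked article's approach boils down to something like this sketch:

// Sketch in the spirit of the linked article: keep calling Read until
// the buffer is full or the stream ends, since a single call may
// return fewer bytes than requested.
static int ReadFully(Stream stream, byte[] buffer)
{
    int offset = 0;
    while (offset < buffer.Length)
    {
        int bytesRead = stream.Read(buffer, offset, buffer.Length - offset);
        if (bytesRead == 0)
        {
            break; // end of stream
        }
        offset += bytesRead;
    }
    return offset; // total bytes actually read
}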
 
