PC Review


Reply
Thread Tools Rate Thread

A bug in .Net Binary Serialization?

 
 
ztRon
Guest
Posts: n/a
 
      2nd Jul 2008
Hi all,

I recently came across something really strange and after a couple of days
of debugging, I finally nailed the cause of it. However, I have absolutely no
idea what I am doing wrong or is it just a bug in binary serialization. The
following is a simple example of the code:




using System;
using System.Collections.Generic;
using System.IO;
using System.Runtime.Serialization.Formatters.Binary;

namespace ConsoleApplication5
{
class Program
{
static void Main(string[] args)
{
A a = new A();
B b = new B(a);
List<C> cList = new List<C>();
for (int i = 0; i < 10000; i++)
{
cList.Add(new C("someValue"));
}
b.CList = cList;

MemoryStream stream = new MemoryStream();
BinaryFormatter objFormatter = new BinaryFormatter();
objFormatter.Serialize(stream, b);
}
}

[Serializable]
class A
{
private Dictionary<string, string> _dic1 = new Dictionary<string,
string>();

public A()
{
_dic1.Add("key1", "value1");
_dic1.Add("key2", "value2");
}
}

[Serializable]
class B
{
private List<C> _cList = new List<C>();
private A _a;

public B(A a)
{
_a = a;
}

public List<C> CList
{
get { return _cList; }
set { _cList = value; }
}
}

[Serializable]
class C
{
private Dictionary<string, string> _dic2 = new Dictionary<string,
string>();
private string _value;

public C(string value)
{
_value = value;
}
}
}































































If you run the code, you will find that the stream has a length of 4,532,517
bytes. Now, try changing _dic1(Class A) to be a Dictionary<string, object>
and run the code again. Now, the stream length is 462,924 bytes. Why is there
such a big difference just by changing the type? What I noticed also was that
this might be due to the fact that I have another dictionary of the same type
in Class C.

Am I doing something wrong here? If not, is this a bug?

Thanks in advance!!

 
Reply With Quote
 
 
 
 
ztRon
Guest
Posts: n/a
 
      2nd Jul 2008
Sorry my post seems to have a huge white space in between. Reposting it below:

Hi all,

I recently came across something really strange and after a couple of days
of debugging, I finally nailed the cause of it. However, I have absolutely no
idea what I am doing wrong or is it just a bug in binary serialization. The
following is a simple example of the code:

using System;
using System.Collections.Generic;
using System.IO;
using System.Runtime.Serialization.Formatters.Binary;

namespace ConsoleApplication5
{
class Program
{
static void Main(string[] args)
{
A a = new A();
B b = new B(a);
List<C> cList = new List<C>();
for (int i = 0; i < 10000; i++)
{
cList.Add(new C("someValue"));
}
b.CList = cList;

MemoryStream stream = new MemoryStream();
BinaryFormatter objFormatter = new BinaryFormatter();
objFormatter.Serialize(stream, b);
}
}

[Serializable]
class A
{
private Dictionary<string, string> _dic1 = new Dictionary<string,
string>();

public A()
{
_dic1.Add("key1", "value1");
_dic1.Add("key2", "value2");
}
}

[Serializable]
class B
{
private List<C> _cList = new List<C>();
private A _a;

public B(A a)
{
_a = a;
}

public List<C> CList
{
get { return _cList; }
set { _cList = value; }
}
}

[Serializable]
class C
{
private Dictionary<string, string> _dic2 = new Dictionary<string,
string>();
private string _value;

public C(string value)
{
_value = value;
}
}
}

If you run the code, you will find that the stream has a length of 4,532,517
bytes. Now, try changing _dic1(Class A) to be a Dictionary<string, object>
and run the code again. Now, the stream length is 462,924 bytes. Why is there
such a big difference just by changing the type? What I noticed also was that
this might be due to the fact that I have another dictionary of the same type
in Class C.

Am I doing something wrong here? If not, is this a bug?

Thanks in advance!!
 
Reply With Quote
 
ztRon
Guest
Posts: n/a
 
      2nd Jul 2008
> It would be interesting to try to serialize to a more readable format and
> see what the specific differences are. I don't have the time at the
> moment to explore too much, but it's something you might like to try.


This example was actually derived from a more complex code if that was what
you meant. And in my unit testing of it, I noticed that the size recently
tripled due to the addition of one dictionary even though there is only ever
one instance of it. This was when I started to debug and finally were able to
pinpoint its cause and came out with a simpler example to express this
problem.
 
Reply With Quote
 
ztRon
Guest
Posts: n/a
 
      3rd Jul 2008
The problem with something like the XmlSerializer is that it does not support
serialization of dictionaries.

Does anyone else have any other ideas to this problem?

Thanks.

"Peter Duniho" wrote:

> On Wed, 02 Jul 2008 00:20:01 -0700, ztRon
> <(E-Mail Removed)> wrote:
>
> >> It would be interesting to try to serialize to a more readable format
> >> and
> >> see what the specific differences are. I don't have the time at the
> >> moment to explore too much, but it's something you might like to try.

> >
> > This example was actually derived from a more complex code if that was
> > what
> > you meant.

>
> No, it's not. The code you posted was fine. I'm talking about the
> resulting data itself. Serialize less, and to a format like SOAP so that
> you can take the two alternatives and inspect them as text files
> side-by-side. That should give you some clues as to what differences
> exist between the two. And that _might_ lead you to some useful
> conclusion as to why such a simple change produces such a dramatic
> difference.
>
> If you can accomplish that with the output from the BinaryFormatter, more
> power to you. But I'd go with a text-format serialization. I naïvely
> tried to swap in an XmlSerializer for the BinaryFormatter, but of course
> it has different requirements from the regular serialization stuff (for
> one, it requires everything to be public that's going to be serialized).
> I didn't have the time to make the necessary adjustments, but that could
> be something you might try, since the output from the XmlSerializer is yet
> again much more readable than SOAP.
>
> Pete
>

 
Reply With Quote
 
ztRon
Guest
Posts: n/a
 
      3rd Jul 2008
But isn't it the same with SOAP? I think SOAP does not support Generics which
thus means that it doesn't support dictionaries?


> Well, like I said, SOAP is also basically text-based and you can use that
> as easily as BinaryFormatter.
>
> Pete
>

 
Reply With Quote
 
ztRon
Guest
Posts: n/a
 
      3rd Jul 2008
I actually did that yesterday using the SOAPFormatter and it did not work,
which was why I thought maybe you meant something else.

"Peter Duniho" wrote:

> On Wed, 02 Jul 2008 21:06:00 -0700, ztRon
> <(E-Mail Removed)> wrote:
>
> > But isn't it the same with SOAP? I think SOAP does not support Generics
> > which
> > thus means that it doesn't support dictionaries?

>
> My recollection is that it does. I admit, I haven't tested it recently to
> make sure. But I am under the impression that you can just swap in
> SoapFormatter where you have BinaryFormatter, and it will "just work".
>
> If I'm wrong, well...it should only take you a few minutes to find out.
>
> Pete
>

 
Reply With Quote
 
SMJT
Guest
Posts: n/a
 
      3rd Jul 2008
On Jul 2, 6:50*am, ztRon <zt...@discussions.microsoft.com> wrote:
> Hi all,
>
> I recently came across something really strange and after a couple of days
> of debugging, I finally nailed the cause of it. However, I have absolutely no
> idea what I am doing wrong or is it just a bug in binary serialization. The
> following is a simple example of the code:
>
> using System; *
> using System.Collections.Generic; *
> using System.IO; *
> using System.Runtime.Serialization.Formatters.Binary; *
>
> namespace ConsoleApplication5 *
> { *
> * * class Program *
> * * { *
> * * * * static void Main(string[] args) *
> * * * * { *
> * * * * * * A a = new A(); *
> * * * * * * B b = new B(a); *
> * * * * * * List<C> cList = new List<C>(); *
> * * * * * * for (int i = 0; i < 10000; i++) *
> * * * * * * { *
> * * * * * * * * cList.Add(new C("someValue")); *
> * * * * * * } *
> * * * * * * b.CList = cList; *
>
> * * * * * * MemoryStream stream = new MemoryStream(); *
> * * * * * * BinaryFormatter objFormatter = new BinaryFormatter(); *
> * * * * * * objFormatter.Serialize(stream, b); *
> * * * * } *
> * * } *
>
> * * [Serializable] *
> * * class A *
> * * { *
> * * * * private Dictionary<string, string> _dic1 = new Dictionary<string,
> string>(); *
>
> * * * * public A() *
> * * * * { *
> * * * * * * _dic1.Add("key1", "value1"); *
> * * * * * * _dic1.Add("key2", "value2"); *
> * * * * } *
> * * } *
>
> * * [Serializable] *
> * * class B *
> * * { *
> * * * * private List<C> _cList = new List<C>(); *
> * * * * private A _a; *
>
> * * * * public B(A a) *
> * * * * { *
> * * * * * * _a = a; *
> * * * * } *
>
> * * * * public List<C> CList *
> * * * * { *
> * * * * * * get { return _cList; } *
> * * * * * * set { _cList = value; } *
> * * * * } *
> * * } *
>
> * * [Serializable] *
> * * class C *
> * * { *
> * * * * private Dictionary<string, string> _dic2 = new Dictionary<string,
> string>(); *
> * * * * private string _value; *
>
> * * * * public C(string value) *
> * * * * { *
> * * * * * * _value = value; *
> * * * * } *
> * * } *
>
> } *
>
> If you run the code, you will find that the stream has a length of 4,532,517
> bytes. Now, try changing _dic1(Class A) to be a Dictionary<string, object>
> and run the code again. Now, the stream length is 462,924 bytes. Why is there
> such a big difference just by changing the type? What I noticed also was that
> this might be due to the fact that I have another dictionary of the same type
> in Class C.
>
> Am I doing something wrong here? If not, is this a bug?
>
> Thanks in advance!!


ztRon,

I don't think this is a bug, but just the way the data is stored when
you use binary serialization.

I had a similar problem when serializing classes to a file, where my
class contained an array of strings. If the string values were all the
same, then only one copy of the string was stored rather than multiple
copies of the same string (which I think is quiet clever really, saves
space and is probably quicker or something).

My bug was that when I changed one of the strings the serialized class
size changed so shouldn't have been writen back to the same slot in my
file and I ended up corrupting my data file.

So I think you don't have a bug, just a feature of binary
serialization.

SMJT
 
Reply With Quote
 
not_a_commie
Guest
Posts: n/a
 
      3rd Jul 2008
I think you'll have better luck with the newer DataContractSerializer.
It works way better than the older serialization stuff. Here's a cut
from my code.

public byte[] GetDataBytes(params Type[] types)
{
var ds = new DataContractSerializer(GetType(), types);
using (var mem = new MemoryStream())
{
//using (var w = XmlDictionaryWriter.CreateTextWriter(mem)) // for
xml
using (var w = XmlDictionaryWriter.CreateBinaryWriter(mem))
{
ds.WriteObject(w, this);
}
return mem.ToArray();
}
}

And I prefer the DataContract and DataMember attributes more than the
Serializable one.
 
Reply With Quote
 
 
 
Reply

Thread Tools
Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Binary Serialization shapper Microsoft ASP .NET 2 10th Mar 2007 06:12 PM
Binary serialization =?Utf-8?B?SmFjcXVlcw==?= Microsoft Dot NET 15 28th Jun 2006 01:50 PM
binary vs. xml serialization Alex D. Microsoft ASP .NET 4 21st Dec 2005 05:47 PM
Binary Serialization over the net =?Utf-8?B?R2lsYWQgS2FwZWx1c2huaWs=?= Microsoft C# .NET 1 7th Nov 2005 02:21 PM
Serialization, binary AND xml =?Utf-8?B?SmVzcGVy?= Microsoft C# .NET 0 25th Sep 2004 03:33 PM


Features
 

Advertising
 

Newsgroups
 


All times are GMT +1. The time now is 10:11 AM.