PC Review


Reply
Thread Tools Rate Thread

datatable from xml doc and massive time differences on desktop / server

 
 
Hermit Dave
Guest
Posts: n/a
 
      11th Jul 2008
Hello All,

I have an xml document which can contain massive amounts of data with no
fixed child node sequence.
We have some 9000 odd fields and any ones can be a part of the xml.

The way i build the data table is that i process one record at a time and i
keep a list of columns in hash table (with ordinal position) and in a string
collection (for retrieval in correct order) and for each field, if the
column does not exists, i add the column and for that ordinal position, i
set the value in the data table.

For xml doc contain 85000 odd records with say 5 child fields, on my desktop
i can process the whole thing and display paged in list in 4 mins and 40
secs, however on the server it takes around 12 mins.
Same input file, same code, same binaries.

The desktop which performs faster is Core 2 Duo 2.4 GHz with 2 GB of RAM and
running XP
The Server runs 4 x 2.6 Dual core Opteron with 16 GB of RAM running Windows
2003 Server Enterprise edition.

What do you guys think of it ? TIA

Hermit

PS: Its code in C# on vs.net 2003 using standard frameworkclasses
(datatable, xmltextreader, xmldocument etc)

 
Reply With Quote
 
 
 
 
Pavel Minaev
Guest
Posts: n/a
 
      12th Jul 2008
On Jul 12, 12:01*am, "Hermit Dave"
<hermitd.REM...@CAPS.AND.DOTS.hotmail.com> wrote:
> Hello All,
>
> I have an xml document which can contain massive amounts of data with no
> fixed child node sequence.
> We have some 9000 odd fields and any ones can be a part of the xml.
>
> The way i build the data table is that i process one record at a time andi
> keep a list of columns in hash table (with ordinal position) and in a string
> collection (for retrieval in correct order) and for each field, if the
> column does not exists, i add the column and for that ordinal position, i
> set the value in the data table.
>
> For xml doc contain 85000 odd records with say 5 child fields, on my desktop
> i can process the whole thing and display paged in list in 4 mins and 40
> secs, however on the server it takes around 12 mins.
> Same input file, same code, same binaries.
>
> The desktop which performs faster is Core 2 Duo 2.4 GHz with 2 GB of RAM and
> running XP
> The Server runs 4 x 2.6 Dual core Opteron *with 16 GB of RAM running Windows
> 2003 Server Enterprise edition.
>
> What do you guys think of it ? TIA


Where is the source XML read from on your desktop, and on your server?
It could well be an I/O bottleneck, not a problem with your code as
such. Perhaps the server reads it from an SMB share, or something like
that?
 
Reply With Quote
 
Hermit Dave
Guest
Posts: n/a
 
      14th Jul 2008
Pavel,

Well the desktop is a standard ide / sata drive.
there server uses SAN.

I dont think its I/O issue as my team leader tried a hard way of assuming a
column exists for a given node (and catching any exceptions and handling it
correctly) and the server timings dropped to about 2 and half mins.
I guess it was something else. Maybe something in the code was being
optimised lot better for intel architecture

Regards,

Hermit

"Pavel Minaev" <(E-Mail Removed)> wrote in message
news:655c2d30-56bd-48fe-af4c-(E-Mail Removed)...
On Jul 12, 12:01 am, "Hermit Dave"
<hermitd.REM...@CAPS.AND.DOTS.hotmail.com> wrote:
> Hello All,
>
> I have an xml document which can contain massive amounts of data with no
> fixed child node sequence.
> We have some 9000 odd fields and any ones can be a part of the xml.
>
> The way i build the data table is that i process one record at a time and
> i
> keep a list of columns in hash table (with ordinal position) and in a
> string
> collection (for retrieval in correct order) and for each field, if the
> column does not exists, i add the column and for that ordinal position, i
> set the value in the data table.
>
> For xml doc contain 85000 odd records with say 5 child fields, on my
> desktop
> i can process the whole thing and display paged in list in 4 mins and 40
> secs, however on the server it takes around 12 mins.
> Same input file, same code, same binaries.
>
> The desktop which performs faster is Core 2 Duo 2.4 GHz with 2 GB of RAM
> and
> running XP
> The Server runs 4 x 2.6 Dual core Opteron with 16 GB of RAM running
> Windows
> 2003 Server Enterprise edition.
>
> What do you guys think of it ? TIA


Where is the source XML read from on your desktop, and on your server?
It could well be an I/O bottleneck, not a problem with your code as
such. Perhaps the server reads it from an SMB share, or something like
that?

 
Reply With Quote
 
 
 
Reply

Thread Tools
Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Time Sheet - Calculating Time Differences for Totals Kathy Microsoft Excel Misc 3 14th Jan 2010 11:04 PM
Linking to MASSIVE Sql Server DB Chris Vachio Microsoft Access Queries 3 8th Jan 2009 10:57 PM
Differences in error handling between Design-Time and Run-Time =?Utf-8?B?Y2hvcHNuc2F1Y2U=?= Microsoft VB .NET 0 26th Jul 2005 06:51 PM
Remote desktop - differences between XP SP1 and SP2 roman modic Windows XP General 1 26th Mar 2005 03:42 PM
System Time problem with Windows 2000 Desktop and Server Mr. Barnie Microsoft Windows 2000 0 10th Dec 2003 10:52 AM


Features
 

Advertising
 

Newsgroups
 


All times are GMT +1. The time now is 09:26 AM.