On Aug 26, 7:11*pm, Registered User <n4...@ix.netcom.com> wrote:
> On Thu, 25 Aug 2011 21:46:58 -0700 (PDT), RayLopez99
>
> <raylope...@gmail.com> wrote:
> >I have a text file that has five pieces of information in it per
> >line: *ASCII text and numbers delimited by commas and a line break on
> >each line.
>
> >What is the best way to organize this data so I can analyse it? *
>
> What is wrong with the way the data is already organized?
I want to grab the data, mash it up with other data, etc etc etc.
Where to store said data--should it be stored on the hard drive as
ASCII text files? Or in a database? If a database I can do stuff
like SQL queries on Linq-to-SQL. If a text or XML file I cannot do
SQL queries but it's easier I guess to code.
>
> >The
> >files are hundreds of MB large, so loading them all into memory via
> >XML is impractical (I think--or is it? *I have a 32 bit system with 4
> >GB RAM). *
>
> http://lmgtfy.com/?q=how+do+computers+work
Save me the work bro. Please tell me how much (many) text files can I
load into 3.6 GB of memory? Please tell me. And btw I don't know how
you can monitor your memory in Visual Studio so that you know when you
are running out of RAM--I just will let the OS figure it out and swap
stuff, etc, but I have a feeling that's not optimal from a user's
point of view. Any pointers as to how to get a program to throttle
itself if it is running out of RAM on a PC the program is loaded on
would be greatly appreciated.
My thanks in advance.
>
> >Better to read them "serially" using XML? *
>
> XML? You'll have to read the CSV files to create the XML files. Then
> you'll have to read the XML files before performing any analysis.
> Unless changing the format of the data adds value why bother?
See the above. I'm thinking (out loud) that perhaps changing the
format might add value. For example I'm thinking of converting the
string to a binary representation so I can manipulate it faster, but
perhaps string/ Stringbuilder is fast enough...I'll have to
experiment. But the main thing is that I think it might be helpful to
get the text data into a database so I can run SQL and/or Linq-to-SQL
queries.
>
> >Or better to set up a database that can be queried?
>
> >Which is better--or is there an even better way? *I'm leaning towards
> >a database in SQL Server.
>
> A transactional database might be suitable for layout-led analysis,
> If the intent is to extract business intelligence by data-led
> discovery use SQL Server Analysis Services and an OLAP database.
> Import the CSV data using SQL Server Integration Services. SQL Server
> Reporting Services is another valuable tool.
OK. Is that **** free? I'm a penniless student for purposes of this
project.
>
> >I plan to code in C# using Visual Studio 2010.
>
> Everything can be done using SQL Server's Business Intelligence Design
> Studio.
>
> regards
> A.G.
Thank you A.G.
RL