A
agarwaldvk
Hi All
I am developing an analytical model involving reasonably big datasets.
The source or the primary data is spread over say 10 to 12 worksheets
with each worksheet containing data in around 200000 to 250000 cells
spread over 50 to 60 columns and around 4000 rows. I have just almost
finished organising the source data and am about to start the
development of the processing bit.
A fair amount of processing involves searching on large ranges and
summing (possibly large ranges) of data conditionally upon meeting
various sets of conditions - a large proportion of these conditions are
more than likely going to be on multiple parameters. In essence, I am
looking at building a fairly calculation intensive model.
I am looking for advice as to which way should I be going to make the
processing as quick as possible realizing full well that no matter
which option I eventually take, I, reasonably, am looking at a
processing time in the vicinity of a couple of minutes.
WOULD I BE BETTER OFF USING :-
SUM(...)'s, SUM(IF(...)'s)'s and SUMPRODUCT(...)'s WITH INDEX() &
MATCH() functions (both normally entered or array entered as required)
OR
LOOKUP functions,
given MY PREFERNCE FOR AVOIDING USING HELPER COLUMNS TO STORE
TRANSITIONAL VALUES?
Any help/suggestions/tips would be greatly valued.
Best regards and thanks in advance!
Deepak Agarwal
I am developing an analytical model involving reasonably big datasets.
The source or the primary data is spread over say 10 to 12 worksheets
with each worksheet containing data in around 200000 to 250000 cells
spread over 50 to 60 columns and around 4000 rows. I have just almost
finished organising the source data and am about to start the
development of the processing bit.
A fair amount of processing involves searching on large ranges and
summing (possibly large ranges) of data conditionally upon meeting
various sets of conditions - a large proportion of these conditions are
more than likely going to be on multiple parameters. In essence, I am
looking at building a fairly calculation intensive model.
I am looking for advice as to which way should I be going to make the
processing as quick as possible realizing full well that no matter
which option I eventually take, I, reasonably, am looking at a
processing time in the vicinity of a couple of minutes.
WOULD I BE BETTER OFF USING :-
SUM(...)'s, SUM(IF(...)'s)'s and SUMPRODUCT(...)'s WITH INDEX() &
MATCH() functions (both normally entered or array entered as required)
OR
LOOKUP functions,
given MY PREFERNCE FOR AVOIDING USING HELPER COLUMNS TO STORE
TRANSITIONAL VALUES?
Any help/suggestions/tips would be greatly valued.
Best regards and thanks in advance!
Deepak Agarwal