PC Review


Reply
Thread Tools Rate Thread

how i calculate pearson correlation coefficient for excel graph

 
 
James Silverton
Guest
Posts: n/a
 
      18th Sep 2006
"Jerry W. Lewis" <post_a_reply@no_e-mail.com> wrote in message
news:BCD1076E-F457-423A-A454-(E-Mail Removed)...
> The numerical problems with the pre-2003 Pearson algorithm are
> the same as
> with the pre-2003 StDev, Rsq, Slope, etc. They have been
> discussed in the
> statistical literature for over 40 years and in these
> newsgroups for over 10.
> You might find
> http://groups.google.com/group/micro...a03470e7a1c650
> to be useful.
>
>


I did the numerical testing because, as I mentioned, the
information available in HELP was rather obtuse. Thank you for a
most enlightening reply!

I must admit that in the days when I relied on statistical
tests for hypothesis testing, I did not use functions from
general programs like Excel, nor indeed use Pearson's test.
These days, my data is such that trends are visible even from
graphical display. As will be obvious, I'm a user rather than a
theorist but it's always comforting to know the limitations of
calculation formulae:-)

Have the statistical formulae in Excel 2003 on been rigorously
tested? As you remarked in the reference, it is amazing how long
Microsoft kept using unstable calculation techniques

--
James Silverton
Potomac, Maryland

 
Reply With Quote
 
 
 
 
=?Utf-8?B?SmVycnkgVy4gTGV3aXM=?=
Guest
Posts: n/a
 
      18th Sep 2006
"James Silverton" wrote:

> ... Thank you for a
> most enlightening reply!


You're welcome, glad it helped.

> Have the statistical formulae in Excel 2003 on been rigorously
> tested? As you remarked in the reference, it is amazing how long
> Microsoft kept using unstable calculation techniques


Univariate statistics functions (StDev, StDevP, Var, VarP) worked fine out
of the box. Bivariate Statistics functions (Slope, Intercept, SteYX, etc)
could produce incorrect results with non-numeric or empty cells in the data
range; that was fixed in a March 2004 patch (as was a bug in the new algorith
for RAND).

When LINEST 2003 estimates a parameter to be exactly zero, the estimate
should be distrusted unless confirmed by alternate calculations; this is
fixed in 2007 beta.

Probability calculations (other than the much improved 2003 NORMDIST and
NORMINV) remain inadequate. ...INV functions do a better job of inverting
....DIST functions, but continuous distribution functions (...DIST) continue
to have limited accuracy and continue to become totally inaccurate in at
least one tail. Algorithms were changed for discrete distributions
functions, but they introduce new numerical problems that are still not fixed
in 2007 beta. The gold standard for probability calculations remains Ian
Smith's library
http://members.aol.com/iandjmsmith/examples.xls
which is actually better (both accuracy and working range) than any other
double precision implementation that I am aware of, (including dedicated
statistics packages and math libraries).

Jerry
 
Reply With Quote
Reply

Thread Tools
Rate This Thread
Rate This Thread:

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are Off


Similar Threads
Thread Thread Starter Forum Replies Last Post
Correlation-Coefficient daniel chen Microsoft Excel Misc 3 31st Jan 2006 07:59 AM
spearman correlation coefficient =?Utf-8?B?TWFyY2VsIExhYm9udMOp?= Microsoft Excel Misc 1 12th May 2005 04:00 AM
Correlation Coefficient Issue Diane Microsoft Excel Worksheet Functions 8 17th Dec 2004 03:48 AM
Correlation coefficient (r) Phil C Microsoft Excel Charting 3 14th Aug 2004 04:27 AM
Correlation coefficient for log plot.. Phil C Microsoft Excel Charting 2 6th Aug 2004 08:34 AM


Features
 

Advertising
 

Newsgroups
 


All times are GMT +1. The time now is 06:38 PM.