| 
 | 
 | 
The correlation coefficient is a quantity which gives the quality of a Least Squares Fitting to the original
data.  To define the correlation coefficient, first consider the sum of squared values 
, 
, and
 of a set of 
 data points 
 about their respective means,
| (1) | |||
| (2) | |||
| (3) | 
| (4) | 
| (5) | 
| (6) | 
| (7) | 
The correlation coefficient 
 (sometimes also denoted 
) is then defined by
![]()  | 
(8) | 
| (9) | 
The correlation coefficient has an important physical interpretation.  To see this, define
| (10) | 
| (11) | |||
| (12) | |||
| (13) | |||
| (14) | 
![]()  | 
|||
![]()  | 
(15) | 
| (16) | 
| (17) | |||
![]()  | 
(18) | 
![]()  | 
|||
![]()  | 
(19) | ||
![]()  | 
|||
| (20) | 
| (21) | 
The square of the correlation coefficient 
 is therefore given by
| (22) | 
If there is complete correlation, then the lines obtained by solving for best-fit 
 and 
 coincide
(since all data points lie on them), so solving (6) for 
 and equating to (4) gives
| (23) | 
| (24) | 
The correlation coefficient is independent of both origin and scale, so 
| (25) | 
| (26) | |||
| (27) | 
See also Correlation Index, Correlation Coefficient--Gaussian Bivariate Distribution, Correlation Ratio, Least Squares Fitting, Regression Coefficient
References
Acton, F. S.  Analysis of Straight-Line Data.  New York: Dover, 1966.
 
Kenney, J. F. and Keeping, E. S.  ``Linear Regression and Correlation.''  Ch. 15 in Mathematics of Statistics, Pt. 1, 3rd ed.
  Princeton, NJ: Van Nostrand, pp. 252-285, 1962.
 
Gonick, L. and Smith, W.  The Cartoon Guide to Statistics.  New York: Harper Perennial, 1993.
 
Press, W. H.; Flannery, B. P.; Teukolsky, S. A.; and Vetterling, W. T.  ``Linear Correlation.''  §14.5 in
  Numerical Recipes in FORTRAN: The Art of Scientific Computing, 2nd ed.  Cambridge, England:
  Cambridge University Press, pp. 630-633, 1992.
 
| 
 | 
 | 
© 1996-9 Eric W. Weisstein