what is correlation
consideration of whether there is any relationship or association between two variables
describe the correlation model
define correlation analysis
a statistical tool used to study the closeness of the relationship between two or more variables.
what is the correlation matrix
presents correlation coefficients among a group of variables
what is the correlation coefficient
the index which defines the strength of association between two variables
can be used to predict the value of one of the variables using another if a relationship exists
to determine relationship random samples must be taken from both sets of the two variables. this data is known as bivariate data
what is the basic rule for determining a relationship betw/ two variables
what is a scatter diagram
a diagram thgat shows the relationship between two variables by plotting the x,y pairs
independant values (x) are plotted on x axis
dependant values (y) are plotted on y axis
the coordiate of the two points form a correlation on the graph
what is the pearson correlation coefficient (p)
A population parameter that measures the degree of association betw/ 2 varialbes
list the 5 correlation assumptions
what is bivariate normal distibution
the joint normal distribution of X&Y
inferencial values can only be taken from normal joint x,y distro(bivariate)
no inferences can be made from non normal distrubutions although descriptive means can be used
five parameters of BIVARIATE DISTRUBUTION
σx : σy: standard deviations of each data set
µx µy : means for each data set
p: correlation coefficient= measures strength of X&Y
what is the pearson coefficient
coeffecient used to asses the straight line assoc betw/ x & y and requires interval or ratio values
symbol for the sample correlation coefficient is r,
correlation varies from negative one to positive one (–1 r +1).
r-1 is perfect negative x,y relationship
r+1 is perfect positive x,y relationship
r=1 is a straight line
what is pearson product moment correlation
numerical measure of the degree of association between two variables
pearson product moment correlation continued
list the types of correlations
confidence interval for pearson’s correlation
aka
Fisher’s r-to-z transformation
Fisher developed a transformation of r that tends to
become normal quickly as N increases.
used to conduct tests of r and calc CI
z=0.5ln (1+r/1-r)
z=+/- (criterionz) x (standard deviation)
criterion z=1.96 in case of 95% ci
can be used to calc upper and lower limits
what method tests for the statistical significance of a correlation coeficient
based on a t-test that evalutes the H0 that p =0 in the population
what is the phi coefficient
its a product—moment coefficient of correlation variation of Pearson’s definition of r when the two states of each variable are given values of 0 and 1 respectively.
purpose of phi’s coefficient
designed for the comparison of truly dichotomous distributions ( only have 2 points on their scale for an unmeasurable attribute) i.e nominal values
aka known as the YUTE (φ)
relates to 2x2 tables
often used in psychoological and educational testing d/2 freq of applying dichotomy onto a continuous variable and PASS/ FAIL categories are found based on a threshold score
what is YULE’S Q and what is it used for
: a nominal measure of association used to determine the association betw/ variables
or
the ratio of dx betw/ the products of diagonal cell freq and the sum of products of diagonal cell frequencies
6 benefits of YULES Q
4 cons of YULE’S Q
what is spearman’s rank order coefficient psp
conditions for spearman’s rho (1904) and kendall’s tau (1938)