Correlation
 

Home Introduction Methods Samples Review

 

 

 

 

Case study
Questionnaire
Correlation
Experiment

This section describes the correlation research method. There are several sections on this page:

Purpose

Direction: positive correlation & negative correlation

Strength

Advantage & disadvantage

 

Purpose

The correlation is a way to measure how associated or related two variables are. The researcher looks at things that already exist and determines if and in what way those things are related to each other. The purpose of doing correlations is to allow us to make a prediction about one variable based on what we know about another variable.   

For example, there is a correlation between income and education. We find that people with higher income have more years of education. (You can also phrase it that people with more years of education have higher income.) When we know there is a correlation between two variables, we can make a prediction. If we know a group’s income, we can predict their years of education. 

Back to top

 

Direction 

There are two types or directions of correlation. In other words, there are two patterns that correlations can follow. These are called positive correlation and negative correlation.

Remember that in a correlational study, the researcher is measuring conditions that already exist. She or he is asking questions of a sample of participants, and finding out in what way pairs of variables are related. For example, a researcher could ask about the participants’ yearly income and years of education, to see if those two attributes are correlated.

Positive correlation

In a positive correlation, as the values of one of the variables increase, the values of the second variable also increase. Likewise, as the value of one of the variables decreases, the value of the other variable also decreases. The example above of income and education is a positive correlation. People with higher incomes also tend to have more years of education. People with fewer years of education tend to have lower income.

Here are some examples of positive correlations:

1. SAT scores and college achievement—among college students, those with higher SAT scores also have higher grades

2. Happiness and helpfulness—as people’s happiness level increases, so does their helpfulness (conversely, as people’s happiness level decreases, so does their helpfulness)

This table shows some sample data. Each person reported income and years of education.

Participant Income Years of Education
#1 125,000 19
#2 100,000 20
#3 40,000 16
#4 35,000 16
#5 41,000 18
#6 29,000 12
#7 35,000 14
#8 24,000 12
#9 50,000 16
#10 60,000 17

In this sample, the correlation is .79.

We can make a graph, which is called a scatterplot. On the scatterplot below, each point represents one person’s answers to questions about income and education. The line is the best fit to those points. All positive correlations have a scatterplot that looks like this. The line will always go in that direction if the correlation is positive.

   

Scatterplot positive correlation

 

Back to top

   

Negative correlation

In a negative correlation, as the values of one of the variables increase, the values of the second variable decrease. Likewise, as the value of one of the variables decreases, the value of the other variable increases.

This is still a correlation. It is like an “inverse” correlation. The word “negative” is a label that shows the direction of the correlation.

There is a negative correlation between TV viewing and class grades—students who spend more time watching TV tend to have lower grades (or phrased as students with higher grades tend to spend less time watching TV).

Here are some other examples of negative correlations:

1. Education and years in jail—people who have more years of education tend to have fewer years in jail (or phrased as people with more years in jail tend to have fewer years of education)

2. Crying and being held—among babies, those who are held more tend to cry less (or phrased as babies who are held less tend to cry more)

We can also plot the grades and TV viewing data, shown in the table below. The scatterplot below shows the sample data from the table. The line on the scatterplot shows what a negative correlation looks like. Any negative correlation will have a line with that direction.

Participant GPA TV in hours per week
#1 3.1 14
#2 2.4 10
#3 2.0 20
#4 3.8 7
#5 2.2 25
#6 3.4 9
#7 2.9 15
#8 3.2 13
#9 3.7 4
#10 3.5 21

In this sample, the correlation is -.63.

 

  Scatterplot negative correlation

Back to top

 

 

Strength

Correlations, whether positive or negative, range in their strength from weak to strong.

Positive correlations will be reported as a number between 0 and 1. A score of 0 means that there is no correlation (the weakest measure). A score of 1 is a perfect positive correlation, which does not really happen in the “real world.” As the correlation score gets closer to 1, it is getting stronger. So, a correlation of .8 is stronger than .6; but .6 is stronger than .3.

The correlation of the sample data above (income and years of education) is .79.

Negative correlations will be reported as a number between 0 and -1. Again, a 0 means no correlation at all. A score of –1 is a perfect negative correlation, which does not really happen. As the correlation score gets close to -1, it is getting stronger. So, a correlation of -.7 is stronger than -.5; but -.5 is stronger than -.2.

Remember that the negative sign does not indicate anything about strength. It is a symbol to tell you that the correlation is negative in direction. When judging the strength of a correlation, just look at the number and ignore the sign.

The correlation of the sample data above (TV viewing and GPA) is -.63.

 

Imagine reading four correlational studies with the following scores. You want to decide which study had the strongest results:

-.3  -.8   .4    .7

In this example, -.8 is the strongest correlation. The negative sign means that its direction is negative.

Back to top

 

 

Advantage

1. An advantage of the correlation method is that we can make predictions about things when we know about correlations. If two variables are correlated, we can predict one based on the other. For example, we know that SAT scores and college achievement are positively correlated. So when college admission officials want to predict who is likely to succeed at their schools, they will choose students with high SAT scores.

We know that years of education and years of jail time are negatively correlated. Prison officials can predict that people who have spent more years in jail will need remedial education, not college classes.

Back to top

 

Disadvantage

1. The problem that most students have with the correlation method is remembering that correlation does not measure cause. Take a minute and chant to yourself: Correlation is not Causation! Correlation is not Causation! I always have my in-class students chant this, yet some still forget this very crucial principle.

We know that education and income are positively correlated. We do not know if one caused the other. It might be that having more education causes a person to earn a higher income. It might be that having a higher income allows a person to go to school more. It might also be some third variable.

A correlation tells us that the two variables are related, but we cannot say anything about whether one caused the other. This method does not allow us to come to any conclusions about cause and effect.

Back to top

Read a correlation sample.

 

Last updated 03/29/2002

© 2002 Elizabeth Lanthier, Ph.D.

email elanthier@nvcc.edu