Study Guide for Exam 1 in Psychology 138
1. What is a nominal scale?
2. What is an ordinal scale?
3. What is an interval scale?
4. What feature distinguishes a ratio scale from
an interval
scale?
6. What is a sample?
7. What do we call descriptive characteristics of
a
population?
8. What do we call descriptive characteristics of
a sample?
9. Define "Central
Tendency"
10.
Define mean.
11.
Define mode.
12.
Define median.
13. What is the
only measure of central tendency that can be used for a nominal scale?
14.
What types
of central tendency can be used for an ordinal scale?
15.
For which
types of scales can the mean be used?
16.
What does
bimodal mean?
17.
In a
positively skewed distribution, which is higher, the mean or the median?
18.
What is the
best measure of central tendency for a skewed distribution?
19.
Why is the
mean a poor measure of central tendency for a skewed distribution?
20.
If many
samples are selected at random from the same population, which measure
of
central tendency is likely to be the most stable from sample to sample?
21.
If many
samples are selected at random from the same population, which measure
of
central tendency is likely to be the least stable from sample to sample?
22.
Define
variability.
23.
Define
outlier.
24.
Name 3
reasons why measures of variability are important.
25.
Define range
26.
Name
2 disadvantages of using the range as a measure of variability.
27.
Define
variance.
28.
Why is the
formula for variance different depending on whether the distribution is
a
population or a sample?
29.
Define
standard deviation.
30.
What is the
relationship between the standard deviation and variance?
31.
Name 2 advantages of using the standard deviation instead of the range
as a measure of
variability.
32.
Define z-score
33.
What is the
mean and standard deviation of a set of z-scores?
34.
Define
correlation coefficient
35.
A professor
mentions in lecture that income and vocabulary ability have a
correlation
coefficient of 3.2. What is the proper thing to do?
36.
If the
correlation between achievement motivation and actual income earned is
0, what
can we conclude about the relationship between these 2 variables?
37.
If violent
behavior and anxiety are negatively correlated, are highly violent
people more
likely to be anxious than less violent people?
38.
If income
and happiness are positively correlated, describe 3 possible patterns
of how
income and happiness are causally related.
39. If the
correlation between variables X and Y is .45, what is the correlation
between Y
and X?
40.
Pretend that
a researcher finds that the correlation between height and number of
baskets
per game for NBA starting players is very low. The researcher announces
that
height confers little advantage in the sport of basketball. Assume that
the
researcher's conclusion is not correct. Why might the researcher have
obtained
such a low correlation?
41.
A researcher
found that, in a sample of children and teenagers, vocabulary knowledge
is highly correlated
with age.
The researcher concludes that because vocabulary is correlated with
age,
elderly people must have the best vocabulary. What is wrong with this
conclusion?
42.
A researcher
found that in her small sample of 10 people, the 2 variables that she
hypothesized would correlate, were not correlated at all. She decides
to
include 1 more person in the sample. Now, with 11 people, the 2
variables were
highly correlated. Why should this researcher not trust her results?
43.
Define
regression
44.
In the
regression equation (Y = b0 + b1X), what does b1 represent?
45.
In the
regression equation (Y = b0 + b1X), what does b0 represent?
46.
In
regression, for any particular value of X, what do we call the
difference
between Y and the predicted value of Y?
47.
What do we
call the typical distance between the observed Y values and the
regression
line?
48.
If IQ
explains 36% of the variance in Academic Achievement, what is the
correlation
between IQ and Academic Achievement? (Assume that the correlation is
positive.)
49.
If the
correlation between the personality trait of psychopathy (i.e., gross
violations of the basic rights of others coupled with the inability to
feel
remorse) and anxiety is -0.35. What percentage of the variance in
psychopathy
is explained by anxiety?
50.
With an IQ
of 120, you were probably one of the smartest people in your high
school. The
mean IQ in your high school was 90 and the standard deviation was 10.
What
percentage of people had the same or lower IQ as you did?
51.
Then you
went to a selective university where the mean IQ was 115 and the
standard
deviation was 13. What percentage of people at your university had your
IQ or
above?
52.
Now you go
to graduate school where the mean IQ is 130 and the standard deviation
is
8.What percentage of the graduate students has your IQ or lower?
53.
Consider the
following scores for X and Y
X
Y
2
4
5
20
6
12
12
20
10
14
Calculate the correlation between X and Y.
54.
Using the
data above, calculate the slope of the regression line.
55.
Using the
data above, calculate the Y-intercept of the regression line.
56.
Using the
regression equation from above, what would you estimate Y to be if X
equaled 9?
57.
Consider the
following population of scores: 4 1 12 -5 -10 22
Calculate
the
standard deviation.
58.
Consider the
following sample of scores: 1 3 5 6 5
Calculate
the
estimated variance of the population from which this sample was
selected.
59. In the
following distribution, what is the mode? 5 45 45 88 109
60.
In the
following distribution, what is the median? 1 2 3 1000000 1000001
61.
In the
following distribution, what is the median? 1 2 3 7 20000 2000012121
62.
In the
following distribution, what is the mean? 1 2 4 5
63.
What is the
range of this distribution? 4 6 5 7 9 25 -9
64.
Which of the
following is the strongest correlation? 0.5, 0.09, -0.3, 0, 0.7, -0.8,
0.29
Answers
Note: These answers need not be memorized word-for-word. You do, however, need to define the terms clearly and precisely.
1. What is a nominal scale?
A nominal
scale is a set of categories that have no numerical order.
2. What is an ordinal scale?
An ordinal
scale is a set of categories that have a numerical order (i.e., the
categories
can be ranked) but the distance between categories is not equal.
3. What is an interval scale?
An interval
scale is a set of ordered categories like an ordinal scale with the
additional
requirement that the numerical distance between categories is equal.
4. What feature distinguishes a ratio scale from
an interval
scale?
A ratio
scale has an absolute zero (i.e., the point at which there is none of
the quantity being measured).
5. What is a population?
The entire
set of participants (or objects) that are of interest to the research
question.
6. What is a sample?
A subset of
individuals intended to represent a population.
7. What do we call descriptive characteristics of
a
population?
Parameters
8. What do we call descriptive characteristics of
a sample?
Statistics
9.
Define Central Tendency
Central
Tendency
is a statistical measure that identifies a single score as a
representative for
an entire distribution or set of data.
10.
Define "mean."
The
mean is the
average score in a distribution.
11.
Define "mode."
The
mode is the
most frequently occurring score in a distribution.
12.
Define "median."
The
median is
the midpoint of the distribution.
Another good answer: The score that half of the
scores in
the distribution are above and half are below
13. What is the
only measure of central tendency that can be used for a nominal scale?
Mode
14.
What types
of central tendency can be used for an ordinal scale?
Mode
and Median
15.
For which
types of scales can the mean be used?
Interval
and
Ratio
16.
What does
bimodal mean?
The
distribution
has 2 modes.
17.
In a
positively skewed distribution, which is higher, the mean or the median?
The
mean (NOTE:
The mean is always closer than the median to the tail of a skewed
distribution.)
18.
What is the
best measure of central tendency for a skewed distribution?
The
median
19.
Why is the
mean a poor measure of central tendency for a skewed distribution?
A skewed distribution has outliers in the tail. Outliers can make the
mean unrepresentative of the distribution as a whole.
20.
If many
samples are selected at random from the same population, which measure
of
central tendency is likely to be the most stable from sample to sample?
The
mean
21.
If many
samples are selected at random from the same population, which measure
of
central tendency is likely to be the least stable from sample to sample?
The
mode
22.
Define
variability.
Variability
is a
measure of the degree to which scores in a distribution are spread out
or
clustered together.
23.
Define
outlier.
Outliers
are
extreme scores in a distribution.
24. Name 3 reasons why measures of variability are important.
Here
are good answers:
25.
Define range
The
range is the
difference between the highest score and the lowest score in the
distribution.
26.
Name
2 disadvantages of using the range as a measure of variability.
1.It
does not take into account the variability of all of the scores in the
distribution.
2.It
is extremely sensitive to outliers.
27.
Define
variance.
Variance
is the
mean squared deviation from the mean of the distribution.
28.
Why is the
formula for variance different depending on whether the distribution is
a
population or a sample?
Using
the same
formula would mean that the sample variance would underestimate the
population variance. The slightly different formula corrects the
underestimation.
29.
Define
standard deviation.
The
standard
deviation is the typical distance of a set of scores from their
mean.
30.
What is the
relationship between the standard deviation and variance?
The
standard
deviation is the square root of the variance.
Another answer: The variance is the standard deviation squared.
31.
Name 2 advantages of using the standard deviation instead of the range
as a measure of
variability.
1. All of the scores in the distribution are used to calculate it.
2. It is less sensitive to outliers. (NOTE: The standard deviation is
still sensitive to outliers, just not as much as the range
32.
Define z-score
A z-score is a standard score that indicates how many standard
deviations a raw score deviates
from
the mean.
33.
What is the
mean and standard deviation of a set of z-scores?
The
mean is 0 and
the standard deviation is 1.
35.
A professor
mentions in lecture that income and vocabulary ability have a
correlation
coefficient of 3.2. What is the proper thing to do?
Stand
up and
denounce the professor as a fraud. Shout as loud as you can, "You FOOL!
Don't
you know that the correlation coefficient has a range of -1 to 1 and
that values
outside that range are IMPOSSIBLE?!!!!"
Okay, maybe that's a bit dramatic. Maybe just inform the professor
later that there must be some mistake because correlations can only
range from -1 to 1.
36.
If the
correlation between achievement motivation and actual income earned is
0, what
can we conclude about the relationship between these 2 variables?
They
do not have
a linear relationship.
37.
If violent
behavior and anxiety are negatively correlated, are highly violent
people more
likely to be anxious than less violent people?
No.
39.
If the
correlation between variables X and Y is .45, what is the correlation
between Y
and X?
.45
40.
Pretend that
a researcher finds that the correlation between height and number of
baskets
per game for NBA starting players is very low. The researcher announces
that
height confers little advantage in the sport of basketball. Assume that
the
researcher's conclusion is not correct. Why might the researcher have
obtained
such a low correlation?
Any
time there
is a restriction of range in 1 or both variables, the correlation might
be
misleading. In this case, almost all players are in the NBA are very
tall. If
the NBA included very short players, the correlation would probably be
much
higher.
41.
A researcher
found that, in a sample of children and teenagers, vocabulary knowledge
is highly correlated
with age.
The researcher concludes that because vocabulary is correlated with
age,
elderly people must have the best vocabulary. What is wrong with this
conclusion?
It
is improper
to generalize a conclusion beyond the ranges that were sampled (e.g.,
Just because
age and
vocabulary are correlated in teenagers does not necessarily mean that
they are
correlated at other age ranges.).
42.
A researcher
found that in her small sample of 10 people, the 2 variables that she
hypothesized would correlate, were not correlated at all. She decides
to
include 1 more person in the sample. Now, with 11 people, the 2
variables were
highly correlated. Why should this researcher not trust her results?
The
11thperson
was probably an outlier, possibly yielding misleading results.
43.
Define
regression
Regression
is a
statistical procedure that finds the best fitting line through a set of
data.
44.
In the
regression equation (Y = b0 + b1X), what does b1 represent?
The slope of the best fitting line in the scatterplot of X and Y.
45.
In the
regression equation (Y = b0 + b1X), what does b0 represent?
The
Y-intercept of the best fitting line in the scatterplot of X and Y.
Another answer: The estimated value of Y when X equals 0.
46.
In
regression, for any particular value of X, what do we call the
difference
between Y and the predicted value of Y (Y-hat)?
Error
47.
What do we
call the typical distance between the observed Y values and the
regression
line?
The
standard
error of the estimate.
48.
If IQ
explains 36% of the variance in Academic Achievement, what is the
correlation
between IQ and Academic Achievement? (Assume that the correlation is
positive.)
0.6
(NOTE: The
percentage of variance explained equals the correlation coefficient
squared.)
49.
If the
correlation between the personality trait of psychopathy (i.e., gross
violations of the basic rights of others coupled with the inability to
feel
remorse) and anxiety is -0.35. What percentage of the variance in
psychopathy
is explained by anxiety?
12.25%
(NOTE:
The percentage of variance explained equals the correlation coefficient
squared.)
50.
With an IQ
of 120, you were probably one of the smartest people in your high
school. The
mean IQ in your high school was 90 and the standard deviation was 10.
What
percentage of people had the same or lower IQ as you did?
Z =
(X-mu)/sigma
Z =
(120-90)/10
Z =
3.00
From
the Excel
worksheet,
you can calculate that 99.9% of the students had an IQ of 120 or less.
51.
Then you
went to a selective university where the mean IQ was 115 and the
standard
deviation was 13. What percentage of people at your university had your
IQ or
above?
Z =
(120-115)/13
Z =
.38
From
the Excel
worksheet,
you can calculate that 35% of the students had an IQ of 120 or
above.
52.
Now you go
to graduate school where the mean IQ is 130 and the standard deviation
is
8.What percentage of the graduate students has your IQ or lower?
Z =
(120-130)/8
Z =
-1.25
From
the Excel
worksheet,
you can calculate that 11% of the students have an IQ of 120 or
below.
53. Consider the following scores for X and Y
X
Y
2
4
5
20
6
12
12
20
10
14
Calculate the correlation between X and Y.
| X | Y | (X-μX) | (X-μX)2 | (Y-μY) | (Y-μY)2 | (X-μX)(Y-μY) | |
| 2 | 4 | -5 | 25 | -10 | 100 | 50 | |
| 5 | 20 | -2 | 4 | 6 | 36 | -12 | |
| 6 | 12 | -1 | 1 | -2 | 4 | 2 | |
| 12 | 20 | 5 | 25 | 6 | 36 | 30 | |
| 10 | 14 | 3 | 9 | 0 | 0 | 0 | |
| Sums | 35 | 70 | 0 | 64 | 0 | 176 | 70 |
| Means | 7 | 14 |
SSX
= 64
SSY = 176
SP = 70
r = 70 / sqrt(64 * 176) = .66
54.
Using the
data above, calculate the slope of the regression line.
b1 = SP/SSX = 70/64 = 1.09
55. Using the data above, calculate the Y-intercept of the regression line.

b0
= 14 - 1.09 * 7 = 6.36
56.
Using the
regression equation from above, what would you estimate Y to be if X
equaled 9?

16.17 = 6.36 + 1.09 * 9
57.
Consider the
following population of scores: 4 1 12 -5 -10 22
Calculate
the
standard deviation.
μ = ΣX/N = 24 / 6 = 4

SSX
=Σ(X - μ)2
SSX
=(4 - 4)2
+ (1 - 4)2 + (12 - 4)2 + (-5 - 4)2 + (-10 - 4)2 + (22 - 4)2 = 674
σ = √(674/6) = 10.60
58.
Consider the
following sample of scores: 1 3 5 6 5
Calculate the estimated variance of the population from which this sample was selected..
mean = ΣX/n = 20/5 = 4

SSX
=Σ(X - μ)2
SSX = (1 - 4)2 + (3 - 4)2 + (5 - 4)2 + (6 - 4)2 + (5 - 4)2 = 16
s2
=16 / (5 - 1) = 4
59.
In the
following distribution, what is the mode? 5 45 45 88 109
45
60.
In the
following distribution, what is the median? 1 2 3 1000000 1000001
3
61.
In the
following distribution, what is the median? 1 2 3 7 20000 2000012121
5
NOTE: Because 3
and7 are the middle scores, we average them.
62.
In the
following distribution, what is the mean? 1 2 4 5
3
63.
What is the
range of this distribution? 4 6 5 7 9 25 -9
34
NOTE: Range = 25 - (-9) = 34)
64.
Which of the
following is the strongest correlation? 0.5, 0.09, -0.3, 0, 0.7, -0.8,
0.29
-.8