Review for the Final Examination

This final examination will cover the entire semester.  There will be multiple choice and statistical questions.  Please bring a calculator and a pencil with an eraser.  Do NOT bring the Guide to Computing Descriptive Statistics or the Guide to Computing Margins of Error.  Fresh copies of these will be included with the exam.

Multiple Choice Questions.  The best way to review for the multiple choice questions is to study the Review Glossary at the end of each chapter.  You may also find it useful to read over the lecture notes.  It is also a good idea to review the questions on the first two examinations, which have been returned to you.

Statistical Questions. These questions, also, will be similar to those on the first two exams.  If you did not get these questions on the first two examinations, make sure you learn how to solve them for the final.  The best way to do this is to work some problems, such as those on this review page.  The answers will be posted, but it is best to consult them after you have worked the problems yourself.

Percentage Questions.
Consider the following data:

45 men support Schundler    25 women support Schundler
35 men support McGreevy     80 women support McGreevy

To answer these questions, you should first put the data in a table and compute the row and column totals.

                   Men            Women    Total

       Schundler   45             25       70

       McGreevy    35             80       115

       Total       80             105      185
 
 

1.    What percentage of the men support Schundler?  The number of men supporting Schundler is 45, the total number of men is 80, so the answer is 45/80 *100 or 56.3%.  Always give three significant digits
2.    What percentage of the Schundler supporters are men?  This one is the number of men supporting Schundler divided by the number of Schundler supporters  45/70 * 100 or 64.3%
3.    What percentage of the women support McGreevy?  76.2%
4.    What percentage of the McGreevy supporters are women?  69.6%

Expected Frequency Questions.
5.    If there were no relationship between gender and candidate preference, how many man would we expect to support Schundler?  Note: this questions asks for a frequency [the "expected frequency"], not for a percent.  The easiest way to compute an expected frequency is to multiply the row total for the cell in question by the column total for that cell and divide by the grand total.  In this case, the rot total is 70, the column total 80 and the grand total 185.  Answer 30.3.  If you make this a percent, 30.3%, it is wrong, since this is a frequency, not a percent.
6.    If there were no relationship between gender and candidate preference, how many women would we expect to support Schundler?  39.7

To do expected frequencies, you start with the same table.   There are three ways to do them, one is shown in red, another - the easiest, in blue.
                   Men            Women    Total

       Schundler   45 .378*80=30.24 25       70   .378 (this is the "row proportion")
One way to get the expected frequencies is to compute the row proportions and multiply them by the column totals.  Or you could compute the column proportions and mutiply them by the row totals.
       McGreevy    35 .622*80      80      115   .622

       Total       80             105      185   -

The easiest way to get expected frequencies is to multiply the row total associated with the cellin question with the column total associated with that cell and divide by the grand total.  For example, for the cell giving frequencies for the men voting for Schundler the expected frequency would be (70 * 80)/185 = 30.27 the grand total, 185, is the sum of both the row and the column totals.  Note that the result is slightly different because .378 is rounded off.  The two methods are arithmetically equivalent, the second involves less rounding and less risk of error.
 

Descriptive Statistics Questions.  These are explained in the Guide to Computing Descriptive Statistics.
7.   Five students make the following scores on this test: 85, 65, 92, 78, 51
8.   What is the mean score?  The sum of the scores is 371, N is 5, so the mean is 74.2.
9.   What is the median score?  The median is the one in the middle, but only after you arrange the scores in numerical order:  51, 65, 78, 85, 92.  The median is 78.
10.  What is the standard deviation of the scores?  To compute the standard deviation, follow the instruction in the Guide to Computing Descriptive Statistics.  Answer:  16.4

Here are the computations for the standard deviation, following the Guide to Computing Descriptive Statistics.
 
 
X = Each Score The Mean X - the Mean X-the mean squared
85 74.2 10.8 116.64
65 74.2 -9.2 84.64
92 74.2 17.8 316.84
78 74.2 3.8 14.11
51 74.2 -23.2 538.24

7.  Now add up the fourth column, this is the Sum of Squares.  Write it here:  . 1070.78 .

8.  Divide the sum of squares by the number of cases, minus one (N-1)  N = 5 since their are five cases, no N-1=4, 1070.78/4 = 267.7.  This is the variance.  Write it here: .   267.7  .

9.  The standard deviation is the square root of the variance.  Write it here: .   16.4  .

Sampling Questions.  Follow the instructions in the Guide to Computing Margins of Error.
11. In a college class with 125 students, 48 of whom are male, the mean on the final exam was 79. The standard deviation was 7. What is the margin of error for this mean? This is a mean score question, so we use Formula 4.  2 * 7/sqrt(125) = 1.25
12. What would the lower bound of the 95% confidence interval be for this mean?  Subtract the Margin of Error from the Mean, 79 - 1.25 = 77.8.  Note, this is not 77.8% unless you assume the test scores are percentage scores.
13. A researcher wants to obtain a margin of error of no more than 10% in a survey of a county with a population of 30,000. How large a sample is needed?  This is a sample size question, so we use Formula Three, 1/(.10*.10)= 100
14. 60% of the Republican respondents in a survey of a state with seven million Republican voters voted for Bush, 40% for Gore. There were 1625 respondents, of whom 1000 were Democrats, 600 Republicans and 25 other. What is the margin of error for the percent voting for percent of Republicans voting for Bush?  This question asks about the Republicans, so N = 600.  We are given a percent, 60%, so we use Formula Two.  Answer 4.0%
15. A survey is to be conducted of attitudes among white and nonwhite respondents in Camden County. The population is 300,000. Of this population, 80% is white, 20% nonwhite. The researcher wants to achieve a 6% margin of error for the estimates for each of the groups. How large a sample is needed?  A 6% margin of error requires 278 people, we need 278 from each of two groups, so the answer is 556.
16. In a survey of community residents, the mean income was $38,745 with a standard deviation of $1,345. There are 370,000 residents in the community. 100 were sampled in the survey. What is the margin of error for this mean score?  $269
17. What would the lower bound of the confidence interval be for this mean?   $38,745 - $269 = $38,476.
18. What would the upper bound of the confidence interval be?  $38,745 + $269 = $39,014.
19. A survey of the South Jersey respondent had 200 male respondents and 275 female respondents.  110 men and 250 women voted for Gore.  What is the margin of error for the percentage voting for Gore in this survey?  For this one, you have to compute the percents yourself!  There are 475 respondents and 360 voted for Gore, so 75.8% voted for Gore.  We use Formula Two with N = 475 and p = .758  = 3.93%
20. What is the margin of error for the percentage of women voting for Gore?  Here N is 275 and p = .909 = 3,47%

Regression questions.
The following variables were used in a regression analysis:
AUTO THEFT - MOTOR VEHICLE THEFTS PER 100,000
POV LINE - PERCENTAGE OF POPULATION BELOW THE POVERTY LEVEL
DENSITY - POPULATION PER SQUARE MILE
The results were as follows:

Analysis of Variance
Dependent Variable: AUTO THEFT
N: 50          Missing: 0
Multiple R-Square = 0.070     Y-Intercept = 250.561
Standard error of the estimate = 188.588
LISTWISE deletion (1-tailed test)     Significance Levels: **=.01, *=.05
 Source Sum of Squares DF Mean Square F Prob.
 REGRESSION 125249.515 2 62624.757 1.761 0.183
 RESIDUAL 1.672e+06 47 35565.585
 TOTAL 1.797e+06 49
         Unstand.b Stand.Beta Std.Err.b t
POV LINE 11.349     0.195     8.382     1.354
DENSITY   0.179     0.224     0.115     1.555

21.  What are the independent variables in this last analysis?  POV LINE and DENSITY

22.  What is the dependent variable in this analysis? AUTO THEFT

23.  Which variable is the best predictor of the dependent variable?  The best predictor is the independent variable with the highest Standardized Bets.  DENSITY is highest with a Stand.Beta of .224

24.  What percentage of the variance in the dependent variable is explained in this analysis?  This is the Multiple R-Square converted to a percentage, 7.0%

25. Fill in the blanks in this formula:  Here we fill in the intercept and the undstandardized betas AUTO THEFT = 250.561 +   ( 11.349 * POV LINE) + ( 0.179   * DENSITY)

26. What would the predicted rate of auto theft be for a state with 15% of its population below the poverty line and 200 people per square mile?   Here we use the formula in the preceeding question, substituting the values for the independent variables.  We leave them in the original unit of measurement that was used to compute the equation, % below the poverty line and number of people per square mile.  AUTO THEFT = 250.561 +   ( 11.349 * 15 ) + ( 0.179   * 200 ) = 456.60.
27. What would the predicted rate of auto theft be for a state with 10% of its population below the poverty line and 200 people per square mile?  399.85
28. What would the predicted rate of auto theft be for a state with 5% of its population below the poverty line and 100 people per square mile?  325.2
29. What would the predicted rate of auto theft be for a state with 9% of its population below the poverty line and 180 people per square mile?  384.92
30.  On average, how much does the auto theft rate go up for each one percent increase in the poverty rate?  11.349
31.  On average, how much does the auto theft rate go up for each 10 additional people per square mile? 1.79 (the regression coefficient is .179, but this is for one unit of DENSITY.  The questions asks for 10, so we multiply it by 10.