Module 27 – Linear Regression Lab (11 of 13)
Started: Apr 19 at 10:05pm
Quiz Instructions
In this activity, you will use StatCrunch and embed your results in an essay question. The essay questions are not automatically graded; your instructor will enter the points for these questions later. WARNING: you will need to enter your response to each essay question with every attempt. Your instructor will only grade the essay for your attempt with the highest total score for the automatically graded questions.
Recall the Question & Comments Tip for Success (feels like cheating but it’s not). If you have not earned 100% after your 2nd attempt on this quiz, use the discussion board below to ask for help. Give your peers or me some time to respond. Then return to the discussion board, review the responses, and submit your 3rd attempt. It’s o.k. if you submit your 3rd attempt on a quiz or checkpoint after the due date.
Progress Check
Use this activity to assess whether you can:
- Use StatCrunch to graph a scatterplot with its least-squares regression line and to simultaneously produce the equation of the regression line along with its correlation coefficient, r.
- Identify the x with the largest absolute prediction error.
- Explain why a given data point is an outlier.
Module 27 Discussion Board
Use the Module 27 (opens in a new tab) to ask questions or provide feedback about the problems in any Module 27 activity – including this lab.
Content by Cuyamaca College math faculty and licensed under the .
Learn by Doing
Some features of this activity may not work well on a cell phone or tablet. We highly recommend that you complete this activity on a computer.
A list of StatCrunch directions is provided at the bottom of this text-box.
Context
The modern Olympic Games have changed dramatically since their inception in 1896. Are athletes getting better? We will use regression to investigate the change in winning times for one eventthe men’s 1,500 meter race.
Variables
Year: the year of the Olympic Games, from 1896 to 2000.
Time: the winning time for the 1,500 meter race, in seconds.
Since the winning time depends on the year, the Year since 1896 is the explanatory variable, and the Winning time is the response variable.
Data
The olympics datafile includes the winning times for the men’s 1,500-meter race since 1896. Open the olympics datafile in the Stats at Cuyamaca College group on StatCrunch ( – opens in a new tab).
Prompt
In the first two questions below, you will use StatCrunch to produce and examine the scatterplot for the olympics datafile. You will also use StatCrunch to find the regression equation and correlation coefficient.
List of StatCrunch Directions
As you work through numbers 1) and 2) below, refer back to these StatCrunch directions when you need a quick reminder.
- (do not take screenshots; please use these directions)
- (do not post attachments; please use these directions)
- (All at the same time!)
Question 13 pts
If you have not done so, open the olympics datafile in the Stats at Cuyamaca College group on StatCrunch ( – opens in a new tab).
- Using the year since 1896 as the explanatory variable and the winning time as the response variable: graph the scatterplot with the regression line and produce the regression equation with the correlation coefficient – all at the same time ()
- Toggle to the output page with the scatterplot and regression line. Notice that the data has a strong linear association, so it makes sense to use linear regression. (Always check the form of the scatterplot before using linear regression.)
- Download the StatCrunch output page with your scatterplot and regression line graphed together. ()
- Embed the .png file for your scatterplot and regression line in the text-box below. ()
p
Question 23 pts
These directions assume you have produced the Simple linear regression results in a multipage StatCrunch output window. If not please see the previous question.
- Toggle to the StatCrunch output page with the regression equation, correlation coefficient, and other statistics.
- Under the heading Simple linear regression results, copy and paste the first five lines (dependent variable, independent variable, linear equation, sample size, and R) into the text-box below. ()
p
Question 32 pts
For which of the years 1900, 1940, or 2000 is the absolute prediction error the largest?
Question 42 pts
For the year 1896, the winning time for the men’s 1500-meter race is an outlier. In what ways is this data point an outlier? Check all that apply
Leave a Reply
You must be logged in to post a comment.