Concept 4 – Statistics

Week 34 – Measurement and Distributions

As you enter the room, your height was measured. Add your data to the class data table.
• Examine the sets of data.  Record any initial observations you can make.
• Are there any measures that look very different from the others?  These extreme values are called outliers, because they are far away from most of the data.
• What is the range for the set of data?  What is the median?  What is the first quartile?  The third quartile?
8-5. Creating visual representations of data makes data sets easier to compare.
• Make a histogram for the set of data.  (How To)
• Create a Box-and-Whisker Plot (or box plot for short) for the set of data.(How To)
• Compare the center, shape, spread, and outliers for the two graphs (histogram and the box plot).  How are they similar? How are they different?
8-19. Mr. Sanders is just starting a round of golf.  This first hole is 130 yards long.  He needs to decide which club to use for his first shot.  He has kept careful records about how close his first shots came to the hole, all from this same distance.  His records include data from his use of two different golf clubs, a wedge and an 8-iron, over the past  year.
 Wedge, distance to hole (yards): 0    3    1    2   7   2   15  25   5   22 8-iron, distance to hole (yards): 19  12   12   8   3  11   5   7   10  13  8  10  11   20
a. Create a histogram and box plot for each of the clubs. Place the box plot above the histogram on the same number line for each club to make a combination plot.  Use a bin width of 5 yards. b. To find the “typical” distance that Mr. Sanders hits the ball from the hole with a wedge, is the mean or the median a better choice?  Find the typical distance Mr. Sanders hits the ball from the hole with a wedge and compare it to the typical distance he hits the ball with an 8-iron. c. Advise Mr. Sanders which club to use.  Explain your thinking.
8-20. It is just as important to consider the spread of the data as it is to consider the center when comparing data sets.
• Calculate the Interquartile Range (IQR) for each golf club in the previous problem.  With which club is Mr. Sanders more consistent?
• If Mr. Sanders decided to use the 8-iron, he could "typically" expect to hit the ball so that it lands between 8 and 12 yards from the hole.  This is indicated by the box on the box plot display and corresponds with the IQR.  If Mr. Sanders decided to use the wedge, what is a “typical” interval of distances from the hole he could expect the ball to land?
• Compare the typical interval of distances for the 8-iron with the interval you found for the wedge.
8-21. Mr. Klinck has only one more starting position available on his basketball team, but two students have tried out for it.  He wants to choose the student who is likely to score the most points. The two students from whom he can choose are described below.
• In her most recent games, Jana scored: 7  46  9  6  11  7  9  11  19  7  9  11  9  55  11 7 points.
• Alejandra scored 13  15  9  18  13  17  17  15 points.
• Which girl has the higher average (mean) number of points?
• Which student do you think Mr. Klinck should select and why?  Use parallel box plots (two box plots on the same number line) to support your explanation.
• Why was the mean not a good measure of the girl’s typical performance
• Calculate the IQR to measure the variability of each girl’s performance.
• Who had the higher median and by how many points?  How large is the difference between the medians measured by how many IQRs would fit into it?
8-22. Gregory and his sister shared a tablet at home. In order to be fair, they kept track of how much time they spent socializing with friends each day for the last two weeks. Their usage in minutes follows: Gregory: 49 48 51 52 68 40 73 68 61 60 69 55 51 59 His sister: 49 45 37 63 56 57 62 50 42 48 55 64 40 42
• Make a parallel box plot to compare their usage.
• Is their median usage notably different from each other?
• If the usage is notably different, how much more did one of them use the tablet than the other?

Assignments:

8-29.

As the social director of the Class Council, Ramin would like to survey a few students about their interests.

When Ramin analyzes the results from the survey, he wants to make claims about the interests of all of the students in his school.  If he were to survey only students on the Class Council, for example, it might be hard to make claims about what all students think.  Students who are on the Class Council may not have the same social interests as other students.  Consider this idea as you think about the samples described below.

1. If Ramin wanted to generalize the opinions of all students at his school, would it make sense to go to the grocery store and survey the people there?  Why or why not?

2. If he wanted to generalize the opinions of all students at his school, would it make sense to ask all of his friends at school?  Why or why not?

3. If he wanted to generalize the opinions of all students at his school, would it make sense to ask every third person who entered the cafeteria at lunch?  Why or why not?

8-30.

There are a variety of ways to choose samples of the population you are studying.  Every sample has features that make it more or less representative of the larger population.  For example, if you want to represent all of the students at your school, but you survey all of the students at school 30 minutes after the last class has ended, you are likely to get a disproportionate number of students who play school sports, attend after-school activities, or go to after-school tutoring.

1. If you ask the opinion of the people around you, then you have used a convenience sample.  If you took a convenience sample right now, what would be some features of the sample?  Would you expect a convenience sample to represent the entire student population at your school?  Why or why not?

2. If you email or create an online questionnaire then you have used a voluntary response sample.  What are some features of the people in a volunteer response sample?  Could it represent the sample of all of the students at school accurately?

3. You use a cluster sample if you first divide the students into smaller groups so that each of the smaller groups represents all of the students at your school.  Then you randomly select one or more of these groups to sample.  How might you divide the students at your school into groups that each represent the whole school?  Explain.  Are there any reasons that these clusters might not be fully representative of all the students at your school?

8-31.

From what population is each of these samples taken?  Write down the actual population for each of these sampling techniques.

8-32.

A study at the University of Iowa in 2008 concluded that children that play violent video games are more aggressive in real life.  Children ages 9 to 12 were studied to determine how much they played violent video games; peers and teachers were asked how much these students hit, kicked, and got into fights with other students.

1. Can you legitimately conclude from this study that teenagers who play violent video games tend to be more aggressive?  Why or why not?

2. Can you legitimately conclude from this study that children ages 9 to 12 who play violent video games are more likely to commit violent crimes?  Why or why not?

3. Can you legitimately conclude from this study that children ages 9 to 12 who play violent video games tend to hit and kick more in school?

4. Can you legitimately conclude from this study that playing a lot of violent video games will cause 9 to 12-year-old students to become more violent at school?

8-33.

Addie was helping children in a kindergarten class learn to read.  She was curious how old the typical child was when they entered kindergarten.  It was not practical to look up the school records of all 100 kindergarteners.  So on the first day of school, Addie took a sample: she asked the parent of the first fifteen students to be dropped off at the school how old (in months) their child was.  Her data is listed below.

67  61  69  72  71  65  67  67  57  68  71  72  61  59  62

Make an inference (a statistical prediction) of the mean age of kindergarten children at the school.

Assignment:

[/expand]