Data - AP Statistics

Card 1 of 552

0

Didn't Know

0

Didn't Know

Knew It

0

1 of 2019 left

Question

Obtain a normal distribution table or calculator for this problem.

Approximate the $\small $90^{th}$$ -percentile on the standard normal distribution.

Tap to reveal answer

Answer

The $\small $90^{th}$$ -percentile is the value such that $\small 90$ percent of values are less than it.

Using a normal table or calculator (such as R, using the command qnorm(0.9)), we get that the approximate $\small $90^{th}$$ -percentile is about $\small 1.28$ .

← Didn't Know|Knew It →

Question

Find the first and third quartile for the set of data

Tap to reveal answer

Answer

In order to find the first and third quartiles, we haave to find the 25th and 75th percentiles, respectively.

To find the percentile, we find the product of and the number of items in the set.

We then round that number up if it is not a whole number, and the term in the set is the percentile.

For this problem, to find the and percentile, we first find that there are 14 items in the set. We find their respective products to be

and

As such, the and percentiles are the fourth and eleventh terms in the set, or

← Didn't Know|Knew It →

Question

Use the following five number summary to determine if there are any outliers in the data set:

Minimum:

Q1:

Median:

Q3:

Maximum:

Tap to reveal answer

Answer

An observation is an outlier if it falls more than above the upper quartile or more than below the lower quartile.

. The minimum value is so there are no outliers in the low end of the distribution.

. The maximum value is so there are no outliers in the high end of the distribution.

← Didn't Know|Knew It →

Question

For a data set, the first quartile is , the third quartile is and the median is .

Based on this information, a new observation can be considered an outlier if it is greater than what?

Tap to reveal answer

Answer

Use the $1.5 \cdot IQR$ criteria:

This states that anything less than or greater than will be an outlier.

Thus, we want to find

where .

$1.5 \cdot (70 - 40) =1.5\cdot 30=45$

$Q_3 + (1.5\cdot IQR) = 70 + 45 = 115$

Therefore, any new observation greater than 115 can be considered an outlier.

← Didn't Know|Knew It →

Question

Which values in the above data set are outliers?

Tap to reveal answer

Answer

Step 1: Recall the definition of an outlier as any value in a data set that is greater than or less than .

Step 2: Calculate the IQR, which is the third quartile minus the first quartile, or . To find and , first write the data in ascending order.

. Then, find the median, which is . Next, Find the median of data below , which is . Do the same for the data above to get . By finding the medians of the lower and upper halves of the data, you are able to find the value, that is greater than 25% of the data and , the value greater than 75% of the data.

Step 3: . No values less than 64.

. In the data set, 105 > 104, so it is an outlier.

← Didn't Know|Knew It →

Question

You are given the following information regarding a particular data set:

Q1:

Q3:

Assume that the numbers and are in the data set. How many of these numbers are outliers?

Tap to reveal answer

Answer

In order to find the outliers, we can use the and formulas.

Only two numbers are outside of the calculated range and therefore are outliers: and .

← Didn't Know|Knew It →

Question

Use the following five number summary to answer the question below:

Min:

Q1:

Med:

Q3:

Max:

Which of the following is true regarding outliers?

Tap to reveal answer

Answer

Using the and formulas, we can determine that both the minimum and maximum values of the data set are outliers.

This allows us to determine that there is at least one outlier in the upper side of the data set and at least one outlier in the lower side of the data set. Without any more information, we are not able to determine the exact number of outliers in the entire data set.

← Didn't Know|Knew It →

Question

A certain distribution has a 1st quartile of 8 and a 3rd quartile of 16. Which of the following data points would be considered an outlier?

Tap to reveal answer

Answer

An outlier is any data point that falls $1.5\ast IQR$ above the 3rd quartile and below the first quartile. The inter-quartile range is and $1.5\ast8=12$ . The lower bound would be and the upper bound would be . The only possible answer outside of this range is .

← Didn't Know|Knew It →

Question

Obtain a normal distribution table or calculator for this problem.

Approximate the $\small $90^{th}$$ -percentile on the standard normal distribution.

Tap to reveal answer

Answer

The $\small $90^{th}$$ -percentile is the value such that $\small 90$ percent of values are less than it.

Using a normal table or calculator (such as R, using the command qnorm(0.9)), we get that the approximate $\small $90^{th}$$ -percentile is about $\small 1.28$ .

← Didn't Know|Knew It →

Question

Find the first and third quartile for the set of data

Tap to reveal answer

Answer

In order to find the first and third quartiles, we haave to find the 25th and 75th percentiles, respectively.

To find the percentile, we find the product of and the number of items in the set.

We then round that number up if it is not a whole number, and the term in the set is the percentile.

For this problem, to find the and percentile, we first find that there are 14 items in the set. We find their respective products to be

and

As such, the and percentiles are the fourth and eleventh terms in the set, or

← Didn't Know|Knew It →

Question

A study is trying to determine if a particular medication (Y) is effective in weight loss. Patients participating in the study were randomly assigned to groups A, B, C, D, or E. Group A will receive one dose of Y, Group B will receive two doses of Y, Group C will receive three doses of Y, Group D will receive four doses of Y, and Group E will serve as the control group.

Which group will be receiving the placebo (a sugar pill)?

Tap to reveal answer

Answer

The control group in an experiment typically receives placebo treatments (in this case - Group E). Since all of the other groups are receiving at least one dose of the medication, they are considered to be experimental groups.

← Didn't Know|Knew It →

Question

To test if vitamin C actually makes people feel better, a vitamin company decides to run a 5-day study where they give one group of 100 sick participants vitamin C pills and another group of 100 sick people placebo pills, and monitored another group of 100 sick people who took no pills.

At the end of the 5-day experiment, 90 participants in the vitamin C group reported feeling better. 30 participants in the no-pill group felt better after the 5-day period. Interestingly, 50 participants in the placebo group felt better after the 5-day period.

What could explain these numbers?

Tap to reveal answer

Answer

The placebo effect is when effects are seen in a group of people who did not actually receive a treatment.

In the vitamin C group, 90 participants felt better.

Naturally (no-pill), 30 participants felt better.

With the placebo, 50 participants felt better. Since more people felt better with the placebo than with no treatment at all, it appears that some percentage of people believed that they would feel better with a pill and actually began to feel better due to the placebo effect.

← Didn't Know|Knew It →

Question

On a residual plot, the -axis displays the and the -axis displays .

Tap to reveal answer

Answer

A residual plot shows the difference between the actual and expected value, or residual. This goes on the y-axis. The plot shows these residuals in relation to the independent variable.

← Didn't Know|Knew It →

Question

Let us suppose a company wants to evaluate whether a new medical device works better than current devices. It conducts a small experiment to assess the effectiveness of the new device. To conduct the experiment, the company randomly assigns one group to the new medical device, which requires users to stay well hydrated, and the other group to the old device.

How should we control for confounding variables?

Tap to reveal answer

Answer

When comparing the effectiveness of a treatment, one should try to ensure that only the treatment varies across groups. In this case, the new device is compared to an old device. However, the new device also requires that users stay well hydrated. If we observe any positive effects from the new device, we won't know whether the new device is effective, or if merely staying well hydrated is actually what is effective. To rule out this confounding variable, we should also ask the group using the old machine condition to stay hydrated as well.

← Didn't Know|Knew It →

Question

An experiment was done by medical researchers to determine the association between drinking caffeine and severity of lung cancer. Results showed that there was a high association between the two variables. Which of the following could be a potential confounding variable in the experiment?

Tap to reveal answer

Answer

A confounding variable is one that could potentially have an effect on both the independent and dependent variables in a study. In this case, it is possible that there is an association between smoking and caffeine as well as smoking and lung cancer.

← Didn't Know|Knew It →

Question

A study finds that caffeine intake has a strong positive correlation with grades for college students. In other words, on average, the more caffeine intake a student has, the higher a grade the student gets.

Which of the following could potentially be a confounding variable in this experiment?

Tap to reveal answer

Answer

The only confounding variable in this experiment is the amount of sleep that each student gets. A confounding variable is one that has an impact on both the dependent and independent variable. It is possible that the amount of sleep a student gets is related to caffeine intake, which in turn affects the grade a student receives on a test or assignment.

← Didn't Know|Knew It →

Question

An experiment testing the effects of caffeine on endurance performance in athletes assigns caffeine to a randomly selected group of athletes and has them exercise. Another trial was conducted in which the same group exercised without anything given to them to take. The results did not match the expected results. What should be done to improve this experiment?

Tap to reveal answer

Answer

The placebo effect can potentially be a confounding variable. By knowingly taking a substance, participants may feel more energenized. By administering the same substance both trials, with the only thing changed being caffeine content, this corrects for this possible confounding variable.

← Didn't Know|Knew It →

Question

A small local umbrella company is trying to test the effectiveness of their umbrellas by looking at how many umbrellas they sell each year.

In 2014, the company sold 2,000 umbrellas.

In 2015, they sold 1,500 umbrellas.

They assume that their umbrellas are less effective which is why sales decreased.

However, there could be many confounding factors. Which of the following is NOT a possible confounding factor?

Tap to reveal answer

Answer

Any of these answers could explain why umbrella sales dropped. You cannot assume any specific cause explains a change in data like this-- further experimentation should be done rather than assuming cause and effect.

← Didn't Know|Knew It →

Question

A bird watcher observed how many birds came to her bird feeder for four days. These were the results:

Day 1: 15

Day 2: 12

Day 3: 10

Day 4: 13

Which answer is closest to the standard deviation of the number of birds to visit the bird feeder over the four days?

Tap to reveal answer

Answer

Standard deviation is essentially the average distance from the mean of a group of numbers. There are a number of steps in computing standard deviation, but the steps are not too complicated if you take them one at a time. First, find the mean of the values. Second, subtract the mean from the first value and square the result. Do this for all remaining values. Third, add these results together. Fourth, divide this value by the number of values. Finally, find the square root of the result.

1:

2:

3:

4: $13\div 4=3.25$

5: $$\sqrt{3.25}$=1.8$

← Didn't Know|Knew It →

Question

A bird watcher observed how many birds came to her bird feeder for four days. These were the results:

Day 1: 15

Day 2: 12

Day 3: 10

Day 4: 13

What is the variance of the number of birds that visited the bird feeder over the four days?

Tap to reveal answer

Answer

Variation measures the average difference between values within a group. The process is not complicated but there are four steps that can take time. First, find the mean of the values. Second, subtract the mean from the first value and square the result. Do this for all remaining values. Third, add these results together. Fourth, divide this value by the number of values in the group minus one (in this case, there are four days).

1:

2:

3:

4:

Note that to find the standard deviation, we would simply take one additional step of finding the square root of the variance.

← Didn't Know|Knew It →

0

Knew It