0% found this document useful (0 votes)
4 views8 pages

Final Practice

The document contains practice problems for a statistics course (STAT 213), covering topics such as the Central Limit Theorem, probability calculations, hypothesis testing, and confidence intervals. It includes multiple-choice questions related to various statistical concepts and scenarios, along with specific calculations and interpretations. The problems are designed to test understanding of statistical principles and applications.

Uploaded by

xc68p2pj4c
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views8 pages

Final Practice

The document contains practice problems for a statistics course (STAT 213), covering topics such as the Central Limit Theorem, probability calculations, hypothesis testing, and confidence intervals. It includes multiple-choice questions related to various statistical concepts and scenarios, along with specific calculations and interpretations. The problems are designed to test understanding of statistical principles and applications.

Uploaded by

xc68p2pj4c
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

© Samprit Ghosh 2025, all rights reserved.

Practice Problems for Final STAT 213

1. Let X be a random variable with mean µ and standard deviation σ. Suppose the
distribution of X is highly skewed and definitely not normal. Suppose random samples
of size n are selected from this population. Which of the following statements best
describes what the Central Limit Theorem guarantees?

(a) For any value of n, the distribution of the sample averages becomes approximately
normal.
(b) For sufficiently large n, the population distribution becomes approximately nor-
mal.
(c) For sufficiently large n, the distribution of the sample averages becomes approxi-
mately normal.
(d) The Central Limit Theorem only applies if the population is symmetric.
(e) For sufficiently large n, the average of every sample will be equal to µ.

2. A call center receives customer calls with an average rate of 3.6 calls per hour. If an
operator works an eight-hour shift, what is the probability that they receive exactly 2
calls in their last 30 mins? (Round to four decimals.)

(a) 0.2678 (b) 0.0005 (c) 0.1771 (d) 0.7306 (e) 0.3027

3. A survey revealed, in Calgary 22% of adults regularly use the CTrain. In a random
sample of 15 adults, what is the probability that at least 2 regularly use CTrain?
(Round to four decimal places.)

(a) 0.2010 (b) 0.8745 (c) 0.1258 (d) 0.22 (e) 0.9243

4. The average time required to process a customer order in an Amazon warehouse follows
an exponential distribution with mean value of 3 hours. What is the probability that
a randomly selected order takes less than 1.5 hours to process?
√ 1 √ 1
(a) e (b) √ (c) 1 − e (d) 1 − √ (e) None of the above.
e e
5. Let A and B be two events. Which of the following is always true?

(a) P (A ∩ B c ) = P (A) − P (B).


(b) If A and B are mutually exclusive, then they must be independent.
(c) If A and B are independent, then they must be mutually exclusive.
(d) P (Ac ∩ B c ) = 1 − P (A ∪ B).
(e) None of the above is always true.

6. Let X ∼ normal (mean = 2, sd = 4) and let Y = 2X + 3. Random samples of size 20


are selected from Y . Let y i denote the averages of these samples. Then approximately
4
(a) y ∼ normal (mean = 2, sd = √ )
20
4
(b) y ∼ normal (mean = 7, sd = √ )
20
16
(c) y ∼ normal (mean = 7, sd = √ )
20
8
(d) y ∼ normal (mean = 7, sd = √ )
20
(e) None of the above.

7. A wildlife biologist believes that more than 30% of the turtles in a certain wetland show
signs of shell damage caused by predators. To investigate this, she takes a random
sample of turtles and plans to perform a hypothesis test for a population proportion.
Which of the following is the correct hypothesis?

(a) H0 : p > 0.30


(b) H0 : p < 0.30
(c) H0 : p̂ < 0.30
(d) H0 : p ≤ 30
(e) H0 : p̂ ≤ 30.

Q8 - 10 are based on the following premise : A board game club surveys its players
about whether they prefer strategy games or party games, and whether they usually
play online or in-person. They found that 60% of the players prefer strategy games; the
rest prefer party games. The probability of a player playing online, given they prefer
strategy games is 0.4. Whereas, only 25% of the party games players, play online.

8. What is the probability that a randomly selected player prefers strategy games and
plays online?

(a) 0.24 (b) 0.40 (c) 0.15 (d) 0.60 (e) 0.90

9. Given that a player plays online, what is the probability that they prefer party games?
25 1 1 10
(a) 34
(b) 7
(c) 49
(d) 34
(e) Non of the above.

10. The two events : “Preference for strategy games” and “plays online” are independent.

(a) True (b) False

11. The test scores in a large university course are normally distributed with a mean of
75% and a standard deviation of 10%. What is the probability that the mean score of a
random sample of 49 students is greater than 77%? [Given that pnorm(1.4)=0.9192]

(a) 0.0808 (b) 0.9192 (c) 0.3085 (d) 0.6915 (e) 1.4

2
12. A city wants to estimate and compare the average monthly electricity usage of its house-
holds based on number of family members (1-member household, 2-member household,
etc). The city is divided into 20 neighborhoods. To collect data efficiently, the re-
searchers randomly select 4 neighborhoods and then survey every household within
those selected neighborhoods. This is an example of

(a) Simple random sampling


(b) Stratified Sampling
(c) Cluster Sampling
(d) Convenience sampling.
(e) Voluntary sampling.

13. Four fair dice (six-sided, numbered 1 through 6) were tossed. What is the probability
of getting four numbers in a row?
3 72 24 1 1
(a) (b) (c) (d) (e)
64 64 64 6 2

Q 14 - 22 are all related to the following experiment.

A public-health researcher wants to model how daily moderate exercise amount (E,
in minutes) relate to resting heart rate (H, per minute). Data were collected from
100 adults. Here are some boxplots and statistics for E and H.

. Boxplot for E Boxplot for H

14. Choose the correct statement below.

3
(a) The “exercise amount” sample contains some potential outliers, but the “heart
rate” sample does not.
(b) The“heart rate” sample contains some potential outliers, but the “exercise amount”
sample does not.
(c) None of the two samples contain any potential outliers.
(d) Both of the two samples contain potential outliers.
(e) Since the raw data is not given, we cannot determine whether or not these samples
have outliers.

15. The Upper fence for E-values is and the Lower fence for H-values is
.
Consider the R - code :

16. The coefficient of determination for this data is (round up to 4 decimal


places).

17. Interpret the coefficient of determination in the context of the data.

18. Use the regression model : Ĥi = β̂0 + β̂1 Ei . Consider the R-code below.

Choose the correct values for the regression coefficients and fill in the blanks :

Ĥ = + E.

19. Interpret the coefficient β̂0 in the context of the data.

20. Suppose we are thinking of predicting the resting heart rate of a person who only
exercises for 10 mins daily. Select the correct statement from below.

(a) The above regression model should be used, just plugging in E = 10 will give us
a prediction.

4
(b) The above regression model should be used, just solving for Ĥ = 10 will give us
a prediction.
(c) We should not use this regression model to predict, as cor(E,H) is negative.
(d) We should not use this regression model to predict, as this will require extrapo-
lation. E = 10 is not with in the range of values of E.
(e) There is not enough data to conclude whether we should or should not predict.

21. Suppose a participant in this data has a resting heart rate of 73 and they moder-
ately exercise for 33.8 minutes daily. Compute the prediction error (residual) for this
participant. (Round up to 4 decimal places).

22. Consider the normal probability plot of the residuals for this linear model (below).

Choose the correct answer.

(a) This plot determines normality, and it shows the assumption is satisfied.
(b) This plot determines normality, and it shows the assumption is not satisfied.
(c) This plot determines homoscedasticity, and it shows the assumption is satisfied.
(d) This plot determines homoscedasticity, and it shows the assumption is not satis-
fied.
(e) The plot does not tell us anything about normality or homoscedasticity.

Q23 - 26 are all related to the following experiment.

23. A nutritionist measures the daily calcium intake (in mg) for adults. In a random sample
of 40 adults the average intake was found to be 812 mg and a standard deviation of
120 mg. Assuming normality, suppose we want to construct a 95% confidence interval
for the population mean daily calcium intake. Which formula should we use?
r
X −µ X −µ p̂(1 − p̂)
(a) √ . (b) √ . (c) p̂ ± Z α2 .
σ/ n s/ n n

5
σ s
(d) X ± Z α2 √ (e) X ± T α2 ,n−1 √
n n
24. The calculated 95% confidence interval is (use 5 decimal places)
(a, b) = ( , )
(Use the tables provided in the last page, if needed.)
25. Choose the correct statement below.
(a) There is a 95% chance that the mean calcium intake of the population lies between
your calculated values of a and b.
(b) There is a 95% chance that the mean calcium intake of the sample lies between
your calculated values of a and b.
(c) 95% of all adults in the population have daily calcium intakes between your cal-
culated values of a and b.
(d) We are 95% confident that the mean calcium intake of the population will be
between your calculated values of a and b.
(e) The confidence interval sows that the probability of the population mean being
exactly 812 mg is 95%.
26. A health policy group claims that all adults on average consume at most 825 mg of
calcium per day. Based on your interval, do you support this claim?
(a) No. Because the calculated interval includes values greater than 825 mg.
(b) Yes. Since 825 mg is within our calculated interval.
(c) Yes. Since the sample mean (812 mg) is below 825 mg, the population mean
within the calculated interval, must be below 825 mg as well.
(d) No. Because the claim is at most 825 mg, which means it can be anything smaller,
like 725 mg which is outside our calculated interval.
(e) No. The calculated confidence interval cannot be used to assess inequalities in-
volving population mean.

Q27 - 30 are related to the following premise.


27. A game designer records how long (in minutes) it takes players to complete one level
in a fantasy dungeon adventure game. The observed completion times for 6 players
are :
45.3, 38.7, 52.1, 47.8, 41.0, 49.6
The following R-code was run to create a bootstrap sample B.

6
© Samprit Ghosh 2025, all rights reserved.

Use a calculator to compute the median for this bootstrapped sample.


.

28. Use a calculator to compute the mean of this bootstrapped sample.


.

29. Use a calculator to compute the standard deviation of this bootstrapped sample.
.

30. Consider the following R-code which now computes several confidence intervals based
on bootstrap percentiles.

You wish to test the null hypothesis H0 : µ = 42 vs. Ha : µ ̸= 42 using a significance


level of 0.05, based entirely on the bootstrap percentile interval above.

(a) Fail to reject H0 , the value 42 lies inside the bootstrap percentile interval.
(b) Fail to reject H0 , the value 42 lies outside the bootstrap percentile interval.
(c) Reject H0 , the value 42 lies inside the bootstrap percentile interval.
(d) Reject H0 , the value 42 lies outside the bootstrap percentile interval.
(e) Cannot determine. The given bootstrap percentile intervals cannot be used for
testing this hypothesis.

7
© Samprit Ghosh 2025, all rights reserved.

Here are some Z-scores and t-scores that might be useful.

qnorm(0.95) qnorm(0.975) qnorm(0.99) qnorm(0.995)


1.64485 1.95996 2.32635 2.57583

For degrees of freedom = 39,

qt(0.95,39) qt(0.975,39) qt(0.99,39) qt(0.995,39)


1.68486 2.02269 2.42584 2.70791

For degrees of freedom = 40,

qt(0.95,40) qt(0.975,40) qt(0.99,40) qt(0.995,40)


1.68385 2.02108 2.42326 2.70446

For degrees of freedom = 812,

qt(0.95,812) qt(0.975,812) qt(0.99,812) qt(0.995,812)


1.64673 1.96289 2.33095 2.5819

You might also like