Anonymous

Register for more FREE stuff!

my subscriptions

Hypothesis Testing

Question 1

[Maximum mark: 12]



Write down the null and alternative hypotheses for the following tests. Say if it’s a one or two-tailed test.



a) We assume that the 14-year-olds in a city are 160cm on average, but a self-conscious teacher feels like this is incorrect. Maybe they are taller than that.


b) A prestigious college advertises a starting salary of at least $50,000 after graduation. Recent graduates are trying to ascertain if the real figure isn’t lower in reality.


c) European citizens are supposed to get 6 weeks of paid leave a year. We want to have an idea if this is accurate.


d) Two high school classes A and B have about the same grades on average, according to the teachers. The students decide to run a test to see if that’s the truth.


e) On principle, men and women in the workforce should be paid the same, but there is reason to believe women still have a lower average salary.


f) A company A states that their lightbulbs last longer than company B’s. A consumer association sees no reason they wouldn’t be about the same, but still runs the tests.

Answers and Explanations

Show Answer

Question 2

[Maximum mark: 8]



A restaurant advertises that their burgers weigh 200g on average. A food critic suspects that they weigh less, and decides to run a 1-sample \( t \)-test by buying a random sample of 15 burgers. We assume the weights are normally distributed, and we write the null hypothesis \( H_0 \colon \mu = 200 \text{g} \).


The food critic finds that his burgers weigh on average 198g, with a standard deviation of 10g.


a) State the alternative hypothesis.


b) Calculate the \( p \)-value of this test.


c) At the 10% level of significance, what can you conclude from this test?


Answers and Explanations

Show Answer

Question 3

[Maximum mark: 6]



Two video games A and B are released at the same time. A gaming website tries to ascertain if game B is overall better. For this purpose, they ask random subscribers to rate them on a scale of 1 to 10.


They ask 27 subscribers to rate Game A and find an average of 8.3, with a standard deviation of 1.5.


They ask 23 different subscribers to rate Game B and find an average of 9.2, with a standard deviation of 1.2.


We assume normal distribution and pooled variances. Is videogame B better than A? Test at the 5% level of significance.

Answers and Explanations

Show Answer

Question 4

[Maximum mark: 7]



A gardener wants to see if there is a difference between the two types of fertilizer they use. As a metric, they choose to measure the height of plants using fertilizer 1 against the height of plants using fertilizer 2. Here is the data the gardener collected, expressed in centimeters.


Height of plants with fertilizer 1 Height of plants with fertilizer 2
68 62
59 55
64 73
38 66
49 63
34 73
62 49
57 69

We assume a normal distribution and equal population variances. Run the tests at significance level \( \alpha = 10% \) to see if the fertilizers are equivalent and state your conclusion.

Answers and Explanations

Show Answer

Question 5

[Maximum mark: 8]



For each model in the following situations, calculate the expected frequencies.


a) A company monitors the traffic to their website. At the end of the week, they know there were 3,500 visits in total. They suppose that there is the same amount of visits each day. Write how many visits are expected each day.


b) The number of computers in a home vary household by household. A study has found the following distribution:

Number of computers 0 1 2 3 4+
Percentage of households 10% 15% 40% 20% 15%

We will study a sample of 180 households. Write how many are expected in each category.


c) The birth weight of babies follows a normal distribution with mean 3.4kg and standard deviation 0.4kg. We have the following categories:

  • weight ≤ 3
  • 3 < weight ≤ 3.4
  • 3.4 < weight ≤ 3.8
  • weight > 3.8

There are currently 60 newborns in a hospital ward. Write how many are expected in each weight category.

Answers and Explanations

Show Answer

Question 6

[Maximum mark: 13]



You decide to make your own six-sided die to play tabletop games with. Before you use it with your friends, however, you want to ensure that it is fair. You roll it many times and write down the results, which are presented in the following table:


Result Occurrences
1 87
2 109
3 85
4 106
5 99
6 114

You decide to carry out a \(\chi^2\) goodness of fit test on those results, at a 5% significance level.


a) Find how many occurrences we expect to have for each face.


b) Write down the null and alternative hypotheses.


c) Write down the degrees of freedom for this test.


d) Find the \(p\)-value.


e) Give your conclusion.

Answers and Explanations

Show Answer

Question 7Register

[Maximum mark: 14]



When asked about their daily schedule, a survey participant estimates that their time is divided as follows:


Activity Work Chores Hobbies Socializing Sleeping
Percentage of time spent 30% 10% 20% 10% 30%

They are wearing a smart watch, which conveniently records the time spent doing each of these activities. The data it recorded is shown in minutes in this next table:

Activity Work Chores Hobbies Socializing Sleeping
Minutes spent 441 161 278 166 394

With the knowledge that there are 1,440 minutes in a day, we run a test to see if the interviewed person has a good estimation of how they spend their day. The chosen level of significance is 10%.


a) State the null and alternative hypotheses.


b) Find the expected time spent on each activity.


c) Find the \( p \)-value.


d) State your conclusion for this test.


e) Would your conclusion be different with a different level of significance? If yes, for which level of significance would your conclusions be different?

Answers and Explanations

Show Answer

Question 8Register

[Maximum mark: 10]



After a small tour, an artist’s label makes a model of the number of spectators at each concert. This model expects that the same amount of people attends all concert dates.


The data from ticket sales is shown in the following table.


Concert Audience Members
Date 1 20,361
Date 2 19,946
Date 3 19,799
Date 4 20,194

We use a goodness of fit test at the 5% significance level to determine whether the label’s model is apt. The critical value for this test is 7.81 and the hypotheses are:


\( H_0 \): the data follows the label’s model.


\( H_1 \): the data does not follow the label’s model.



a) With the same total amount of spectators, how many would attend a concert each date, according to the model?


b) What are the degrees of freedom of this test?


c) What is the value of \(\chi^2\) for this test?


d) Using the critical value, what conclusion to the test can you draw?

Answers and Explanations

Show Answer

Question 9Register

[Maximum mark: 11]



A group of friends are arguing over which vegetable is the tastiest, and they start to wonder if tastes might be influenced by the place you are from.


The group decides to conduct a test, asking 190 people from 3 different cities which vegetable they prefer. They find the following results:


City A City B City C
Carrot 20 20 25
Zucchini 15 10 15
Green Beans 10 20 15
Cauliflower 20 15 10

They conduct a \(\chi^2\) test for independence at the significance level \(\alpha = 5\%\).


a) Write down the test hypotheses.


b) Write down the number of degrees of freedom.


c) What’s the p-value of this test?


d) What conclusion can you draw from this test?

Answers and Explanations

Show Answer

Question 10Premium

[Maximum mark: 10]



A gym is trying to figure out to what public they should market their new ad campaign. They run a survey asking 4 groups of people to rate if they are physically Very Active, Somewhat Active, or Not Active. The groups are divided by age, and the results are as follows:


Younger than 20 Between 20 and 40 Between 40 and 60 Older than 60
Very active 4 10 6 2
Somewhat active 8 15 8 11
Not active 5 4 12 15

They wonder if it’s pertinent to pick any age group over the other, or if all groups are overall as active.


We are going to run a \(\chi^2\) test for independence at the significance level \(\alpha = 5\%\). The critical value for the test is 12.59.


a) Write down \(H_0\) and \(H_1\).


b) Write down the number of degrees of freedom.


c) Find the \(\chi^2\) for this test.


d) What conclusion can you draw?

Answers and Explanations

Show Answer

Question 11Premium

[Maximum mark: 8]



A newly found business starts producing pencils. They say that the length of the pencils will be 20 cm, with a standard deviation of 0.5 cm and a normal distribution. Bob bought 20 of these pencils and tested whether this claim was true. He found the mean length to be 19.8 cm using a 5% level of significance.



a) What is the:

i. null hypothesis

ii. alternative hypothesis


b) What are the \( z \) and \( p \)-values of this test?


c) What conclusion can we draw from this?


Answers and Explanations

Show Answer

Question 12Premium

[Maximum mark: 8]



Some friends team up and go for a run. The distance in kilometers they run per hour can be modelled with a Poisson distribution. Student A predicts they will run on average 10 kilometers an hour. Student B predicts they will only run less than 10 kilometers in 1 hour.


Student B decides to test this by measuring how far they run in 1 hour, and if it's less than 9 kilometers, Student A will be rejected.


a) What are the null and alternative hypotheses for this test?


b) What is the probability of a type I error?


c) What is a type II error? The group ends up running 9.5 km. What is the probability of a type II error?


Answers and Explanations

Show Answer

Question 13Premium

[Maximum mark: 10]



A newly found business starts producing pens. Alex buys 10 pens from this store, and measures the lengths of the pens, which are as follows:

\[ 20,\;19.5,\;22,\;19,\;20.4,\;21.3,\;20,\;20.3,\;20.4,\;19.3 \]


a) Assuming these lengths follow a normal distribution, find unbiased estimates for \(\mu\) and \(\sigma^2\).


b) Calculate the 95% confidence interval for the mean. Correct to 3 decimal places.


c) Alex predicted that the mean length of the pens will be 19 cm, which he makes his null hypothesis.

i. Find the t-statistic for this hypothesis and the p-value corresponding to this using a two-tailed test. Round your answers to two decimal places.

ii. State the conclusion at a 5% significance level.


Answers and Explanations

Show Answer

Question 14Premium

[Maximum mark: 8]



The mass of pens was recorded in grams and was normally distributed, with a mean \(\mu\) and standard deviation of 1.5. The masses of 5 of the pens are as follows:

\[ 11,\;10,\;9.5,\;10.5,\;10.2 \]


a) Find a 95% confidence interval estimating \(\mu\).


b) Find the confidence level interval that is one-third of what was obtained in part (a).


Answers and Explanations

Show Answer