a) This is just a usual mean calculation where we sum up all the numbers and divide it by 7, so the answer is 110642.86.
b) After ordering the salaries in ascending order and counting how many of them we have (there is 7 observations in total), it can be clearly seen that the 4th observation is 111000.
c) Since the previous mean was 110642.86, adding a new observation higher than the mean will lead to its increase.
a) As it can be seen from looking at the graph, 40km/h corresponds to cumulative frequency of 60, and 60km/h corresponds to cumulative frequency of 100. So, 40 drivers drove between 40km/h to 60km/h.
b) (i) Since there was 150 observations, the median will be at \( \frac{150 + 1}{2} \), so between the 75th and 76th observation. From the graph we can see that it will be around the speed of 45km/h.
b) (ii) Q3 occurs for the \( \frac{3}{4} * (n+1) \), so between the 113th and 114th observation. From the graph we can see that it will be around the speed of 65km/h.
b) (iii) Q1 similarly occurs for the \( \frac{1}{4} * (n+1) \), so between the 37th and 38th observation, so for the speed of around 33km/h.
c) Based on the answers from part (b), the IQR is equal to \( 65 - 33 = 32 \).
d) We can see that the speed of 90km/h occurs for the cumulative frequency of 135, so \( p = 150 - 135 = 15 \).
Closea) (i) Mean we can get simply by plugging the values in the GDC:
\[ mean = 12.6\]
a) (ii) The observations are already ordered, so median is the average of the 5th and 6th observation, so 12.
b) (i) The value of \( a \) represents the median, so \( a = 12 \).
b) (ii) The value of \( b \) is the maximum point in the data, so \( b = 25 \).
c) To answer this question we need to identify beyond which value are there outliers:
\[ Q_3 + 1.5 \times IQR = 17.3 + 1.5 * 10.3 = 32.3\]
So, \( p = 32.3 \)
d) We first need to find the standard deviation of the first group of students, which can be easily done in the GDC. It is equal to \( 6.61 \).
Now, we know that the standard deviation of the second group is two times smaller, so \( \sigma = 3.31 \).
Closea) This is a cumulative frequency graph, so the total number of customers is the last value on the y-axis, which is 200.
b) Since the limit is $10, it can be seen from the graph that this value corresponds to 50 people. Thus, since we know from the previous part that there were 200 customers, the answer is \( \frac{50}{200} \) so 25%.
c) Highest 10% of customers means that 20 top spending people have to be considered (10% of 200 is 20). Therefore, the minimum amount to qualify for that will be the amount spent by the 180th person. By looking at the graph, we can see that this value is close to $90.
Closea) To find the value of x we need to go backwards using the knowledge we have about the mean. In a normal case, to caluclate the value for the mean we first have to sum up all values.
Therefore, we know that the value of 0 happened 7 times, the value of 1 happened 11 times, the value of 2 happened 18 times, and so on. This can be written as: 0,0,0,0,0,0,0,1,1,1,1,1,1,1,1,1,1,1,2... Then, all these values have to be summed up and divided by the total number of observations:
\[ \text{mean} = \frac{0*7 + 1*11 + 2*18 + 3*x + 4*3 + 5*1}{7+11+18+x+3+1} \]
\[ \text{mean} = \frac{3x+64}{x+40} \]
Now, knowing that the mean is equal to 2:
\[ \text{2} = \frac{3x+64}{x+40} \]
\[ x = 16 \]
b) The modal value is the one which happens most often. As we can, 2 is our modal value, since it occurred 18 times.
Closea) To find \( a \), we need the total frequency to be 130:
\[ 2 + 4 + 8 + a + 21 + 20 + 37 + 28 + 0 + 1 = 130 \]
Thus, \( a = 9 \).
b)
The midpoints of each score interval are:
Score | Frequency | Midpoint |
---|---|---|
0 ≤ x < 10 | 2 | 5 |
10 ≤ x < 20 | 4 | 15 |
20 ≤ x < 30 | 8 | 25 |
30 ≤ x < 40 | 9 | 35 |
40 ≤ x < 50 | 21 | 45 |
50 ≤ x < 60 | 20 | 55 |
60 ≤ x < 70 | 37 | 65 |
70 ≤ x < 80 | 28 | 75 |
80 ≤ x < 90 | 0 | 85 |
90 ≤ x < 100 | 1 | 95 |
By plugging it into the GDC we get that:
\[ \text{mean} = 55.6 \]
c) To find the maximum and minimum scores which are not considered outliers, we need to calculate the interquartile range (IQR). First, we calculate the cumulative frequencies:
Score | Frequency | Cumulative Frequency |
---|---|---|
0 ≤ x < 10 | 2 | 2 |
10 ≤ x < 20 | 4 | 6 |
20 ≤ x < 30 | 8 | 14 |
30 ≤ x < 40 | 9 | 23 |
40 ≤ x < 50 | 21 | 44 |
50 ≤ x < 60 | 20 | 64 |
60 ≤ x < 70 | 37 | 101 |
70 ≤ x < 80 | 28 | 129 |
80 ≤ x < 90 | 0 | 129 |
90 ≤ x < 100 | 1 | 130 |
\[ Q_1 = \frac{1}{4} \times 130 = 32.5th \ observation \]
\[ Q_3 = \frac{3}{4} \times 130 = 97.5th \ observation \]
\[ Q_1 \approx 45 \]
\[ Q_3 \approx 65 \]
\[ IQR = Q_3 - Q_1 = 65 - 45 = 20 \]
\[ \text{Lower Boundary} = Q_1 - 1.5 \times IQR = 45 - 1.5 \times 20 = 15 \]
\[ \text{Upper Boundary} = Q_3 + 1.5 \times IQR = 65 + 1.5 \times 20 = 95 \]
The scores between 15 and 95 are not considered outliers.
d)
There were 29 students who scored 70 points or more.
\[ \text{Percentage} = \left( \frac{29}{130} \right) \times 100 \approx 22.3\% \]
Closea) There is 28 students, so hte median is between the 14th and 15th student, so at around 7.8 points.
b) (i) & (ii) By looking at the graph, we can see that:
\[ a = 4 \]
\[ b = 9 \]
c) To find the standard deviation we input the values from the updated frequency table, including the midpoints, into the GDC:
Score | \( 0 \leq x < 3 \) | \( 3 \leq x < 6 \) | \( 6 \leq x < 10 \) | \( 10 \leq x < 15 \) |
---|---|---|---|---|
Frequency | 7 | 4 | 9 | 8 |
Midpoint | 1.5 | 4.5 | 8 | 12.5 |
\[ std = 4.17 \]
Closea)
\[ \text{Mean} = \frac{S}{N} \]
\[ S = 18 \times 9 + 19 \times 11 + 20 \times 13 + 21 \times 10 + 22 \times x + 23 \times 7 \]
\[ S = 162 + 209 + 260 + 210 + 22x + 161 = 1002 + 22x \]
\[ N = 9 + 11 + 13 + 10 + x + 7 = 50 + x \]
The mean age is given as 21:
\[ 21 = \frac{1002 + 22x}{50 + x} \]
\[ 21 \times (50 + x) = 1002 + 22x \]
\[ 1050 + 21x = 1002 + 22x \]
\[ 1050 - 1002 = 22x - 21x \]
\[ 48 = x \]
So, \( x = 48 \).
b)
Total number of students between ages 19 and 22:
\[ N_{19-22} = 11 + 13 + 10 + 48 = 82 \]
Total sum of ages for this range:
\[ S_{19-22} = 19 \times 11 + 20 \times 13 + 21 \times 10 + 22 \times 48 \]
\[ S_{19-22} = 209 + 260 + 210 + 1056 = 1735 \]
\[ \text{Average Age} = \frac{1735}{82} \approx 21.2 \]
c)
By inputting the values into the GDC we get that:
\[ \sigma \approx 1.48 \]
Closea) (i) & (ii) There is 366 days in a leap year, so:
\[ 366 = 141 + 93 + b + 22 + a \]
\[ 366 = 141 + 93 + b + 22 + 10b \]
\[ b = 10 \]
\[ a = 1 \]
b) The standard deviation can be found by inputting the values into the GDC, using the midpoints:
\[ \sigma \approx 1.82 \]
c) There will be in total 23 days with rainfall of at least 6mm, which means that the government will have to pay 30000$.
CloseNumber of travels abroad | 0 | 1 | 2 | 3 | 4 | 5 |
---|---|---|---|---|---|---|
Number of people | 7 | 11 | 18 | x | 3 | 1 |
Score | Frequency |
---|---|
\( 0 \leq x < 10 \) | 2 |
\( 10 \leq x < 20 \) | 4 |
\( 20 \leq x < 30 \) | 8 |
\( 30 \leq x < 40 \) | a |
\( 40 \leq x < 50 \) | 21 |
\( 50 \leq x < 60 \) | 20 |
\( 60 \leq x < 70 \) | 37 |
\( 70 \leq x < 80 \) | 28 |
\( 80 \leq x < 90 \) | 0 |
\( 90 \leq x < 100 \) | 1 |
Score | \( 0 \leq x < 3 \) | \( 3 \leq x < 6 \) | \( 6 \leq x < 10 \) | \( 10 \leq x < 15 \) |
---|---|---|---|---|
Frequency | 7 | \( a \) | \( b \) | 8 |
Age | Number of students |
---|---|
18 | 9 |
19 | 11 |
20 | 13 |
21 | 10 |
22 | x |
23 | 7 |
Rainfall (mm) | Frequency |
---|---|
\( 0 \leq x < 2 \) | 141 |
\( 2 \leq x < 4 \) | 93 |
\( 4 \leq x < 6 \) | \( b \) |
\( 6 \leq x < 8 \) | 22 |
\( 8 \leq x < 10 \) | \( a \) |