The Student Room Group

Edexcel GCSE statistics 2017 unofficial markscheme

I don't think anyone has started a thread yet for edexcel stats so thought I would. :smile: I guess because it's a pretty unpopular GCSE.

How did everyone find the exam? (higher)

I'm gonna try and remember as many questions as I can and I'll put my answers, but I only think I got about 70% of the paper right so don't go by them as an unofficial markscheme... didn't really revise and I'm not really an A* maths/stats student.

What do you guys think the grade boundaries will be? I think it was easier than last year's so I reckon somewhere between 78 and 82 will be the A* boundary.

A C grade will still be no more than 30% imo.

Scroll to see replies

Reply 1
Lmao, the exam was all good up until the last page , I think I did the standardized score the wrong way around :\ (and Icompletely forgot how to do weighted mean)

Grade boundaries won't be that high I think, a C will probably be around 30-40 marks, hopefully I got somewhere between 70-80%
Original post by Gulled Bulhan
Lmao, the exam was all good up until the last page , I think I did the standardized score the wrong way around :\ (and Icompletely forgot how to do weighted mean)

Grade boundaries won't be that high I think, a C will probably be around 30-40 marks, hopefully I got somewhere between 70-80%


What did you get for the standardised score?
The formula is score - mean divided by standard deviation
I got -0.6 for the standardised score.

The weighted mean (I think) would be (0.35 x 55) + (0.45 x 41) + (0.25 x 57)
I can't remember the actual grades given lol but you multiply the % each component carries by the mark achieved, then add all of those up for the weighted mean.

Yeah, I was reeeaaaallly hoping for an A* until the stupid bloody teacher who marked my work decided he 'didn't want to look like he was over-marking anybody' so gave me 27/40 when I know my grade was worthy of a 32/40 at least. I'm praying that edexcel will remark my coursework when they moderate the few courseworks that were sent off. It can't only be mine that was seriously marked down, but it makes me so pissed because I worked incredibly hard, I did 3 hypotheses and really, really thoroughly investigated them. Used pretty much every statistical method for strand 2 and wrote a good summary. I think I'm probably more likely to get an A grade or a high B. I think I got about 76 in the paper today. If my grade is a B I'll be devastated - I'm expecting an A at least in all of my subjects so I will honestly try to get the coursework thoroughly remarked because that will be where I lost so many marks that I really should have been given. I wish there was a way I could complain.

Good luck for results day! :smile:
What did everyone get for the last question?
Original post by dreamerkitten
What did you get for the standardised score?
The formula is score - mean divided by standard deviation
I got -0.6 for the standardised score.

The weighted mean (I think) would be (0.35 x 55) + (0.45 x 41) + (0.25 x 57)
I can't remember the actual grades given lol but you multiply the % each component carries by the mark achieved, then add all of those up for the weighted mean.

Yeah, I was reeeaaaallly hoping for an A* until the stupid bloody teacher who marked my work decided he 'didn't want to look like he was over-marking anybody' so gave me 27/40 when I know my grade was worthy of a 32/40 at least. I'm praying that edexcel will remark my coursework when they moderate the few courseworks that were sent off. It can't only be mine that was seriously marked down, but it makes me so pissed because I worked incredibly hard, I did 3 hypotheses and really, really thoroughly investigated them. Used pretty much every statistical method for strand 2 and wrote a good summary. I think I'm probably more likely to get an A grade or a high B. I think I got about 76 in the paper today. If my grade is a B I'll be devastated - I'm expecting an A at least in all of my subjects so I will honestly try to get the coursework thoroughly remarked because that will be where I lost so many marks that I really should have been given. I wish there was a way I could complain.

Good luck for results day! :smile:


I think you have done the weighted mean question wrong. You don't utilise the percentages- it didn't ask for it
HERE ARE MY ANSWERS (the ones I can remember) TO THE GCSE STATS 2017 HIGHER PAPER.
I probably wouldn't recommend using as an unofficial markscheme. I forgot a couple questions, and forgot the order. I only remembered 14 of them. and I don't think I got many answers right but here goes. Let me know what you guys got if you can remember !! :smile: Correct me where I'm wrong.




QUESTION 1 TIME SERIES GRAPH.


This was 4 marks to pretty much just draw a trend line and read off what the graph stated, I can’t remember it.


MARKS = 4


QUESTION 2 ELECTRONIC TABLET USERS


a) 8 } For these 2 it was just reading off the graph [1]

b) 28 } and doing 87 59 = 28 (or something like that) [2]

c) No, as there may be a different number of people in the UK and USA and we only know the % rather than actual numbers. Trends may also differ significantly for each country. [1]

d) IQR and Median were higher for tablet users, suggesting that overall the average age of a smartphone user is younger than for a tablet user (in the USA). Plus reference to the values and the difference. [4]



MARKS = 8


COMPOSITE BAR CHART QUESTION


A, b, and c were 6 easy marks you just had to read off the chart and then draw a new one.

- 35-56 year olds - 10% more guessed the price of a stamp correct, but both those and the 16-34 guessed too low(?) similarly. Plus evidence in the form of values/percentage. [2]



MARKS = 8


TELEPHONE INTERVIEW QUESTION


a) Electoral roll/register [1]

b) Is cheaper as no postage costs for researcher or respondent, so higher response rate. [1]

c) One source of bias interviewers may choose people they know to interview [1]

d) 80% said they would vote [2]

e) Method 1 = cluster sampling and Method 2 = stratified sampling [2]

f) Method two is better because 1) you will not get too many of a certain type of person (i.e. males) 2) less chance of bias [2]



MARKS = 9


BOX PLOT QUESTION


a) The box plot can be used to support this as it is slightly negatively skewed, and the median value shows that over 50% paid over 200k for their house [1]

b) 1.5 x IQR + UQ = 455,000. 500,000 is higher than the upper outlier boundary so is an outlier [3]



MARKS = 4


FORMS OF DATA


a) Age, as it is quantitative (which one can be shown in a stem&leaf) [2]

b) Time taken, as it will be discrete and has suitable class widths(?) [2]

c) Median (The name of the statistic needed) [1]



MARKS = 5


BAKER COLLECTING DATA QUESTION


a) Why he would have to collect primary data: he is the first to carry out this study [1]

b) If the sample is too small it will not be representative [1]

c) Tick boxes with non-overlapping values (I did less than 0.5 miles, 0.5-1 mile, 1.1-1.5 miles, 1.6-2 miles, more than 2 miles) [2]



MARKS = 8


SPEARMAN’S RANK CORRELATION COEFFICIENT


a) Show the sum of d^2 is 10: 0+0+1+4+4+1+0 = 10 [1]

b) SRCC = 0.82 [3]

c) Relatively strong correlation [1]

d) Yes, because the positive correlation suggests that the higher ice cream consumption has a relationship with the GDP value thing. [2]



MARKS = 7


CHAIN BASE INDEX


a) At first the prices increased rapidly, then they slowly fell by 5% by 2015. [1]

b) 2011 price = £3304.88 (3032 + 9% increase). [1]

c) Find the price for 2015. Show that if it increased by 25% it would be (something like) £3720 and the price for 2015 is £3820 so more than a 25% increase. [2]



MARKS = 4


VENN DIAGRAM QUESTION


a) 19/30 [1]

b) 16/30 [2]

c) 1/25 chance [2]

d) No, Greg is wrong. Chance of over 29 degrees was 10/30, chance of over 29 when rain was 6/30. So more likely to be over 29 when NOT raining. [3]



These are just guesses at what I put, I can’t remember what I actually answered for a and b... I can't remember the questions!!! Venn Diagram was: 10/30 over 29 degrees, 6/30 rain and over 29, 5/30 rain and 9/30 neither.

MARKS = 8


GEESE CAPTURE RECAPTURE QUESTION


I didn’t revise this topic and probably got it all wrong.

a) I got 500 but I think the answer was actually 402. [2]

b) It allowed the geese to move around as they would naturally, to distribute themselves. [1]

c) No, because the population may have increased more than he thought, he only took one sample. More offspring may have been born, geese could have died or moved to live somewhere else. [2]



MARKS = 5


WEIGHTED MEAN AND STANDARD DEVIATION


a) Mean = 51 (based on the scores I think it gave). I can’t remember his scores, I think it was 55, 41 and 57? But you had to find the mean. [1]

b) Weighted mean will be less because each component counts for a different % so one holding more weight counts for the most amount of mark, for example. [1]

c) Weighted mean = (0.35 x 55) + (0.45 x 41) + (0.25 x 57). I don’t know the exact marks or weightings, sorry, but I THINK that is the formula. [2]

d) Number of students higher than him I got something like 28. I just used the graph to add up the frequencies and then halved the class width that he was in and added that lol. [2]

e) 11.5 (or something like that was what I got for the Standard deviation.) [2]

f) SD is better because it will show how the data is spread and how close to the mean average it is. [1]



MARKS = 9


QUESTION 15. BINOMIAL DISTRIBUTION, RED/BLUE MARBLES


a) 0.7 chance of red [1]

b) 0.49 chance of red both times [2]

c) 0.83. I can’t remember exactly what it askes, but it was something like more than ½ of 5? This wording was confusing but I did p^5, 5p^4q, 10p^3q^2. Which would be 0.16 + 0.36 + 0.31 = 0.83 [3]

d) Show that the most likely number is 4 = 5p^4q worked out is 0.36 to 1dp = 0.4 which is 4 marbles [3]

e) No, binomial can only be used for independent trials. [2]



MARKS = 11


QUESTION 16. STANDARDISED SCORES AND NORMAL DISTRIBUTION.


a) 0.6 (plus working out) [2]

b) She did better in the vault, as her standardised score was higher for that than the beam. Evidence in the fact that -0.6 < 0.5 [2]

c) Could it be modelled by a normal distribution? NO, as 95% should lie between 13.3 and 15.7 and the highest score at 15.3 is below that 95%. Plus working to show mean +- 2 sd. [4]



MARKS = 8
Original post by SorabVij
I think you have done the weighted mean question wrong. You don't utilise the percentages- it didn't ask for it


I'm just using the formula given in the book now but I could be using it the wrong way. I can't actually remember exactly what I answered but I really hardly looked at weighted mean (I did the course last minute in 2 months) so you're probably right - what did you get for that question? I've added all of my answers that i can remember in a new comment on this thread.
Original post by dreamerkitten
HERE ARE MY ANSWERS (the ones I can remember) TO THE GCSE STATS 2017 HIGHER PAPER.
I probably wouldn't recommend using as an unofficial markscheme. I forgot a couple questions, and forgot the order. I only remembered 14 of them. and I don't think I got many answers right but here goes. Let me know what you guys got if you can remember !! :smile: Correct me where I'm wrong.





QUESTION 1 TIME SERIES GRAPH.



This was 4 marks to pretty much just draw a trend line and read off what the graph stated, I can’t remember it.



MARKS = 4



QUESTION 2 ELECTRONIC TABLET USERS


a) 8 } For these 2 it was just reading off the graph [1]

b) 28 } and doing 87 59 = 28 (or something like that) [2]

c) No, as there may be a different number of people in the UK and USA and we only know the % rather than actual numbers. Trends may also differ significantly for each country. [1]

d) IQR and Median were higher for tablet users, suggesting that overall the average age of a smartphone user is younger than for a tablet user (in the USA). Plus reference to the values and the difference. [4]




MARKS = 8



COMPOSITE BAR CHART QUESTION


A, b, and c were 6 easy marks you just had to read off the chart and then draw a new one.

- 35-56 year olds - 10% more guessed the price of a stamp correct, but both those and the 16-34 guessed too low(?) similarly. Plus evidence in the form of values/percentage. [2]




MARKS = 8



TELEPHONE INTERVIEW QUESTION


a) Electoral roll/register [1]

b) Is cheaper as no postage costs for researcher or respondent, so higher response rate. [1]

c) One source of bias interviewers may choose people they know to interview [1]

d) 80% said they would vote [2]

e) Method 1 = cluster sampling and Method 2 = stratified sampling [2]

f) Method two is better because 1) you will not get too many of a certain type of person (i.e. males) 2) less chance of bias [2]




MARKS = 9



BOX PLOT QUESTION


a) The box plot can be used to support this as it is slightly negatively skewed, and the median value shows that over 50% paid over 200k for their house [1]

b) 1.5 x IQR + UQ = 455,000. 500,000 is higher than the upper outlier boundary so is an outlier [3]




MARKS = 4



FORMS OF DATA


a) Age, as it is quantitative (which one can be shown in a stem&leaf) [2]

b) Time taken, as it will be discrete and has suitable class widths(?) [2]

c) Median (The name of the statistic needed) [1]




MARKS = 5



BAKER COLLECTING DATA QUESTION


a) Why he would have to collect primary data: he is the first to carry out this study [1]

b) If the sample is too small it will not be representative [1]

c) Tick boxes with non-overlapping values (I did less than 0.5 miles, 0.5-1 mile, 1.1-1.5 miles, 1.6-2 miles, more than 2 miles) [2]




MARKS = 8



SPEARMAN’S RANK CORRELATION COEFFICIENT


a) Show the sum of d^2 is 10: 0+0+1+4+4+1+0 = 10 [1]

b) SRCC = 0.82 [3]

c) Relatively strong correlation [1]

d) Yes, because the positive correlation suggests that the higher ice cream consumption has a relationship with the GDP value thing. [2]




MARKS = 7



CHAIN BASE INDEX


a) At first the prices increased rapidly, then they slowly fell by 5% by 2015. [1]

b) 2011 price = £3304.88 (3032 + 9% increase). [1]

c) Find the price for 2015. Show that if it increased by 25% it would be (something like) £3720 and the price for 2015 is £3820 so more than a 25% increase. [2]




MARKS = 4



VENN DIAGRAM QUESTION


a) 19/30 [1]

b) 16/30 [2]

c) 1/25 chance [2]

d) No, Greg is wrong. Chance of over 29 degrees was 10/30, chance of over 29 when rain was 6/30. So more likely to be over 29 when NOT raining. [3]



These are just guesses at what I put, I can’t remember what I actually answered for a and b... I can't remember the questions!!! Venn Diagram was: 10/30 over 29 degrees, 6/30 rain and over 29, 5/30 rain and 9/30 neither.


MARKS = 8



GEESE CAPTURE RECAPTURE QUESTION


I didn’t revise this topic and probably got it all wrong.

a) I got 500 but I think the answer was actually 402. [2]

b) It allowed the geese to move around as they would naturally, to distribute themselves. [1]

c) No, because the population may have increased more than he thought, he only took one sample. More offspring may have been born, geese could have died or moved to live somewhere else. [2]




MARKS = 5



WEIGHTED MEAN AND STANDARD DEVIATION


a) Mean = 51 (based on the scores I think it gave). I can’t remember his scores, I think it was 55, 41 and 57? But you had to find the mean. [1]

b) Weighted mean will be less because each component counts for a different % so one holding more weight counts for the most amount of mark, for example. [1]

c) Weighted mean = (0.35 x 55) + (0.45 x 41) + (0.25 x 57). I don’t know the exact marks or weightings, sorry, but I THINK that is the formula. [2]

d) Number of students higher than him I got something like 28. I just used the graph to add up the frequencies and then halved the class width that he was in and added that lol. [2]

e) 11.5 (or something like that was what I got for the Standard deviation.) [2]

f) SD is better because it will show how the data is spread and how close to the mean average it is. [1]




MARKS = 9



QUESTION 15. BINOMIAL DISTRIBUTION, RED/BLUE MARBLES


a) 0.7 chance of red [1]

b) 0.49 chance of red both times [2]

c) 0.83. I can’t remember exactly what it askes, but it was something like more than ½ of 5? This wording was confusing but I did p^5, 5p^4q, 10p^3q^2. Which would be 0.16 + 0.36 + 0.31 = 0.83 [3]

d) Show that the most likely number is 4 = 5p^4q worked out is 0.36 to 1dp = 0.4 which is 4 marbles [3]

e) No, binomial can only be used for independent trials. [2]




MARKS = 11



QUESTION 16. STANDARDISED SCORES AND NORMAL DISTRIBUTION.


a) 0.6 (plus working out) [2]

b) She did better in the vault, as her standardised score was higher for that than the beam. Evidence in the fact that -0.6 < 0.5 [2]

c) Could it be modelled by a normal distribution? NO, as 95% should lie between 13.3 and 15.7 and the highest score at 15.3 is below that 95%. Plus working to show mean +- 2 sd. [4]




MARKS = 8

Venn diagram question-
Wouldn't the probability for it being over 29 degrees when it's raining be 6/11 as there was 11 days when it was raining and 6 of these had a max temp above 29 degrees
Original post by SorabVij

Venn diagram question-
Wouldn't the probability for it being over 29 degrees when it's raining be 6/11 as there was 11 days when it was raining and 6 of these had a max temp above 29 degrees


Yeah - it would. b/c it's conditional probability P(A|B)
My answer is wrong; he IS correct in saying that there is more chance of the higher temp with rain, as the chance of it being over 29 degrees was 0.33 which is lower than 0.54 for rain and over 29 degrees.
I pretty much messed up quite a few questions and only realised it a few hours later... Always the case. I wrote up most of those answers straight after the exam, and have since noticed my mistakes. Oh well :/
Original post by dreamerkitten
HERE ARE MY ANSWERS (the ones I can remember) TO THE GCSE STATS 2017 HIGHER PAPER.
I probably wouldn't recommend using as an unofficial markscheme. I forgot a couple questions, and forgot the order. I only remembered 14 of them. and I don't think I got many answers right but here goes. Let me know what you guys got if you can remember !! :smile: Correct me where I'm wrong.





QUESTION 1 TIME SERIES GRAPH.



This was 4 marks to pretty much just draw a trend line and read off what the graph stated, I can’t remember it.



MARKS = 4



QUESTION 2 ELECTRONIC TABLET USERS


a) 8 } For these 2 it was just reading off the graph [1]

b) 28 } and doing 87 59 = 28 (or something like that) [2]

c) No, as there may be a different number of people in the UK and USA and we only know the % rather than actual numbers. Trends may also differ significantly for each country. [1]

d) IQR and Median were higher for tablet users, suggesting that overall the average age of a smartphone user is younger than for a tablet user (in the USA). Plus reference to the values and the difference. [4]




MARKS = 8



COMPOSITE BAR CHART QUESTION


A, b, and c were 6 easy marks you just had to read off the chart and then draw a new one.

- 35-56 year olds - 10% more guessed the price of a stamp correct, but both those and the 16-34 guessed too low(?) similarly. Plus evidence in the form of values/percentage. [2]




MARKS = 8



TELEPHONE INTERVIEW QUESTION


a) Electoral roll/register [1]

b) Is cheaper as no postage costs for researcher or respondent, so higher response rate. [1]

c) One source of bias interviewers may choose people they know to interview [1]

d) 80% said they would vote [2]

e) Method 1 = cluster sampling and Method 2 = stratified sampling [2]

f) Method two is better because 1) you will not get too many of a certain type of person (i.e. males) 2) less chance of bias [2]




MARKS = 9



BOX PLOT QUESTION


a) The box plot can be used to support this as it is slightly negatively skewed, and the median value shows that over 50% paid over 200k for their house [1]

b) 1.5 x IQR + UQ = 455,000. 500,000 is higher than the upper outlier boundary so is an outlier [3]




MARKS = 4



FORMS OF DATA


a) Age, as it is quantitative (which one can be shown in a stem&leaf) [2]

b) Time taken, as it will be discrete and has suitable class widths(?) [2]

c) Median (The name of the statistic needed) [1]




MARKS = 5



BAKER COLLECTING DATA QUESTION


a) Why he would have to collect primary data: he is the first to carry out this study [1]

b) If the sample is too small it will not be representative [1]

c) Tick boxes with non-overlapping values (I did less than 0.5 miles, 0.5-1 mile, 1.1-1.5 miles, 1.6-2 miles, more than 2 miles) [2]




MARKS = 8



SPEARMAN’S RANK CORRELATION COEFFICIENT


a) Show the sum of d^2 is 10: 0+0+1+4+4+1+0 = 10 [1]

b) SRCC = 0.82 [3]

c) Relatively strong correlation [1]

d) Yes, because the positive correlation suggests that the higher ice cream consumption has a relationship with the GDP value thing. [2]




MARKS = 7



CHAIN BASE INDEX


a) At first the prices increased rapidly, then they slowly fell by 5% by 2015. [1]

b) 2011 price = £3304.88 (3032 + 9% increase). [1]

c) Find the price for 2015. Show that if it increased by 25% it would be (something like) £3720 and the price for 2015 is £3820 so more than a 25% increase. [2]




MARKS = 4



VENN DIAGRAM QUESTION


a) 19/30 [1]

b) 16/30 [2]

c) 1/25 chance [2]

d) No, Greg is wrong. Chance of over 29 degrees was 10/30, chance of over 29 when rain was 6/30. So more likely to be over 29 when NOT raining. [3]



These are just guesses at what I put, I can’t remember what I actually answered for a and b... I can't remember the questions!!! Venn Diagram was: 10/30 over 29 degrees, 6/30 rain and over 29, 5/30 rain and 9/30 neither.


MARKS = 8



GEESE CAPTURE RECAPTURE QUESTION


I didn’t revise this topic and probably got it all wrong.

a) I got 500 but I think the answer was actually 402. [2]

b) It allowed the geese to move around as they would naturally, to distribute themselves. [1]

c) No, because the population may have increased more than he thought, he only took one sample. More offspring may have been born, geese could have died or moved to live somewhere else. [2]




MARKS = 5



WEIGHTED MEAN AND STANDARD DEVIATION


a) Mean = 51 (based on the scores I think it gave). I can’t remember his scores, I think it was 55, 41 and 57? But you had to find the mean. [1]

b) Weighted mean will be less because each component counts for a different % so one holding more weight counts for the most amount of mark, for example. [1]

c) Weighted mean = (0.35 x 55) + (0.45 x 41) + (0.25 x 57). I don’t know the exact marks or weightings, sorry, but I THINK that is the formula. [2]

d) Number of students higher than him I got something like 28. I just used the graph to add up the frequencies and then halved the class width that he was in and added that lol. [2]

e) 11.5 (or something like that was what I got for the Standard deviation.) [2]

f) SD is better because it will show how the data is spread and how close to the mean average it is. [1]




MARKS = 9



QUESTION 15. BINOMIAL DISTRIBUTION, RED/BLUE MARBLES


a) 0.7 chance of red [1]

b) 0.49 chance of red both times [2]

c) 0.83. I can’t remember exactly what it askes, but it was something like more than ½ of 5? This wording was confusing but I did p^5, 5p^4q, 10p^3q^2. Which would be 0.16 + 0.36 + 0.31 = 0.83 [3]

d) Show that the most likely number is 4 = 5p^4q worked out is 0.36 to 1dp = 0.4 which is 4 marbles [3]

e) No, binomial can only be used for independent trials. [2]




MARKS = 11



QUESTION 16. STANDARDISED SCORES AND NORMAL DISTRIBUTION.


a) 0.6 (plus working out) [2]

b) She did better in the vault, as her standardised score was higher for that than the beam. Evidence in the fact that -0.6 < 0.5 [2]

c) Could it be modelled by a normal distribution? NO, as 95% should lie between 13.3 and 15.7 and the highest score at 15.3 is below that 95%. Plus working to show mean +- 2 sd. [4]




MARKS = 8



For question 16, a) I got -0.6
On the Venn diagram question how would the answer be 1/25? I think I put like 21/30
Original post by Silverdps
For question 16, a) I got -0.6


What did u put for the very last one
Original post by Silverdps
For question 16, a) I got -0.6


So did I - that was a typo in my answers!!! I missed the - sign.
Original post by SorabVij
On the Venn diagram question how would the answer be 1/25? I think I put like 21/30


I don't know, I'm pretty sure I got it wrong. 21/30 seems much more likely - I can't actually remember the question but I don't think that 1/25 is right.
Also, for the chain base index question- did u get the actual percentage as 26. Something which is higher than 25%?
Also, for the chain base index question- did u get the actual percentage as 26. Something which is higher than 25%?
Original post by SorabVij
What did u put for the very last one

It cant be modeled by a normal distribution because the maximum score is only within 2 standard deviations etc
Original post by dreamerkitten
So did I - that was a typo in my answers!!! I missed the - sign.


At first I missed the - too, but then i changed it. I dont know why, Loads of people missed it, LOL
Original post by SorabVij
Also, for the chain base index question- did u get the actual percentage as 26. Something which is higher than 25%?

Yes it was a 26% increase.
Original post by Silverdps
Yes it was a 26% increase.

did u get a decimal after it?

Quick Reply

Latest

Trending

Trending