Collecting and sampling data

Choose fair methods for gathering useful data.

Pearson EdexcelGCSE MathsStatisticsFoundation and Higher
Visual model

A sample should represent the whole population

sampleavoid bias: choose fairly
Gold-standard guide
20 mins

What you will learn

Choose fair methods for gathering useful data.
Use a clear step-by-step method for collecting and sampling data.
Check your answer and avoid the most common exam mistake.
Useful before you start
Core number skillsEarlier statistics skillsShowing clear working
Core knowledge

Know the rule, then use it

These are the short notes. Read each one, then check you can use it in the worked example below.

Method

Method

Sampling methods: simple random — every member has equal chance, for example lottery

Step 1

Simple random sampling

Number all 600 students 1–600

Step 2

Stratified sampling by year group

Find the number per year group

Step 3

State the key difference

Stratified ensures proportional representation; simple random does not guarantee it

Watch out

Watch out

Students say 'random is best' without explaining why

f
Random sample

Every member of the population should have a fair chance.

f
Stratified sample

sample from each group in proportion to the population.

Worked example

A school has 600 students. A researcher wants a sample of 60. Compare simple random sampling with stratified sampling by year group.

1

Simple random sampling: Number all 600 students 1–600. Generate 60 random numbers. Select those students.Advantage: unbiased and each student equally likely. Disadvantage: may not represent year groups proportionally.

2

Stratified sampling by year group: Find the number per year group.Calculate each year's share of the 60-person sample proportionally.Advantage: guaranteed representation of each year group. Disadvantage: requires a complete list sorted by year group.

3

State the key difference: Stratified ensures proportional representation; simple random does not guarantee it.

Final answer

Stratified ensures proportional sub-group representation; simple random may under- or over-sample a group

Question ladder

Build up to the hardest questions

Do them in order. If you miss a step, read the solution, then redo the question without looking.

Workedreasoning

A school has 600 students. A researcher wants a sample of 60. Compare simple random sampling with stratified sampling by year group.

3 marks4 minscollecting-and-sampling-data-worked
Show solution
Worked solution
  1. 1.Simple random sampling: Number all 600 students 1–600. Generate 60 random numbers. Select those students.Advantage: unbiased and each student equally likely. Disadvantage: may not represent year groups proportionally.
  2. 2.Stratified sampling by year group: Find the number per year group.Calculate each year's share of the 60-person sample proportionally.Advantage: guaranteed representation of each year group. Disadvantage: requires a complete list sorted by year group.
  3. 3.State the key difference: Stratified ensures proportional representation; simple random does not guarantee it.
Final answer

Stratified ensures proportional sub-group representation; simple random may under- or over-sample a group

Mark points
  • M1: simple random sampling
  • M1: stratified sampling by year group
  • M1: state the key difference
  • A1: Stratified ensures proportional sub-group representation; simple random may under- or over-sample a group
Watch out

Students say 'random is best' without explaining why.Marks come from the trade-off: what does each method guarantee, and at what cost?Always give both an advantage and a disadvantage for each method asked.

Diagnosticrecall

Give one advantage of stratified sampling.

1 mark2 minscollecting-and-sampling-data-q1
Show solution
Worked solution
  1. 1.Spot the skill: Sampling methods: simple random — every member has equal chance, for example lottery.
  2. 2.Use the simple random sampling stage first, then stratified sampling by year group.
  3. 3.Keep the final answer visible: Ensures all sub-groups are represented proportionally.
Final answer

Ensures all sub-groups are represented proportionally

Mark points
  • g. lottery. systematic — every nth member from a list. stratified — sample in proportion to sub-groups.cluster — sample whole groups. each has advantages and disadvantages relating to bias, cost and representation.
  • A1: Ensures all sub-groups are represented proportionally
Watch out

Students say 'random is best' without explaining why.Marks come from the trade-off: what does each method guarantee, and at what cost?Always give both an advantage and a disadvantage for each method asked.

Easyprocedure

Why might a questionnaire question be biased?

2 marks3 minscollecting-and-sampling-data-q2
Show solution
Worked solution
  1. 1.Spot the skill: Sampling methods: simple random — every member has equal chance, for example lottery.
  2. 2.Use the stratified sampling by year group stage first, then state the key difference.
  3. 3.Keep the final answer visible: Leading wording, only offering responses that agree, or not including a 'no' option.
Final answer

Leading wording, only offering responses that agree, or not including a 'no' option

Mark points
  • g. lottery. systematic — every nth member from a list. stratified — sample in proportion to sub-groups.cluster — sample whole groups. each has advantages and disadvantages relating to bias, cost and representation.
  • A1: Leading wording, only offering responses that agree, or not including a 'no' option
Watch out

Students say 'random is best' without explaining why.Marks come from the trade-off: what does each method guarantee, and at what cost?Always give both an advantage and a disadvantage for each method asked.

Mediumreasoning

What is a census?

3 marks4 minscollecting-and-sampling-data-q3
Show solution
Worked solution
  1. 1.Spot the skill: Sampling methods: simple random — every member has equal chance, for example lottery.
  2. 2.Use the state the key difference stage first, then simple random sampling.
  3. 3.Keep the final answer visible: Data collected from every member of the population, not a sample.
Final answer

Data collected from every member of the population, not a sample

Mark points
  • g. lottery. systematic — every nth member from a list. stratified — sample in proportion to sub-groups.cluster — sample whole groups. each has advantages and disadvantages relating to bias, cost and representation.
  • A1: Data collected from every member of the population, not a sample
Watch out

Students say 'random is best' without explaining why.Marks come from the trade-off: what does each method guarantee, and at what cost?Always give both an advantage and a disadvantage for each method asked.

Hardproblem solving

School has 300 yr10 and 200 yr11 students. A stratified sample of 50 needed. How many from yr10?

3 marks5 minscollecting-and-sampling-data-q4
Show solution
Worked solution
  1. 1.Spot the skill: Sampling methods: simple random — every member has equal chance, for example lottery.
  2. 2.Use the simple random sampling stage first, then stratified sampling by year group.
  3. 3.Keep the final answer visible: 30.
Final answer

30

Mark points
  • g. lottery. systematic — every nth member from a list. stratified — sample in proportion to sub-groups.cluster — sample whole groups. each has advantages and disadvantages relating to bias, cost and representation.
  • A1: 30
Watch out

Students say 'random is best' without explaining why.Marks come from the trade-off: what does each method guarantee, and at what cost?Always give both an advantage and a disadvantage for each method asked.

Exam-stylemulti-step

State the difference between discrete and continuous data.

4 marks6 minscollecting-and-sampling-data-q5
Show solution
Worked solution
  1. 1.Spot the skill: Sampling methods: simple random — every member has equal chance, for example lottery.
  2. 2.Use the stratified sampling by year group stage first, then state the key difference.
  3. 3.Keep the final answer visible: Discrete takes exact countable values; continuous can take any value in a range.
Final answer

Discrete takes exact countable values; continuous can take any value in a range

Mark points
  • g. lottery. systematic — every nth member from a list. stratified — sample in proportion to sub-groups.cluster — sample whole groups. each has advantages and disadvantages relating to bias, cost and representation.
  • A1: Discrete takes exact countable values; continuous can take any value in a range
Watch out

Students say 'random is best' without explaining why.Marks come from the trade-off: what does each method guarantee, and at what cost?Always give both an advantage and a disadvantage for each method asked.

Grade 9 stretchproblem solving

A year group contains 180 girls and 120 boys. A stratified sample has size 50. How many girls should be selected?

4 marks7 minssampling-g9
Show solution
Worked solution
  1. 1.Find the proportion of girls.
  2. 2.Multiply the proportion by the sample size.
Final answer

30 girls

Mark points
  • M1: use 18030018\frac{0}{3}00 × 50
  • A1: 30
Watch out

Do not rush straight into arithmetic. Select the relevant method and show a complete chain of working.

Timed checkpoint
12 mins - 9 marks

Switch between skills

Set a timer and attempt all four questions before opening any answers. This is closer to the way skills appear in a real paper.

1Collecting and sampling data - 2 marksGive one advantage of stratified sampling.Mark answer
Answer

Ensures all sub-groups are represented proportionally

2Averages and range - 2 marksFind the mean of: 6, 6, 7, 9, 12.Mark answer
Answer

8

3Grouped data and estimated mean - 2 marksWhich class contains the median for 30 values with frequencies 8, 12, 10?Mark answer
Answer

The 15th and 16th values — both in the second class [frequency 8, then 8+12=20]

4Charts, tables and diagrams - 3 marksA back-to-back stem-and-leaf shows boys' scores and girls' scores. How do you compare distributions?Mark answer
Answer

Compare medians and ranges for each group

Mastery check
  • I can explain the method for collecting and sampling data.
  • I can show clear working without skipping key steps.
  • I can avoid this mistake: Students say 'random is best' without explaining why.Marks come from the trade-off: what does each method guarantee, and at what cost?Always give both an advantage and a disadvantage for each method asked.
Related topics
Official exam-board sources

This guide follows the Pearson Edexcel GCSE Mathematics 1MA1 specification. Practice questions are original Learnova questions shaped around official content and exam skills.

Ready for the next step?

Get help with anything that still feels tricky.

Ask Nova Bot