The Large Data Set
Edexcel provides a real weather Large Data Set (LDS) that exam questions draw on. You will not memorise the numbers, but you should know its structure, its quirks, and how to handle missing or anomalous values.
What you'll be able to do
- Understand what the Large Data Set is
- Know the kinds of variables it contains
- Handle missing data and anomalies
- Use familiarity with the LDS in exam questions
What the LDS is
The Edexcel Large Data Set is real from UK and overseas weather stations across set time periods. Exam questions assume you have worked with it and understand its layout and units.
Variables and units
It contains both quantitative variables (temperature, rainfall, wind speed, pressure) and qualitative ones (location). Knowing the units and typical ranges helps you spot impossible or anomalous values.
Tip — Know the units (e.g. wind speed in knots, rainfall in mm) — exam questions test whether a value is realistic.
Missing data and anomalies
Some entries are blank (data not recorded — often shown as "n/a" or "tr" for trace rainfall). These are not zeros and should not be treated as such. Anomalies (clearly wrong readings) may be identified and excluded.
Formula recap
Common mistakes to avoid
Key takeaways
- The LDS is real weather data Edexcel provides for exams.
- It mixes quantitative (temperature, rainfall…) and qualitative (location) variables.
- Missing entries are not zeros; know units and spot anomalies.
Test yourself
Ready to lock in The Large Data Set? Pick a mode and earn XP & Dobloons.