Skip to main content

Subsection 3.1.1 Introduction to Statistics

In everyday life, individuals, organizations, and governments make decisions based on information. Schools analyse learners’ performance, hospitals record patient data, businesses track sales trends, farmers decide when to plant crops and governments collect population data to plan development projects. This systematic process of gathering, organizing, analysing, and interpreting the information above is known as Statistics.
Statistics enables us to transform raw data into meaningful information that supports informed decision-making. Understanding where data comes from and how it is collected is essential before organizing, representing, and interpreting it.

Subsubsection 3.1.1.1 Sources of Data

Learner Experience 3.1.1.

Work in groups
A teacher assigned students a research topic on the effects of social media on teenage mental health in Kenya.
(a)
Suggest one potential primary data collection method that could be used.
(b)
Suggest one potential secondary data source that could be used.
(d)
Why is secondary data important for research and decision-making?

Key Takeaway 3.1.3.

Data is a collection of facts, figures, numbers, observations, measurements, or descriptions that can be recorded, stored, and used to study or understand something.
There are two primary categories of data sources: primary data and secondary data.
  • Primary data is collected firsthand by the researcher or investigator specifically for the current study.
  • Secondary data is information that has already been collected by someone else for a different purpose and is now used by the researcher.

Example 3.1.4.

Classify the following as primary or secondary data sources:
  1. A student conducts a survey to find out the favorite sports of their classmates.
  2. A teacher uses last year’s national exam results to analyze student performance trends.
  3. A researcher reads a government report on the most common diseases in Kenya.
  4. A doctor observes a patient’s symptoms and records them for a medical study.
  5. A scientist conducts an experiment to test the growth rate of plants under different conditions.
Solution.
  1. Primary because the student is collecting first-hand data directly from people.
  2. Secondary because the exam results were already collected and recorded by an external body.
  3. Secondary because The report was collected and published by someone else for a different purpose.
  4. Primary because The doctor is directly collecting new data from real-life observation.
  5. Primary because The scientist is generating new data through an experiment.

Example 3.1.5.

A community group wants to understand the needs of youth in their area.
  1. What primary data collection methods could the group use?
  2. What problems might they have when trying to find this information?
  3. How can they use the information they find to help the community?
  4. What secondary data sources could be useful to this community group?
Solution.
  1. Focus groups, surveys, interviews.

Exercises Exercises

2.
In a market, a shop owner watches what customers buy most often and looks for trends based on what they see in the store.
  1. Which sources of data can the shop owner use to collect the data?
Answer.
Direct observation, sales transaction records, and customer surveys/interviews.
3.
What are the advantages of using secondary data over primary data in some research situations?
Answer.
It is faster to obtain, cheaper than conducting new research, and provides access to larger historical datasets.

Subsubsection 3.1.1.2 Methods of Data Collection

Learner Experience 3.1.2.

Work in groups
  1. The following data represents the number of hours people spend watching television per week.
  2. Create a table to organize this data.
  3. What type of data collection method was most likely used to gather this data?
  4. What are some other questions that could be asked to further explore this topic?
  5. Discuss and share with other groups.

Key Takeaway 3.1.9.

  • Methods of collecting data help us gather information.
  • Data can be collected using:
    Primary data collection methods (interviews, observations, questionnaires, experiments) involve gathering new data directly from sources.
    Secondary data collection methods involve using existing data records.
    1. Surveys/Questionnaires: A set of questions designed to gather information from a group of people. They can be conducted in person, over the phone, or online.
    2. Interviews: A conversation between an interviewer and a respondent to gather detailed information on a specific topic.
    3. Observations: The process of watching and recording behaviors or events as they occur in their natural setting.
    4. Experiments: A method of collecting data by manipulating one variable to determine its effect on another variable.
    5. Existing records (Secondary data): The use of existing data that has been collected by someone else for a different purpose. This can include government reports, academic studies, and historical records.

Example 3.1.10.

A student wants to find out the most popular extracurricular activities among their classmates.
  1. What data collection method would be most appropriate?
  2. What are two examples of specific questions they could ask?
Solution.
  1. Survey/Questionnaire.
  2. Questions for students to ask are;

Example 3.1.11.

A local bakery, conducted a study asking customers about their favorite types of pastries. The results are:
  1. What type of data collection method was most likely used to gather this data?
  2. What is the most popular type of pastry among customers?
  3. What percentage of customers prefer either Cakes or Cookies?
  4. What type of pastry is the least popular among customers?
Solution.
  1. A survey.
  2. Cakes
  3. \(45\% \) + \(30\%\)= \(75\%\)
    Therefore, the percentage of customers who prefer either Cakes or Cookies are \(75\%\)
  4. Doughnuts

Exercises Exercises

2.
Describe one ethical consideration you should keep in mind when conducting interviews with community members.
Answer.
Obtaining informed consent from the participants.
3.
Why is it important to keep accurate records when conducting observations?
Answer.
To ensure the reliability and validity of the final analysis.
4.
Give one example of a situation where you would use secondary data collection in a Geography class.
Answer.
Using a government census report to study population density in a region.