PDF

What is Data Science?

Data science is like solving mysteries using information. The information (called "data") can be numbers, words, or pictures. A data scientist asks a question, looks at data to find answers, and tells others what was learned.

Step-by-step: How to do a simple data science project

  1. Ask a question.

    Pick something you want to find out. Example: "Which candy color do my friends like best?" or "How many hours do kids in my class read each week?"

  2. Collect data.

    Gather answers. For the candy question, give each friend a small bowl of candies and count how many of each color they pick. Write the counts on paper or in a notebook.

  3. Organize and clean the data.

    Put your counts in a simple table. If a number looks wrong, check with the person who gave the answer. Cleaning means fixing mistakes or empty spots.

  4. Look for patterns (analyze).

    Find which color shows up the most. You can also calculate simple numbers:

    • Total: add all counts together.
    • Mode: the color that appears most (the favorite).
    • Mean (average): add the numbers and divide by how many items. Example: if five people picked 3, 4, 5, 3, 5 candies, the mean = (3+4+5+3+5)/5 = 20/5 = 4.
    • Percent: part divided by total times 100. If 6 out of 20 candies are red, percent = (6/20)*100 = 30%.
  5. Make a picture (visualize).

    Draw a bar chart: list candy colors along the bottom and draw bars up to the number for each color. Pictures make it easier to see the favorite color at a glance.

  6. Share what you learned.

    Tell a friend or make a short poster: "Red is the most popular candy color — 30% of the candies were red." Explaining your results is part of data science.

Mini-project you can try (Candy count)

  1. Ask 5 friends to choose 5 candies each from a mixed bag.
  2. Count how many of each color there are and write them down. Example table:
    Color:  Red  Blue  Green  Yellow
    Count:   6     3      4      2
        
  3. Find the total: 6+3+4+2 = 15 candies.
  4. Find the mode: Red (6) is the most.
  5. Find the percent of red: (6/15)*100 = 40%.
  6. Draw a bar chart on paper so each color has a bar up to its count (6, 3, 4, 2).
  7. Write one sentence conclusion: "Red is the favorite color because 40% of the candies were red."

Tools you can use

  • Paper and pencil — great for the first projects.
  • Spreadsheet (like Google Sheets) — helps add numbers and make charts.
  • Scratch or beginner coding (like Python with help) — lets you explore bigger projects later.

Tips and safety

  • Always ask permission before collecting other people's data (ask parents or teachers for help).
  • Keep personal details private (names, addresses) unless an adult says it's okay.
  • Start small and keep it fun. Big data science skills come from lots of tiny experiments like these.

Next steps if you like this

Try another question: "Which game do kids in my class play most?" or "How many minutes do I read each day for a week?" Do the same steps and compare results. If you enjoy charts and patterns, you might like learning spreadsheets or basic coding later.

You did it! You just did a simple data science project: asked a question, collected data, looked for patterns, made a picture, and shared what you learned.


Ask a followup question

Loading...