What is Data Science?
Data science is like solving mysteries using information. The information (called "data") can be numbers, words, or pictures. A data scientist asks a question, looks at data to find answers, and tells others what was learned.
Step-by-step: How to do a simple data science project
- Ask a question.
Pick something you want to find out. Example: "Which candy color do my friends like best?" or "How many hours do kids in my class read each week?"
- Collect data.
Gather answers. For the candy question, give each friend a small bowl of candies and count how many of each color they pick. Write the counts on paper or in a notebook.
- Organize and clean the data.
Put your counts in a simple table. If a number looks wrong, check with the person who gave the answer. Cleaning means fixing mistakes or empty spots.
- Look for patterns (analyze).
Find which color shows up the most. You can also calculate simple numbers:
- Total: add all counts together.
- Mode: the color that appears most (the favorite).
- Mean (average): add the numbers and divide by how many items. Example: if five people picked 3, 4, 5, 3, 5 candies, the mean = (3+4+5+3+5)/5 = 20/5 = 4.
- Percent: part divided by total times 100. If 6 out of 20 candies are red, percent = (6/20)*100 = 30%.
- Make a picture (visualize).
Draw a bar chart: list candy colors along the bottom and draw bars up to the number for each color. Pictures make it easier to see the favorite color at a glance.
- Share what you learned.
Tell a friend or make a short poster: "Red is the most popular candy color — 30% of the candies were red." Explaining your results is part of data science.
Mini-project you can try (Candy count)
- Ask 5 friends to choose 5 candies each from a mixed bag.
- Count how many of each color there are and write them down. Example table:
Color: Red Blue Green Yellow Count: 6 3 4 2 - Find the total: 6+3+4+2 = 15 candies.
- Find the mode: Red (6) is the most.
- Find the percent of red: (6/15)*100 = 40%.
- Draw a bar chart on paper so each color has a bar up to its count (6, 3, 4, 2).
- Write one sentence conclusion: "Red is the favorite color because 40% of the candies were red."
Tools you can use
- Paper and pencil — great for the first projects.
- Spreadsheet (like Google Sheets) — helps add numbers and make charts.
- Scratch or beginner coding (like Python with help) — lets you explore bigger projects later.
Tips and safety
- Always ask permission before collecting other people's data (ask parents or teachers for help).
- Keep personal details private (names, addresses) unless an adult says it's okay.
- Start small and keep it fun. Big data science skills come from lots of tiny experiments like these.
Next steps if you like this
Try another question: "Which game do kids in my class play most?" or "How many minutes do I read each day for a week?" Do the same steps and compare results. If you enjoy charts and patterns, you might like learning spreadsheets or basic coding later.
You did it! You just did a simple data science project: asked a question, collected data, looked for patterns, made a picture, and shared what you learned.