Lesson 1: Learning from Data

App Lab

Overview

In this lesson students explore the Google Trends tool in order to tell a "data story" which explains both what the data shows and why that might be. Following this, students are introduced to the concept of metadata and look for the metadata of datasets on App Lab.

Purpose

The three main purposes of this lesson are:

  1. Navigating and using a real data tool (Google Trends, see below) that is external to the course
  2. Getting acquainted with talking and writing about data. In particular we want to:
    • Draw a distinction between describing what the data shows and describing why it might be that way
    • In other words: describe connections and trends in data separate from drawing conclusions.
    • We want students to get in the habit of separating the what from the why when it comes to talking and writing about data
  3. Introduce the concept of metadata. We want students to understand the value of data about data, that can be used to find, organize, and manage information. It also increases effective use of data by providing extra information.

Agenda

Lesson Modifications

Warm Up (5 mins)

Activity (30 mins)

Wrap Up (10 mins)

View on Code Studio

Objectives

Students will be able to:

  • Differentiate between what data shows and why that might be the case
  • Use Google Trends to tell a data story
  • Explain the usefulness of metadata

Preparation

  • Preview Google Trends and prepare for the demo

Links

Heads Up! Please make a copy of any documents you plan to share with students.

For the Teachers

For the Students

Teaching Guide

Lesson Modifications

Attention, teachers! If you are teaching virtually or in a socially-distanced classroom, please read the full lesson plan below, then click here to access the modifications.

Warm Up (5 mins)

Remarks

Welcome to Unit 9: Data! In this unit, we are going to learn how to organize and visualize data to answer questions. We'll make charts, look for patterns, and consider the impact that data collection has on our world.

Let's get things started by asking a question.

Discussion Goal

Goal: Student answers will vary, but they may include major holidays like Valentine's Day. Students may also have several different ideas for how those answers could be checked, and ultimately you will want to guide your students towards the idea of using a search tool.

Prompt: What time of year do people tend to search online for chocolate? How could you check your guess?

Remarks

Search tools are useful for efficiently finding information. How can we analyze our searches to discover patterns or trends? Let's find out!

Activity (30 mins)

Remarks

When you post information to a social network, watch a video online, or simply search for information on a search engine, some of that data is collected, and you reveal what topics are currently on your mind. When a topic is quickly growing in popularity, it is often said to be trending, but there are many different trends or patterns we might find in this data, including historical trends. These patterns might help us to identify, understand, and predict how our world is changing.

We will be using Google Trends which is a tool that allows you to visualize data about search history across different times and locations. We can use it to look for interesting patterns, trends, or relationships between multiple trends and try to tell the story that pattern is showing.

Display: The chart relates to the warm-up question and shows searches for chocolate over the past 5 years.

Prompt: What pattern do you notice? What could be the reason for that pattern? Does this data support your earlier guesses?

Remarks

This chart gives us information: a collection of facts and patterns extracted from data. We can use this information to identify trends, make connections and address problems.

There are two distinctions we need to make when looking at a chart or visualization:

  • What does the data show?
  • Why might that be the case?

The "what" is the facts of the matter. Chocolate searches spike in December. The "why" is an informed opinion. Chocolate searches spike around the time of the winter holidays because it's a common gift.

Do This: Demonstrate how to use Google Trends in front of the class. Add several search items (for example: streaming, mp3, cd) or take suggestions from the class. Demonstrate usage of the dropdowns to narrow focus. As a class look for patterns in the data. Practice stating the What and the Why as you make charts with the class.

Do This: Now students use Google Trends themselves to look for "data stories". Depending on class time, allow students to share their data stories with partners or with the class as a whole.

Remarks

Using Google Trends to find patterns represents an important concept in understanding how programs are used to gain insight and knowledge from data. When you worked with the tool, there were elements of both interaction and iteration. You interacted with the tool trying out several different options, and repeated the process (iteration) until you came to a possible conclusion.

As you were determining the Why in your data stories, you may have been tempted to draw concrete connections. However, it's important to remember that correlation (similarities, patterns) does not equal causation (this thing caused that thing). There can be any number of reasons why a pattern or interesting data point may appear in a chart - and our job is to make an informed decision while recognizing that there may be multiple factors at play. Usually additional research with several data sets is necessary to understand the exact nature of the relationship between data. Did this one thing cause another thing to happen? Do more people search for chocolate because they want to give it as a gift, or could there perhaps be another reason?

In this unit we will be making charts to help answer questions:

  • "I think this visualization tells me this…"
    • Something is more popular than something else
    • Something is more important than something else
    • Something has become more or less searched over time
  • "... but I'm not sure because…"
    • I don't know exactly how the data was collected
    • This might tell me people searched for green more than red, but it doesn't tell me why they do that or that green is a better color
    • We need more data!

When we consider datasets, it's helpful to know as much about those datasets as possible. Where did the data come from? How much data is included? When was it collected?

All of this information is considered metadata which is defined as "data about data".

We can have metadata about any digital data. For example, this picture contains metatdata that tells us the when the picture was created, what the resolution is, and how many people have downloaded it. Datasets can also have metadata that explains more about the information in the dataset. You've seen this in App Lab!

Discussion Goal

Goal: Students may bring up the sources of the data (Google Searches) or the dropdowns that control how the chart is built. If students want to dig deeper into the metadata, consider pointing them to the Google Trends Help Page.

Prompt: What is the metadata for the chart you created in Google Trends?

Remarks

Now let's explore metadata in App Lab in the datasets you've used in previous units. Where can we find this information? What can it tell us about the data? How can it help us organize the data?

Do This: Students navigate to Level 2 on Code Studio where they open the data tab and look at the metadata for a table. Then they should share with a partner where the data comes from and what they can learn from the metadata.

Wrap Up (10 mins)

Review: Review key takeaways on the slide. Students may want to jot down notes in their journals.

Journal: Students add the following word and definition to their journal: metadata.


Assessment: Check For Understanding

Check For Understanding Question(s) and solutions can be found in each lesson on Code Studio. These questions can be used for an exit ticket.

Question: Below is an image from Google Trends that plots Cats and Dogs. Choose the most accurate description of what this data is actually showing based on what you know about how Google Trends works.

Standards Alignment

View full course alignment

CSTA K-12 Computer Science Standards (2017)

DA - Data & Analysis
  • 3A-DA-11 - Create interactive data visualizations using software tools to help others better understand real-world phenomena.
  • 3B-DA-05 - Use data analysis tools and techniques to identify patterns in data representing complex systems.

CSP2021

DAT-2 - Programs can be used to process data
DAT-2.A - Describe what information can be extracted from data.
  • DAT-2.A.1 - Information is the collection of facts and patterns extracted from data.
  • DAT-2.A.2 - Data provide opportunities for identifying trends, making connections, and addressing problems.
  • DAT-2.A.3 - Digitally processed data may show correlation between variables. A correlation found in data does not necessarily indicate that a causal relationship exists. Additional research is needed to understand the exact nature of the relationship.
  • DAT-2.A.4 - Often, a single source does not contain the data needed to draw a conclusion. It may be necessary to combine data from a variety of sources to formulate a conclusion.
DAT-2.B - Describe what information can be extracted from metadata.
  • DAT-2.B.1 - Metadata are data about data. For example, the piece of data may be an image, while the metadata may include the date of creation or the file size of the image.
  • DAT-2.B.2 - Changes and deletions made to metadata do not change the primary data.
  • DAT-2.B.3 - Metadata are used for finding, organizing, and managing information.
  • DAT-2.B.4 - Metadata can increase the effective use of data or data sets by providing additional information.
  • DAT-2.B.5 - Metadata allow data to be structured and organized.
DAT-2.D - Extract information from data using a program.
  • DAT-2.D.3 - Search tools are useful for efficiently finding information.
DAT-2.E - Explain how programs can be used to gain insight and knowledge from data.
  • DAT-2.E.1 - Programs are used in an iterative and interactive way when processing information to allow users to gain insight and knowledge about data.