DSCI 325 @ WSU
Spring 2025
My Schedule


Course Materials


  • Midterm Study Guide: Link | DB Folder | Solution

  • SQL Window Functions:
    • Database - Baseball - Lahman: Link
    • Database - US Baby Names: Link

    • Notes - Top Baby Names by State & Year: Link
    • Notes - Window Functions: Link

  • SQL Joins:
    • Database - Podcasts: Link

    • Notes - Joins (Advanced): Link
    • Notes - Joins (Simple): Link


  • SQL UNION / Concatenate:
    • Database - Nursing Home Providers: Link | Source: Link
    • Database - World Happiness: Link

    • Notes - Union: Link | World Happiness Top 10 List: SQL


  • SQL Strings & Dates:
    • Database - Capital Bike: Link | Source:
    • Database - Nursing Home Providers: Link | Source: Link

    • Notes - Dates: Link
    • Notes - Strings: Link

  • SQL Aggregation:

  • SQL Advanced Filtering:
    • Database - AirlineDelays: Link
    • Notes: Link


  • SQL Introduction:
    • Database - AirlineDelays: Link
    • Database - OECDEmployment: Link
    • Notes: Link


  • Window Functions in Prep

  • Summaries + Joins with Prep
    • MN Schools Open Enrollment: Link; Click List Files; Download 2023-2024 Open Enrollment

  • Getting started with Prep
    • Airline Delays (Counts): Link
    • Airline On-Time Data (Raw Data): Link

  • Instructions to get Tableau Prep
    • Sign into an existing Tableau.com account, or create a new account using your school-issued email
    • Once signed in, visit the TFT Activation page to download the latest versions of Tableau Desktop and Tableau Prep Builder
    • Activate with product key: TC3B-683E-F7F0-8F9C-75BE
    • Already have a copy of Tableau Desktop installed? Update the license key in the application: Help menu → Manage Product Keys

  • Syllabus: Link

Tasks / Homework

  • Task #2: Due Tuesday, Feb 25
  • Task #1 - Part B: Due Tuesday, Feb 3; Groups of size 2 or fewer; 10 points
    • Using Excel, import the Netflow.csv from Task #1
    • Create a Pivot Table report that provides the sum of Netflow for the school district of interest
    • Create a Pareto Chart that shows the Top 5 Netflow (districts that provided the most students to the district of interest) and the Bottom 5 Netflow (districts to which the school district of interest lost the most students). This chart should have the school districts sorted by Netflow from largest to smallest.
    • In PowerPoint, create a slide that automatically updates when the flow is rerun in Prep and the linked data in Excel is refreshed. Your slide should include: 1) Name of School District of interest, 2) Sum of Netflow, and 3) Your Pareto Chart.
  • Task #1: Due Tuesday, Jan 28; Groups of size 2 or fewer; 15 points
    • Data: Folder
    • Task: Link
    • Submission: I will grade each group in class. I will specify a school district, and your files should automatically update for this district.
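
To sanity-check the Task #1 - Part B numbers outside of Excel, the Top 5 / Bottom 5 selection can be sketched in a few lines of Python. This is only an illustration: the district names and values below are made up, and the (district, netflow) layout is an assumption, since the actual columns of Netflow.csv may differ.

```python
# Hypothetical (district, netflow) rows standing in for Netflow.csv.
# The real file's column names and values are not shown here and may differ.
rows = [
    ("District A", 42), ("District B", 30), ("District C", 18),
    ("District D", 12), ("District E", 7), ("District F", 3),
    ("District G", -2), ("District H", -5), ("District I", -9),
    ("District J", -14), ("District K", -21), ("District L", -33),
]


def pareto_selection(rows, n=5):
    """Return the n largest and n smallest netflow rows, sorted largest to smallest."""
    ordered = sorted(rows, key=lambda r: r[1], reverse=True)
    if len(ordered) <= 2 * n:
        # Too few districts to split; keep everything so no row appears twice.
        return ordered
    return ordered[:n] + ordered[-n:]


# Sum of Netflow for the district of interest (the Pivot Table total).
total_netflow = sum(value for _, value in rows)

# Districts to plot, already in chart order (largest to smallest).
selection = pareto_selection(rows)

print("Sum of Netflow:", total_netflow)
for district, value in selection:
    print(f"{district}: {value}")
```

The total should match the Pivot Table, and the printed order should match the left-to-right order of the bars in the chart.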