DSCI 325 @ WSU
Spring 2024
My Schedule


Course Materials

  • Initiate a iPython Notebook: iPython Notebook
  • Initiate a R Colab Notebook: https://colab.to/r

  • SQL Aggregation:

  • SQL Advanced Filtering:
    • Database - AirlineDelays: Link
    • Notes: Link


  • SQL Introduction:
    • Database - AirlineDelays: Link
    • Database - OECDEmployment: Link
    • Notes: Link


  • Window Functions in Prep

  • Summaries + Joins with Prep
    • MN Schools Open Enrollment: Link; Click List Files; Download 2023-2024 Open Enrollment

  • Getting started with Prep
    • Airline Delays (Counts): Link
    • Airline On-Time Data (Raw Data): Link

  • Instructions to get Tableau Prep
    • Sign into an existing Tableau.com account, or create a new account using your school-issued email
    • Once signed in, visit the TFT Activation page to download the latest versions of Tableau Desktop and Tableau Prep Builder
    • Activate with product key: TC3B-683E-F7F0-8F9C-75BE
    • Already have a copy of Tableau Desktop installed? Update the license key in the application: Help menu → Manage Product Keys

  • Syllabus: Link

Tasks / Homework

  • Task #2: Due Tuesday, Feb 25
    • Database - CDC_BirthRates: Link
    • Questions: Link
  • Task #1 - Part B: Due Tuesday, Feb 3; Groups of Size 2 or less; 10 points
    • Using Excel, import the Netflow.csv from Task #1
    • Create a Pivot Table report that provides the sum of Netflow the school district of interest
    • Create a Pareto Chart that show the Top 5 Netflow (districts that provided the most students to the district of interest) and the Bottom 5 Netflow (districts for which the school district of interest lost the most students to). This chart should have the school districts sorted by Netflow from largest to smallest.
    • In Powerpoint, create a slide that automatically gets updated when the flow is rerun in Prep and the linked data in Excel is refreshed. Your slide should include: 1) Name of School District of interest, 2) Sum of Netflow, and 3) Your Pareto Chart.
  • Task #1: Due Tuesday, Jan 28; Groups of Size 2 or less; 15 points
    • Data: Folder
    • Task: Link
    • Submission: I will grade each group in class, I will specify a school district and your files should automatically update for this district