STSCI 2220

Course information provided by the 2025-2026 Catalog.

Statistics courses usually use clean, well-behaved data, which leaves many students unprepared for the messiness and chaos of real-world data. This course follows on from STSCI 2120 and covers more advanced data wrangling topics, including how to tidy data using the tidyverse R packages to better facilitate data analysis. These topics include string processing with regular expressions, manipulating date and time data, web scraping, and text mining. Data visualization topics cover visualization principles, the use of ggplot2 to create custom plots, and how to communicate data-driven findings.


Distribution Requirements (STA-IL)

Last 3 terms offered (None)

Learning Outcomes REF-FA25

  • Demonstrate the ability to combine and tidy data using the tidyverse R packages.
  • Produce professional and informative data visualizations using the ggplot2 R package.
  • Create reports to document data analysis and communicate findings using R Markdown.

Syllabi: none
  • Seven Week - Second. Choose one lecture and one discussion. Combined with: STSCI 5220

  • 2 Credits Opt NoAud

  • 19884 STSCI 2220   LEC 001

    • TR
    • Oct 15 - Dec 5, 2025
    • Entner, J

  • Instruction Mode: In Person

    For Bowers Computer and Information Science (CIS) Course Enrollment Help, please see: https://tdx.cornell.edu/TDClient/193/Portal/Home/

  • 19886 STSCI 2220   DIS 201

    • R
    • Oct 15 - Dec 5, 2025
    • Entner, J

  • Instruction Mode: In Person

  • 19887 STSCI 2220   DIS 202

    • R
    • Oct 15 - Dec 5, 2025
    • Entner, J

  • Instruction Mode: In Person