INFO 5304

INFO 5304

Course information provided by the 2025-2026 Catalog.

Massive amounts of data are collected by many companies and organizations and the task of a data scientist is to extract actionable knowledge from the data - for scientific needs, to improve public health, to promote businesses, for social studies and for various other purposes. This course will focus on the practical aspects of the field and will attempt to provide a comprehensive set of tools for extracting knowledge from data.The course will cover the topics needed to solve data-science problems, which include problem formulation (business understanding), data preparation (collection, sampling, integration, cleaning), data modeling (characterization, model selection, and analysis), implementation (large-scale data processing, feedback loops, QA) and communication (data presentation, visualization). Advanced topics such as causal inference and processing streaming data will be presented.Throughout the course, the students will perform a data-science mission with all the required steps, from problem formulation to result presentation.


Enrollment Priority Enrollment limited to: Cornell Tech students.

Last 4 Terms Offered 2025SP, 2024SP, 2023SP, 2022SP

View Enrollment Information

Syllabi: none
  •   Regular Academic Session.  Combined with: CS 5304

  • 3 Credits Graded

  •  9789 INFO 5304   LEC 030

    • MW
    • Jan 20 - May 5, 2026
    • Kim, H

  • Instruction Mode: In Person

    Enrollment limited to Cornell Tech students.