INFO 5368

INFO 5368

Course information provided by the 2025-2026 Catalog.

This course provides hands-on experience developing and deploying foundational machine learning algorithms on real-world datasets for practical applications (e.g., healthcare, computer vision). Students will learn about the machine learning pipeline end-to-end including dataset creation, pre- and post-processing, annotation, annotation validation, preparation for machine learning, training and testing a model, and evaluation. Students will focus on real-world challenges at each stage of the ML pipeline while handling bias in models and datasets. Lastly, students will analyze the strengths and weaknesses of regression, classification, clustering, and deep learning algorithms.


Prerequisites recommended coursework in Python Programming

Last 4 Terms Offered 2025SP, 2024SP, 2023SP

Outcomes

  • Collect a new dataset and prepare it for a ML task, train a model, and evaluate it.
  • Apply regression, classification, clustering, and deep learning algorithms to practical applications.
  • Analyze and identify key differences in regression, classification, clustering, and deep learning algorithms.
  • Understand core challenges of dataset creation including handling missing data, bias, unlabeled data, among others.
  • Represent features in datasets to be used for ML tasks.
  • Evaluate model quality using appropriate metrics of performance.

View Enrollment Information

Syllabi: none
  •   Regular Academic Session. 

  • 3 Credits Graded

  •  9083 INFO 5368   LEC 030

    • MW
    • Jan 20 - May 5, 2026
    • Taylor, A

  • Instruction Mode: In Person

    Enrollment limited to: Cornell Tech students.