INFO 5368

INFO 5368

Course information provided by the Courses of Study 2024-2025.

This course provides hands-on experience developing and deploying foundational machine learning algorithms on real-world datasets for practical applications (e.g., healthcare, computer vision). Students will learn about the machine learning pipeline end-to-end including dataset creation, pre- and post-processing, annotation, annotation validation, preparation for machine learning, training and testing a model, and evaluation. Students will focus on real-world challenges at each stage of the ML pipeline while handling bias in models and datasets. Lastly, students will analyze the strengths and weaknesses of regression, classification, clustering, and deep learning algorithms.

When Offered Spring.

Prerequisites/Corequisites Prerequisite: recommended coursework in Python Programming

Outcomes
  • Collect a new dataset and prepare it for a ML task, train a model, and evaluate it.
  • Apply regression, classification, clustering, and deep learning algorithms to practical applications.
  • Analyze and identify key differences in regression, classification, clustering, and deep learning algorithms.
  • Understand core challenges of dataset creation including handling missing data, bias, unlabeled data, among others.
  • Represent features in datasets to be used for ML tasks.
  • Evaluate model quality using appropriate metrics of performance.

View Enrollment Information

Syllabi: none
  •   Regular Academic Session. 

  • 3 Credits Graded

  • 19566 INFO 5368   LEC 030

    • MW
    • Jan 21 - May 6, 2025
    • Taylor, A

  • Instruction Mode: In Person
    Enrollment limited to: Cornell Tech students.