ORIE 4741
Last Updated
- Schedule of Classes - February 6, 2017 7:14PM EST
- Course Catalog - February 6, 2017 7:15PM EST
Classes
ORIE 4741
Course Description
Course information provided by the Courses of Study 2016-2017.
Modern data sets, whether collected by scientists, engineers, medical researchers, government, financial firms, social networks, or software companies, are often big, messy, and extremely useful. This course addresses scalable robust methods for learning from big messy data. We'll cover techniques for learning with data that is messy --- consisting of real numbers, integers, booleans, categoricals, ordinals, graphs, text, sets, and more, with missing entries and with outliers --- and that is big --- which means we can only use algorithms whose complexity scales linearly in the size of the data. We will cover techniques for cleaning data, supervised and unsupervised learning, finding similar items, model validation, and feature engineering. The course will culminate in a final project in which students extract useful information from a big messy data set.
When Offered Fall.
Prerequisites/Corequisites Prerequisite: familiarity with linear algebra and matrix notation, a modern scripting language (such as Python, Matlab, Julia, R), and basic complexity and O(n) notation.
Regular Academic Session. Choose one lecture and one discussion.
-
Credits and Grading Basis
4 Credits Graded(Letter grades only)
-
Class Number & Section Details
-
Meeting Pattern
- TR Olin Hall 165
Instructors
Udell, M
-
Class Number & Section Details
-
Meeting Pattern
- M Frank H T Rhodes Hall 453
Instructors
Udell, M
-
Class Number & Section Details
-
Meeting Pattern
- M Frank H T Rhodes Hall 453
Instructors
Udell, M
-
Class Number & Section Details
-
Meeting Pattern
- F Frank H T Rhodes Hall 453
Instructors
Udell, M
-
Class Number & Section Details
-
Meeting Pattern
- F Frank H T Rhodes Hall 453
Instructors
Udell, M
Share
Disabled for this roster.