INFO 2950

INFO 2950

Course information provided by the 2026-2027 Catalog.

INFO 2950 is an applied introductory course on the foundations of data science, focusing on using data to identify patterns, evaluating the strength and significance of relationships, and generating predictions using data. Topics covered include the core principles of statistical programming (such as data frames, Python/R packages, reproducible workflows, and version control), univariate and multivariate statistical analysis of small and medium-size datasets, regression methods, hypothesis testing, probability models, basic supervised and unsupervised machine learning, data visualization, and network analysis. Students will learn how to use data to make effective arguments in a way that promotes the ethical usage of data. Students who complete the course will be able to produce meaningful, data-driven analyses of real-world problems and will be prepared to begin more advanced work in data-intensive domains.


Prerequisites one course is core statistics (MATH 1710 or equivalent) and one course in core programming (CS 1110 or CS 1112) or permission of instructor.

Distribution Requirements (DLG-AG, OPHLS-AG), (SDS-AS), (STA-IL)

Last 4 Terms Offered 2025FA, 2024FA, 2024SU, 2024SP

Learning Outcomes

  • Identify, acquire and clean project-relevant data from diverse sources. Analyze data fitness for purpose and evaluate solutions for data quality issues such as missing records.
  • Use data ethically, including by applying best practices for data sourcing, documentation, and privacy preservation.
  • Create data science projects using reproducible methods, common tools, and standard software libraries.
  • Apply methods to transform tabular data through operations such as filtering, aggregation, and pivoting.
  • Analyze data-generating processes via simulation.
  • Apply exploratory data analysis through visualization, summary statistics and outlier identification. Use exploratory results to generate research hypotheses.
  • Identify, measure, and evaluate the significance of patterns in large datasets.
  • Select and apply common learning algorithms to make predictions about unobserved data.
  • Create compelling, reliable arguments informed by data.

View Enrollment Information

Syllabi: none
  •   Regular Academic Session.  Choose one lecture and one discussion.

  • 4 Credits Graded

  • 16429 INFO 2950   LEC 001

    • TR
    • Aug 24 - Dec 7, 2026
    • Wilkens, M

  • Instruction Mode: In Person

    For Bowers Computer and Information Science (CIS) Course Enrollment Help, please see: https://tdx.cornell.edu/TDClient/193/Portal/Home/

  • 16430 INFO 2950   DIS 201

    • F
    • Aug 24 - Dec 7, 2026
    • Wilkens, M

  • Instruction Mode: In Person

  • 16431 INFO 2950   DIS 202

    • F
    • Aug 24 - Dec 7, 2026
    • Wilkens, M

  • Instruction Mode: In Person

  • 16432 INFO 2950   DIS 203

    • F
    • Aug 24 - Dec 7, 2026
    • Wilkens, M

  • Instruction Mode: In Person

  • 16433 INFO 2950   DIS 204

    • F
    • Aug 24 - Dec 7, 2026
    • Wilkens, M

  • Instruction Mode: In Person

  • 16434 INFO 2950   DIS 205

    • F
    • Aug 24 - Dec 7, 2026
    • Wilkens, M

  • Instruction Mode: In Person

  • 16435 INFO 2950   DIS 206

    • F
    • Aug 24 - Dec 7, 2026
    • Wilkens, M

  • Instruction Mode: In Person

  • 16436 INFO 2950   DIS 207

    • F
    • Aug 24 - Dec 7, 2026
    • Wilkens, M

  • Instruction Mode: In Person

  • 16437 INFO 2950   DIS 208

    • F
    • Aug 24 - Dec 7, 2026
    • Wilkens, M

  • Instruction Mode: In Person