Class Roster

Last Updated

Schedule of Classes - May 17, 2026 7:07PM EDT

CS 6674

Multimodal Computer Vision

Course information provided by the 2025-2026 Catalog.

Multimodal representations are reshaping computer vision, driving advances in both understanding and generation across a wide range of perceptual tasks. This research-oriented course explores computer vision techniques that integrate images with additional modalities such as language and 3D geometry for addressing challenges in both analysis and synthesis tasks. Possible topics include visual grounding, multimodal alignment, and text-guided generation and editing over multiple 2D and 3D representations.

Enrollment Priority Enrollment limited to: Cornell Tech PhD Students. Recommended prerequisites: CS 3780/CS 5780 or equivalent, CS 5670 or equivalent.

Last 4 Terms Offered 2026SP

Learning Outcomes

Analyze state-of-the-art multimodal techniques and understand their architectures and capabilities.
Identify open challenges in the field of multimodal computer vision.
Design deep learning pipelines that combine multiple modalities for addressing a research-oriented problem in computer vision.

View Enrollment Information

Syllabi:

Regular Academic Session.
Credits and Grading Basis

3 Credits Graded(Letter grades only)

Class Number & Section Details

18477 CS 6674 LEC 001
Meeting Pattern
- TR 10:10am - 11:25am
- Jan 20 - May 5, 2026
- Instructors
  
  Elor, H
To be determined. There are currently no textbooks/materials listed, or no textbooks/materials required, for this section. Additional information may be found on the syllabus provided by your professor.

For the most current information about textbooks, including the timing and options for purchase, see the Cornell Store.
Additional Information

Instruction Mode: Distance Learning-Synchronous

Syllabi:

Regular Academic Session.
Credits and Grading Basis

3 Credits Graded(Letter grades only)

Class Number & Section Details

18466 CS 6674 LEC 030
Meeting Pattern
- TR 10:10am - 11:25am
- Jan 20 - May 5, 2026
- Instructors
  
  Elor, H
To be determined. There are currently no textbooks/materials listed, or no textbooks/materials required, for this section. Additional information may be found on the syllabus provided by your professor.

For the most current information about textbooks, including the timing and options for purchase, see the Cornell Store.
Additional Information

Instruction Mode: In Person

Enrollment limited to Cornell Tech PhD Students. Masters' students may enroll with instructor permission.

Section Menu

Last Updated

Classes

CS 6674

Multimodal Computer Vision

Course Description

Credits and Grading Basis

Class Number & Section Details

Meeting Pattern

Instructors

Additional Information

Credits and Grading Basis

Class Number & Section Details

Meeting Pattern

Instructors

Additional Information

Share

About the Class Roster

CS 6674

Last Updated

Classes

CS 6674 Multimodal Computer Vision

Course Description

Credits and Grading Basis

Class Number & Section Details

Meeting Pattern

Instructors

Additional Information

Credits and Grading Basis

Class Number & Section Details

Meeting Pattern

Instructors

Additional Information

Share

CS 6674

Multimodal Computer Vision