Course Description: This seminar (Frontiers in Foundation Models) explores the frontier of Large Language Models (LLMs) and next-generation multimodal foundation models. We move beyond standard autoregressive language modeling to study how modern systems integrate text, vision, video, code, and action to enable grounded reasoning and real-world decision making. Topics include multimodal reasoning and evaluation, vision representation learning and segmentation, video generation, foundation models for robotics, and action-centric representation learning. We also cover emerging paradigms such as hierarchical and latent reasoning, diffusion-based language modeling, reinforcement learning for reasoning, and agentic workflows. Throughout the course, we emphasize both mechanistic understanding and practical safety alignment, connecting recent advances in interpretability, test-time compute scaling, and robust evaluation to hands-on paper presentations and open-ended research projects.
We welcome students from diverse backgrounds who are interested in learning about SOTA foundation models.
📍 Time & Location: Friday, Periods 3 & 4 (12:10 PM - 3:10 PM) in Hill 009
| Week | Topic & Readings |
|---|---|
| Week 1Jan 23 |
Introduction & Interpretation Overview of Foundation Models. Safety and Interpretation. Reading: SELFIE |
| Week 2Jan 30 |
LLM Frontiers DeepSeek-V3.2 Kimi-K2 |
| Week 3Feb 6 |
Part 1: Student Presentation
Paper: Dino v3
|
| Week 4Feb 13 |
Guest Lecture (Starts 2:00 PM)
Didac Suris (Meta Super Intelligence Lab)
Topic: SAM 3 (Vision Foundation) |
| Week 5Feb 20 |
Part 1: Student Presentation
Paper: Why Do Multi-Agent LLM Systems Fail?
Part 2: Guest Lecture (Starts 1:40 PM)
Sachit Menon (Columbia University)
Topic: Multimodal Reasoning with Code Generation |
| Week 6Feb 27 |
Guest Lecture (Starts 1:40 PM)
Wenhao Ding (NVIDIA Scientist)
Topic: Accelerating the Development and Deployment of Reasoning Models for Physical AI |
| Week 7Mar 6 |
Part 1: Student Presentation (Aparajita)
Paper: DataComp-LM: In search of the next generation of training sets for language models
|
| Week 8Mar 13 | |
| Spring RecessMar 20 |
NO CLASS Rutgers Spring Recess (March 14 - March 22) |
| Week 9Mar 27 |
Part 1: Student Presentation (Alborz)
Paper: SAT Solvers in LLMs
Part 2: Student Presentation (Sinchana)
Paper: Wan
|
| Week 10Apr 3 |
<
Part 1: Student Presentation (Qiwei Zhao)
Paper: TBD
Part 2: Student Presentation (Prajwal)
Paper: ReasoningBank: Scaling Agent Self-Evolving with Reasoning Memory
|
| Week 11Apr 10 |
Part 1: Student Presentation (Pranika)
Paper: OpenThoughts (arXiv:2506.04178), https://www.open-thoughts.ai/, https://www.openthoughts.ai/blog/ot3,
|
| Week 12Apr 17 |
<
Part 1: Student Presentation (Varun)
Paper: TBD
Part 2: Student Presentation (Harsha)
Paper: World Models - NVIDIA Cosmos 2.5
|
| Week 13Apr 24 |
<
Part 1: Student Presentation (Lokesh Kota)
Paper: The Art of Scaling RL for LLMs
Part 2: Student Presentation (Santhosh)
Paper: TBD
|
| Week 14May 1 |
Final Final Project Presentations (Last day of Regular Classes) |