Foundation Models

Elective course I took in Q4 of my first year while at Twente. It’s basically a in-depth continuation of the GenAI course. A foundation model is anything that gives you meaningful re-usable features. Foundation Models are trained on massive datasets and perform well in zero-shot (no fine-tuning) scenarios (so like ChatGPT, Gemini, all these LLMs).

Topics covered

Introduction to Foundation Models

Transformers, Introduction to Transformers in Deep Learning

Transformers in depth

Efficient FoMos, Mixture of Experts (MOE), Flash Attention

CLIP, DINO

Self-Supervision Objectives for FoMo Pre-training

Training Foundation Models

Improving the Generalization of ViTs for Action Understanding with VLM Pre-Training

FoMo Post-training and Adaption

Beyond attention-based methods — Structured State Space Sequence Models (SSM)

🚀 Costin Chitic

Recent Notes

Actor-Critic Methods

Deep Q-Learning

Monte Carlo Learning

Proximal Policy Optimization (PPO)

Q-Learning

Foundation Models

Graph View

Backlinks