COMPSCI 714 · AI Architecture and Design
This course focuses on the design and implementation of modern AI systems — transformer architectures, large language models, training dynamics, and system-level considerations in deep learning.
Notes here focus on:
- architectural reasoning: why design choices are made, not just what they are
- implementation-level understanding — building models from scratch to expose internals
- connecting theory to real systems (GPT-2, attention mechanisms, tokenization)