We may earn commissions from brands listed on this site, which can influence how listings are presented.Advertising Disclosure
Last updated: June 2026 · Reviewed by the AI for Zebras Team · Methodology · Disclosure
Why trust us? AI for Zebras scores every product against a published methodology. Affiliate commissions help fund our work and never change our scores or rankings. How we disclose.

AI for Experts: Go Deep on LLMs and Research

You already build with AI. Now understand it from first principles - how models are trained, why they behave the way they do, and where the frontier is heading. These are the best advanced courses, the talks worth your time, and the researchers to learn from directly.

Good for
  • ML engineers and AI researchers
  • Engineers building production model pipelines
  • People who want depth beyond typical tutorials
Not for

Where to start: a two-step sequence

Watch these in order. The first gives you the visual architecture intuition. The second builds everything from scratch in code. Together they get you to the level where you can read papers and form your own opinions.

Step 1: how transformers work, visually

3Blue1Brown's "Transformers, how LLMs work" - the clearest visual explanation of attention and the transformer architecture. No code, pure intuition.

Step 2: build one from scratch

Andrej Karpathy's "Intro to Large Language Models" - the full engineering mental model, from tokens to RLHF. One hour that replaces three blog posts.

Some links on this page are affiliate links. If you click through and sign up or buy, we may earn a commission - at no extra cost to you. Full disclosure.

Best ways to go deep

1
Neural Networks: Zero to Hero Andrej Karpathy · free ★ Start Here

Build a GPT from scratch, line by line, in plain PyTorch. The single best way to make the abstractions stop being magic. Completely free.

  • From backprop to a working transformer
  • Code-along, no hand-waving
  • Free on YouTube and karpathy.ai
  • Strong programming assumed; maths helps
9.6Essential
Start free
2
Practical Deep Learning (Part 2) fast.ai · free Best Applied Depth

"Deep Learning Foundations to Stable Diffusion" - implement modern generative models from the ground up. Rigorous, free, and famously effective.

  • Build diffusion models from scratch
  • Top-down, code-first teaching
  • Free, no ads
  • Pairs well with the maths refreshers below
9.1Exceptional
Visit site
3
Stanford CS224N: NLP with Deep Learning Stanford · free lectures Best University Course

The canonical graduate course on language models. Lectures, slides, and assignments are public. Rigorous theory behind the systems you already use.

  • Transformers, attention, training dynamics
  • Full lecture videos on YouTube
  • Assignments and notes online
  • Maths-heavy - the real thing
8.9Excellent
View course
4
Deep Learning Specialization Andrew Ng, DeepLearning.AI · on Coursera Best Structured Foundation

If your fundamentals are shaky, this is the most trusted structured path through neural networks, optimization, and sequence models before you specialize.

  • Five-course structured sequence
  • Employer-recognized certificate
  • Free to audit; aid available
  • Best for filling theory gaps
8.6Excellent
Visit site Read review
5
Read the primary sources free papers & explainers Stay at the Frontier

Eventually the field moves faster than any course. Learn to read papers: start with the transformer paper and the best visual explainer ever written.

9.0Essential
Read the paper

Voices worth following

The people actually building and explaining the frontier - learn from them directly, not from secondhand hype.

Frequently asked questions

Do I need a maths background to go deep on AI?

It helps. Linear algebra, calculus, and probability are the working language of the field. You can start with Karpathy's Neural Networks: Zero to Hero with strong programming alone, but to read papers comfortably you will want the maths. 3Blue1Brown is the gentlest visual on-ramp to the intuition.

Should I build a model from scratch or use frameworks?

Both, in order. Build a tiny GPT from scratch once (Karpathy's course) so the abstractions are not magic, then use PyTorch and the Hugging Face stack for real work. Understanding the internals makes you far better at debugging and designing systems.

How do I keep up with the frontier?

Follow the researchers directly rather than secondhand hype. Read the primary papers (start with Attention Is All You Need), watch Karpathy and Yannic Kilcher break down architectures, and use Lex Fridman's long-form interviews to understand how the people building AI actually think.