Last updated: June 2026 · Reviewed by the AI for Zebras Team · Methodology · Disclosure

Why trust us? AI for Zebras scores every product against a published methodology. Our scores and rankings are never influenced by commissions. How we disclose.

AI for Experts: Go Deep on LLMs and Research

You already build with AI. Now understand it from first principles - how models are trained, why they behave the way they do, and where the frontier is heading. These are the best advanced courses, the talks worth your time, and the researchers to learn from directly.

Good for

ML engineers and AI researchers
Engineers building production model pipelines
People who want depth beyond typical tutorials

Not for

Beginners - start at Beginners path
Developers new to AI - try Software Engineers path first

Where to start: a two-step sequence

Watch these in order. The first gives you the visual architecture intuition. The second builds everything from scratch in code. Together they get you to the level where you can read papers and form your own opinions.

Step 1: how transformers work, visually

3Blue1Brown's "Transformers, how LLMs work" - the clearest visual explanation of attention and the transformer architecture. No code, pure intuition.

Step 2: build one from scratch

Andrej Karpathy's "Intro to Large Language Models" - the full engineering mental model, from tokens to RLHF. One hour that replaces three blog posts.

Some links on this page are affiliate links. If you click through and sign up or buy, we may earn a commission - at no extra cost to you. Full disclosure.

Best ways to go deep

Neural Networks: Zero to Hero Andrej Karpathy · free ★ Start Here

Build a GPT from scratch, line by line, in plain PyTorch. The single best way to make the abstractions stop being magic. Completely free.

From backprop to a working transformer
Code-along, no hand-waving
Free on YouTube and karpathy.ai
Strong programming assumed; maths helps

9.6Essential

Start free

Hugging Face NLP & Diffusion Courses Hugging Face · free Best Applied Depth

Free, rigorous courses covering transformers and diffusion models from the inside - built by the team that ships the libraries the whole industry uses.

Transformers, fine-tuning, and deployment from first principles
Dedicated diffusion models course (theory + code)
Free, no ads, always up to date
Pairs well with the maths refreshers below

9.1Exceptional

Try free

Stanford CS224N: NLP with Deep Learning Stanford · free lectures Best University Course

The canonical graduate course on language models. Lectures, slides, and assignments are public. Rigorous theory behind the systems you already use.

Transformers, attention, training dynamics
Full lecture videos on YouTube
Assignments and notes online
Maths-heavy - the real thing

8.9Excellent

View course

Deep Learning Specialization Andrew Ng, DeepLearning.AI · on Coursera Best Structured Foundation

If your fundamentals are shaky, this is the most trusted structured path through neural networks, optimization, and sequence models before you specialize.

Five-course structured sequence
Employer-recognized certificate
Free to audit; aid available
Best for filling theory gaps

8.6Excellent

See plans Read full review

Read the primary sources free papers & explainers Stay at the Frontier

Eventually the field moves faster than any course. Learn to read papers: start with the transformer paper and the best visual explainer ever written.

Attention Is All You Need - the transformer paper
The Illustrated Transformer - visual walkthrough
Hugging Face docs and model cards
Follow the researchers below

9.0Essential

Read the paper

Voices worth following

The people actually building and explaining the frontier - learn from them directly, not from secondhand hype.

Andrej Karpathy · build from scratch Nathan Lambert · open models + RLHF Ilya Sutskever · research direction Yannic Kilcher · paper breakdowns 3Blue1Brown · the maths, visually Lex Fridman · long-form interviews Two Minute Papers · latest research

📄

Guide

Context Management & RAG

RAG vs long-context windows - the decision framework for AI architects in 2026.

Read the guide →

⚙️

Guide

LLM Orchestration Frameworks

LangChain, LlamaIndex, CrewAI, Flowise - compared for building multi-step AI pipelines.

Compare frameworks →

👩‍💻

AI for Software Engineers

Less research, more shipping? The best hands-on courses for building with LLMs.

Open guide →

⚙️

Tools

Best MCP Servers

The Model Context Protocol servers worth wiring into your agents.

View comparison →

Frequently asked questions

Do I need a maths background to go deep on AI?

It helps. Linear algebra, calculus, and probability are the working language of the field. You can start with Karpathy's Neural Networks: Zero to Hero with strong programming alone, but to read papers comfortably you will want the maths. 3Blue1Brown is the gentlest visual on-ramp to the intuition.

Should I build a model from scratch or use frameworks?

Both, in order. Build a tiny GPT from scratch once (Karpathy's course) so the abstractions are not magic, then use PyTorch and the Hugging Face stack for real work. Understanding the internals makes you far better at debugging and designing systems.

How do I keep up with the frontier?

Follow the researchers directly rather than secondhand hype. Read the primary papers (start with Attention Is All You Need), watch Karpathy and Yannic Kilcher break down architectures, and use Lex Fridman's long-form interviews to understand how the people building AI actually think.

Your first 7 days at the frontier

This is not a beginner sequence. It is a sprint for engineers who want to move from using AI tools to understanding and extending them.

Day 1

Build something agentic

Implement a simple tool-calling agent using the Anthropic or OpenAI SDK. No frameworks. Give it one tool and one task. Run it. Understand exactly what the API is doing at each step.

Day 3

Wire up MCP end-to-end

Set up Claude Desktop, connect the GitHub MCP server to a real repository, and ask it to do something non-trivial. Then read the MCP spec. You will understand it far better with a working example in front of you.

Day 5

Benchmark a model for your use case

Define the task your system needs to perform. Build a minimal eval harness with 20-50 examples. Run Opus 4.6 and Sonnet 4.6 against it. The results will change how you think about model selection.

Day 7

Publish or ship something

Write up what you built and what you learned - a blog post, a GitHub repo, a tweet thread. Teaching is the fastest way to find the gaps in your own understanding.

→

Keep going

Journal: essays on where AI is heading

The journal covers the strategic, philosophical, and technical questions at the frontier - where things are headed and what it means for people building in this space.

Read the journal →