WEEK 01 OF 12 · WHAT & WHY OF CUDA

The whole point of the box on your desk.

A 7-day, A-to-Z field guide to what CUDA is, why it matters, and why a $4,700 DGX Spark is a different animal from a similarly-priced Mac Studio with the same 128 GB of unified memory. Six days of mental models with small hands-on demos, then one project at the end where you fine-tune Llama 8B on a public US real estate dataset and ship an AI analyst running on your hardware. By Sunday night you'll know — to a depth most operators never reach — what you actually bought.

The bigger picture

This is Week 1 of a 12-week curriculum to make you a working master of NVIDIA's stack on the DGX Spark. Each week stacks on the last. By Week 12 you ship a real product on hardware you understand from the silicon up.

The 12-week roadmap

Each week ~7 days, ~1–2 hours/day. Today is the start.

WK 01 · NOW
What & why of CUDA
WK 02
Inference at scale — vLLM, TensorRT-LLM, batching
WK 03
Fine-tuning deep dive — LoRA, QLoRA, DPO
WK 04
Quantization & the FP4 revolution
WK 05
Local agents & tool use
WK 06
RAG done right — embeddings, vector DBs, rerankers
WK 07
Multi-modal — vision, voice, generation
WK 08
Real-time AI — streaming, voice agents, latency
WK 09
Custom CUDA via Triton — your first real kernel
WK 10
Production serving — Triton Inference Server, scaling
WK 11
Training your own architecture (optional deep cut)
WK 12
Capstone — ship a real product on the Spark

What you'll be able to answer by Sunday night

The seven days

DAY 01 · MON

What CUDA Actually Is

The 17-year origin story. The three layers (language, runtime, libraries). Demystifying a word everyone uses and few define. Hands-on: peek at the stack on your DGX.

DAY 02 · TUE

The Software Stack — The Real Moat

cuBLAS, cuDNN, cuTLASS, NCCL, Triton, FlashAttention, TensorRT, vLLM. Seventeen years of NVIDIA-paid PhDs that nobody else can replicate overnight.

DAY 03 · WED ⭐

DGX Spark vs Mac Studio

Same 128 GB, very different machines. Side-by-side on bandwidth, compute, FP4, software ecosystem. The single best lens for understanding why CUDA is the answer.

DAY 04 · THU

The Pretenders

AMD ROCm, Intel Gaudi, Google TPU, AWS Trainium, Huawei Ascend, Cerebras. Why each is interesting, where each falls short, and the one threat NVIDIA actually watches.

DAY 05 · FRI

What CUDA Lets You Actually Do

The use-case landscape — training, inference, custom kernels, scientific simulation, real-time graphics+AI. What's exclusive to the green stack and what isn't.

DAY 06 · SAT

The Future of CUDA & Your Bet

Blackwell to Rubin. NVL fabric. Sovereign AI. Where NVIDIA could lose. How to think about your own 3-year compute strategy without becoming a fanboy.

DAY 07 · SUN ⭐

Project — AI Real Estate Analyst

Hands-on capstone. Pull a public Realtor.com dataset, fine-tune Llama 3.1 8B on it with LoRA, deploy via Ollama. By dinner you have an AI that drafts listings and reasons about US housing.

What you need before Day 1

Hardware

Mindset

Start Day 1 → Jump to the Apple comparison Peek at the project