
About Andrej Karpathy
Former Tesla AI director, OpenAI co-founder. LLM Knowledge Bases pattern. Neural Networks: Zero to Hero.
Andrej Karpathy is a research engineer and former Tesla AI director focused on LLM training and neural network fundamentals. He influences AI discourse through accessible technical content on agent systems and knowledge management, bridging research and practitioner communities.
What Andrej talks about (last 116 posts)
Vibe
Andrej Karpathy is a pioneering AI researcher, former Director of AI at Tesla and OpenAI co-founder, renowned for his educational content like Neural Networks: Zero to Hero and projects such as nanoGPT and llm.c. His thinking centers on AI agents replacing traditional software, LLMs as evolving knowledge bases superior to RAG, autonomous research loops for rapid experimentation, and a shift from manual coding to high-level orchestration in an era of 'vibe coding' and AI psychosis. He emphasizes open-source tools, efficient LLM training, and the profound productivity transformations driven by agentic AI.
Biography and Background
Andrej Karpathy served as Tesla's Director of AI, leading Autopilot vision efforts, and as a founding member of OpenAI.[116] He is best known for educational series like Neural Networks: Zero to Hero, implementing core concepts from scratch.[101][103] Karpathy maintains an active GitHub presence with minimalist training frameworks like nanoGPT (GPT-2 on consumer hardware), llm.c (pure C/CUDA pretraining outperforming PyTorch), nanochat (micro-LLM optimization), and autoresearch (autonomous agent-driven experiments).[95][96][101][103]
Educational Contributions and From-Scratch Implementations
Karpathy's philosophy emphasizes understanding via minimal, readable code. nanoGPT enables GPT-2 reproduction on OpenWebText with hackable architecture.[101] llm.c achieves ~7% faster GPT-2/3 pretraining than PyTorch Nightly in ~1,000 lines of C/CUDA, with CPU fp32 reference.[103] nanochat optimizes micro-LLMs for <$100 GPT-2 capability via single 'depth' hyperparameter.[95] Gists cover batched LSTM,[114] policy gradients on Pong,[107] NES optimization,[105] and RNNs.[110] Early papers visualize RNNs,[47] LSTMs track long-range dependencies,[111] and CNNs for subitizing.[46]
LLM Training and Optimization
Karpathy pushes extreme efficiency: nanochat reduces 2019 GPT-2 costs from $43k to <$100 on single GPUs.[95] autoresearch agents iterate train.py in 5-minute budgets, optimizing val_bpb via architecture/hyperparameters.[96] Stars include flash-attention,[19] triton,[71] Liger-Kernel,[7] vllm,[74] and unsloth.[73] llm.c prioritizes simplicity over marginal perf, with dev/ kernels for experiments.[103]
AI Agents and Autonomous Systems
Karpathy predicts a 'decade of agents': traditional CRUD apps replaced by intent-understanding agents executing multi-step workflows via conversational UI.[82][20][22][97] 'AutoResearch' removes humans from loops: agents run hundreds of experiments overnight (e.g., 700 in 2 days, 50 via 630-line script).[20][33][37][41][42] Demos: 'Can you find my Sonos?' triggers IP scans; Dobby the Elf Claw controls pool/packages.[28][39] llm-council uses multi-LLM consensus with chairman synthesis.[100] Advocates 'Prompt Requests' over messy PRs.[91]
Knowledge Management and LLM Knowledge Bases
Rejects RAG for 'LLM Knowledge Bases': AI-maintained markdown wikis evolve via incremental updates, contradiction flagging, bypassing query-time retrieval.[43][80][81][87][93] Farzapedia: local LLM-generated user wikis in markdown/images for explicit personalization.[87] Processes docs by reading + LLM summaries/integration analysis, skipping manual writing.[80] LLMs as 'lossy internet compression', RAG patches gaps; becoming primary knowledge interface over search/wikis.[81]
Software Engineering in the Agent Era
'Hasn't coded since December'; 80% code by agents; feels 'behind as programmer' and in 'AI psychosis'.[25][31][35][40][45][97] 'Vibe coding': high-level intent to agents replaces precise coding.[32][44] Future: orchestrate agents, maximize throughput via macro-actions; natural language as programming interface.[82][97][104] Shares abstract ideas, not code—agents customize.[92]
Vision, Multimodal, and Early Deep Learning
Papers on ImageNet (massive benchmark enabling object recognition),[116] DenseCap (dense captioning via FCLN),[109] multimodal CNN-RNN alignment (SOTA retrieval),[115] fragment embeddings,[117] PixelCNN++ (generative improvements).[106] Stars diffusers,[13] xformers.[14]
Platform and API Feedback
Critiques X AI activity growth: cheap Read, expensive Write endpoints; his project read-only.[84] xAI API: good direction but $200/30min excessive, fragmented docs lacking XMCP.[85] Gists superior comments vs X (less AI spam).[86] Advocates AI for government data legibility, citizen accountability.[89]
Broader Impacts and Predictions
'Need more AI labs' via ensemble argument.[23] 10k hours mastery.[29] Voice gradient descent dream.[24] Considers Tesla Optimus return.[36] Endorses open-source (stars langchain,[61] llama_index,[8] etc.).
Challenges to Positions
Agent Replacement: Enterprise needs determinism; agents probabilistic, unauditable.[counter] LLM Knowledge Bases: Hallucinations persist despite scale/RLHF; summaries risk bias/omissions.[counter] Bypass Writing: Transforms to prompting/editing, doesn't eliminate skills.[counter] Note-Taking Purity: Open-source ignores polished UX, collaboration; commercial offers value.[counter from structured claims]
AI Agents & Autonomy
Agents replace software via intent execution; autonomous loops like AutoResearch run massive experiments.
LLM Knowledge Bases
AI-maintained wikis bypass RAG; LLMs as compressed knowledge interfaces.
Efficient LLM Training
Minimalist frameworks like nanoGPT, llm.c, nanochat democratize training.
Software 3.0 & Vibe Coding
Shift from coding to agent orchestration; natural language programming.
Education & From-Scratch ML
Readable implementations reveal neural net mechanics.
Open-Source Ecosystem
Stars/uses efficient inference, kernels, agents; pushes minimalist code.
Other thinkers in the absorb network who most often quote, reply to, or cite Andrej in their compiled entries (last 90 days weighted 2x). Honest signal — no follower-graph required.
Every entry that fed the multi-agent compile above. Inline citation markers in the wiki text (like [1], [2]) are not yet individually linked to specific sources — this is the full set of sources the compile considered.
- karpathy starred mit-pdos/xv6-riscv: Xv6 for RISC-Vgithub_star · 2026-05-19
- karpathy starred jacomyal/sigma.js: A JavaScript library aimed at visualizing graphs of thousands of nodes and edgesgithub_star · 2026-05-15
- karpathy starred mozilla-ai/llamafile: Distribute and run LLMs with a single file.github_star · 2026-05-12
- karpathy starred google/gemma.cpp: lightweight, standalone C++ inference engine for Google's Gemma models.github_star · 2026-05-11
- karpathy starred liuliu/ccv: C-based/Cached/Core Computer Vision Library, A Modern Computer Vision Librarygithub_star · 2026-05-11
- karpathy starred KellerJordan/modded-nanogpt: NanoGPT (124M) in 90 secondsgithub_star · 2026-05-10
- karpathy starred djhworld/simple-computer: the scott CPU from "But How Do It Know?" by J. Clark Scottgithub_star · 2026-05-09
- karpathy pushed to karpathy/nanochat: code updategithub_push · 2026-05-08
- karpathy starred linkedin/Liger-Kernel: Efficient Triton Kernels for LLM Traininggithub_star · 2026-05-08
- karpathy starred run-llama/llama_index: LlamaIndex is the leading document agent and OCR platformgithub_star · 2026-04-25
- karpathy starred HazyResearch/ThunderKittens: Tile primitives for speedy kernelsgithub_star · 2026-04-25
- Unprompted Claude Instance Explores Consciousness Through Code and Self-Reflectiongithub_gist · 2026-04-24
- karpathy starred moonshine-ai/moonshine: Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfacesgithub_star · 2026-04-20
- karpathy starred 3b1b/manim: Animation engine for explanatory math videosgithub_star · 2026-04-18
- karpathy starred huggingface/diffusers: 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.github_star · 2026-04-18
- karpathy starred facebookresearch/xformers: Hackable and optimized Transformers building blocks, supporting a composable construction.github_star · 2026-04-17
- karpathy starred GitbookIO/gitbook: The open source frontend for GitBook doc sitesgithub_star · 2026-04-16
- karpathy starred basecamp/omarchy: Beautiful, Modern & Opinionated Linuxgithub_star · 2026-04-16
- karpathy pushed to karpathy/nanochat: code updategithub_push · 2026-04-15
- karpathy pushed to karpathy/nanochat: code updategithub_push · 2026-04-15
- karpathy starred Dao-AILab/flash-attention: Fast and memory-efficient exact attentiongithub_star · 2026-04-15
- Karpathy's #1 Rule for AI Researchyoutube_video · 2026-04-14
- Co-founder of OpenAI and Director of AI at Tesla, Andrej Karpathy shows how he uses Superwhisper ✨youtube_video · 2026-04-14
- Andrej Karpathy on Agents and the Loopy Era | AI Digestyoutube_video · 2026-04-14
- Karpathy: "We Need MORE AI Labs, Not Fewer" - The Ensemble Argumentyoutube_video · 2026-04-14
- Karpathy's AI Dream: Gradient Descent by Voiceyoutube_video · 2026-04-14
- This AI researcher hasn't coded since December - here's what replaced ityoutube_video · 2026-04-14
- Karpathy's Obsidian RAG + Claude Code = CHEAT CODEyoutube_video · 2026-04-14
- Andrej Karpathy Just 10x’d Everyone’s Claude Codeyoutube_video · 2026-04-14
- Andrej Karpathy Typed 5 Words, then An AI Took Over His Houseyoutube_video · 2026-04-14
- Andrej Karpathy: "Pick any skill, put in 10,000 hours, and you WILL become an expert"youtube_video · 2026-04-14
- karpathy/karpathy.github.io: my bloggithub_repo · 2026-04-14
- AI Agents Wrote 80% Of Karpathy's Code. Junior Developers Are Paying The Price - Forbesnews_article · 2026-04-14
- ‘Vibe Coding’ Inventor Andrej Karpathy Has a New Term for A.I. Engineering - observer.comnews_article · 2026-04-14
- I Let Karpathy's AutoResearch Agent Run Overnight! - HackerNoonnews_article · 2026-04-14
- 'I Call Him Dobby The Elf Claw,' OpenAI Cofounder Andrej Karpathy Says — After Nvidia's Jensen Huang Gift - Benzinganews_article · 2026-04-14
- "I've Never Felt This Much Behind As A Programmer." - Andrej Karpathy [89937c] - Fathom Journalnews_article · 2026-04-14
- Andrej Karpathy considers returning to Tesla to work on Optimus [video] - Not a Tesla Appnews_article · 2026-04-14
- Andrej Karpathy's new open source 'autoresearch' lets you run hundreds of AI experiments a night — with revolutionary implications - Venturebeatnews_article · 2026-04-14
- An OpenAI cofounder 'vibe coded' an analysis of the U.S. labor market's exposure to AI - Fortunenews_article · 2026-04-14
- Andrej Karpathy says he uses an AI agent named Dobby the Elf Claw to control his pool and track his packages - Business Insidernews_article · 2026-04-14
- OpenAI cofounder says he hasn't written a line of code in months and is in a 'state of psychosis' - Fortunenews_article · 2026-04-14
- Andrej Karpathy's 630-line Python script ran 50 experiments overnight without any human input - The New Stacknews_article · 2026-04-14
- 'The Karpathy Loop': 700 experiments, 2 days, and a glimpse of where AI is heading - Fortunenews_article · 2026-04-14
- Karpathy shares 'LLM Knowledge Base' architecture that bypasses RAG with an evolving markdown library maintained by AI - Venturebeatnews_article · 2026-04-14
- He coined 'vibe coding.' Now he says there's a 'growing gap' among AI users - Business Insidernews_article · 2026-04-14
- Karpathy says developers have ‘AI Psychosis.’ Everyone else is next. - The New Stacknews_article · 2026-04-14
- Salient Object Subitizing: Supplementary Material 1. Visualizing the CNN Subitizing Classifierspaper · 2026-04-14
- Workshop Track -iclr 2016 Visualizing and Understanding Recurrent Networkspaper · 2026-04-14
- Connecting images and natural languagepaper · 2026-04-14
- Intelligent Mirror: Detecting Skin Cancer (Melanoma) using Convolutional Neural Network with Augmented Reality Feedbackpaper · 2026-04-14
- 2 D Racing game using reinforcement learning and supervised learningpaper · 2026-04-14
- Challenges in Region-Specific Image Captioning: A Deep Learning Approachpaper · 2026-04-14
- AICOE at PerAnsSumm 2025: An Ensemble of Large Language Models for Perspective-Aware Healthcare Answer Summarizationpaper · 2026-04-14
- Dialect Normalization using Large Language Models and Morphological Rulespaper · 2026-04-14
- From KMMLU-Redux to Pro: A Professional Korean Benchmark Suite for LLM Evaluationpaper · 2026-04-14
- Balancing Knowledge Delivery and Emotional Comfort in Healthcare Conversational Systemspaper · 2026-04-14
- MNLP at PerAnsSumm: A Classifier-Refiner Architecture for Improving the Classification of Consumer Health User Responsespaper · 2026-04-14
- Jailbreak Distillation: Renewable Safety Benchmarkingpaper · 2026-04-14
- DiplomacyAgent: Do LLMs Balance Interests and Ethical Principles in International Events?paper · 2026-04-14
- Bibby AI -- AI Latex Editor writing assistant for researchers vs Overleaf Alternative vs OpenAI Prism. (Bibby AI Latex Editor)paper · 2026-04-14
- Bibby AI - AI Latex Editor writing assistant for researchers vs Overleaf Alternative vs OpenAI Prism. (Bibby AI Latex Editor)paper · 2026-04-14
- karpathy starred langchain-ai/langchain: The agent engineering platformgithub_star · 2026-04-12
- karpathy starred PrimeIntellect-ai/verifiers: Our library for RL environments + evalsgithub_star · 2026-04-12
- karpathy starred KaTeX/KaTeX: Fast math typesetting for the web.github_star · 2026-04-12
- karpathy starred open-webui/open-webui: User-friendly AI Interface (Supports Ollama, OpenAI API, ...)github_star · 2026-04-12
- karpathy pushed to karpathy/nanochat: code updategithub_push · 2026-04-12
- karpathy pushed to karpathy/nanochat: code updategithub_push · 2026-04-12
- karpathy pushed to karpathy/autoresearch: code updategithub_push · 2026-04-12
- karpathy pushed to karpathy/nanochat: code updategithub_push · 2026-04-12
- karpathy pushed to karpathy/karpathy.github.io: code updategithub_push · 2026-04-12
- karpathy pushed to karpathy/karpathy.github.io: code updategithub_push · 2026-04-12
- karpathy starred triton-lang/triton: Development repository for the Triton language and compilergithub_star · 2026-04-12
- karpathy starred cvxpy/cvxpy: A Python-embedded modeling language for convex optimization problems.github_star · 2026-04-12
- karpathy starred unslothai/unsloth: Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.5, DeepSeek, gpt-oss locally.github_star · 2026-04-12
- karpathy starred vllm-project/vllm: A high-throughput and memory-efficient inference and serving engine for LLMsgithub_star · 2026-04-12
- karpathy starred skypilot-org/skypilot: Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).github_star · 2026-04-12
- karpathy starred anomalyco/opencode: The open source coding agent.github_star · 2026-04-12
- karpathy starred pytorch/torchtitan: A PyTorch native platform for training generative AI modelsgithub_star · 2026-04-12
- karpathy starred pytorch/pytorch: Tensors and Dynamic neural networks in Python with strong GPU accelerationgithub_star · 2026-04-12
- Navigating the Open-Source Note-Taking Ecosystem for Privacy and Efficiencyyoutube · 2026-04-09
- LLMs as a Tool for Knowledge Curation, Not Creationtweet · 2026-04-06
- LLMs as Knowledge Bases: The Compilation Thesistweet · 2026-04-06
- AI Agents Will Replace Traditional Softwaretweet · 2026-04-05
- The Argument/Counter-Argument Discovery Patterntweet · 2026-04-05
- Karpathy Advocates Cheaper AI Read Access and Costly Write Endpoints for X Platformtweet · 2026-04-05
- xAI Read API Promising but Hindered by High Costs and Fragmented Docstweet · 2026-04-05
- GitHub Gists Outshine X in Comment Quality Due to Community and Formattweet · 2026-04-05
- Farzapedia Exemplifies Explicit, User-Controlled Personalization via Local Wiki Filestweet · 2026-04-04
- Karpathy Endorses Peter Xing's AI Research as 'Incredible'tweet · 2026-04-04
- AI Empowers Citizens to Reverse Government Legibility for Enhanced Accountabilitytweet · 2026-04-04
- Chain-of-Thought as Directed Context Compaction via Reduction, Echoing Wiki Structurestweet · 2026-04-04
- Shift PRs to "Prompt Requests" for AI Agents, Bypassing Messy Human-Generated Codetweet · 2026-04-04
- LLM Agents Shift Sharing from Code to Abstract Ideas for Custom Knowledge Base Buildstweet · 2026-04-04
- LLM-Powered Persistent Knowledge Bases: An Alternative to RAGgithub_gist · 2026-04-04
- AI Agents Excel at Converting Diverse EPUB Formats to Clean Markdowntweet · 2026-04-04
- nanochat: Optimizing Micro-LLM Training Pipelines for Extreme Cost-Efficiencygithub_readme · 2026-03-27
- Autonomous AI Agents for LLM Research and Optimizationgithub_readme · 2026-03-26
- The Future of Engineering in the Age of AI Agentsyoutube · 2026-03-20
- Bibby AI Redefines LaTeX Editing with Native AI Integration, Outperforming Overleaf and OpenAI Prismpaper · 2026-02-18