Neural Networks Will Replace Hand-Written Software
The core thesis of Software 2.0: traditional software has humans write explicit rules, while in Software 2.0 neural networks learn rules automatically from data. Large swaths of software will be represented as weight files rather than source code.
Source: Software 2.0, Andrej Karpathy, Medium, November 2017, karpathy.medium.com/software-2-0-a64152b37c35
Intuition Before Formulas, Hands-On Before Explanation
Real understanding of deep learning comes from hands-on implementation, not abstract formula derivation. The best teaching path is to show students a working neural network first, then explain the mathematics behind it; code is the best teaching medium.
Source: The spelled-out intro to neural networks and backpropagation: building micrograd, Andrej Karpathy, YouTube, 2022 / Let's build GPT: from scratch, in code, spelled out, Andrej Karpathy, YouTube, January 2023
Simplicity Is the Highest Engineering Virtue
nanoGPT implements full GPT-2 training in under 300 lines; micrograd implements a backpropagation engine in about 150 lines. Minimal code is not a compromise but a precise grasp of essence. Complexity is the enemy of engineering; problems solvable simply should never introduce complexity.
Source: nanoGPT repository, github.com/karpathy/nanoGPT, 2022 / micrograd repository, github.com/karpathy/micrograd, 2020
Recursive Self-Improvement Is the Most Effective Learning Method
The best way to learn deep learning is to implement it from scratch — not by calling libraries but by writing every line yourself. Each reimplementation deepens understanding. Karpathy himself has reimplemented GPT and other models multiple times, discovering new insights each time.
Source: Andrej Karpathy Twitter/X posts on learning methodology, x.com/karpathy / Let's build GPT: from scratch, in code, spelled out, Andrej Karpathy, YouTube, January 2023
Vibe Coding: Natural Language as the New Programming Interface
In 2025 Karpathy coined Vibe Coding: programmers describe intent in natural language, AI generates code, and humans only need to 'vibe' whether the code feels right rather than auditing every line. This is Software 2.0 extended to developer tooling — programming itself is being rewritten by AI.
Source: Andrej Karpathy tweet introducing 'vibe coding', X (Twitter), February 2025, x.com/karpathy
Software 2.0 Substitution Thesis
Identify which software modules can be replaced by neural networks and prioritize data-driven approaches in those modules first.
Tesla FSD replaced rule-based perception systems with end-to-end neural networks, dramatically improving generalization.
System ArchitectureAI Product DesignEngineering DecisionsTechnology Selection
Minimal Viable Implementation
Implement core functionality with the least possible code, stripping all unnecessary abstractions until the code itself is the documentation.
nanoGPT implements complete GPT-2 training in ~300 lines of PyTorch, becoming the most widely cited GPT educational implementation globally.
Teaching DesignPrototype DevelopmentCode ReviewLearning Path
Data-Centric AI
In AI systems, data quality and scale often matter more than model architecture; the best model improvements come from better data.
Tesla Autopilot's core competitive advantage is not model architecture but the real-world driving data collected from millions of vehicles and its annotation system.
AI System DesignData EngineeringModel TrainingAutonomous Driving
End-to-End Learning Over Modular Pipelines
Let neural networks learn directly from raw input to final output, avoiding information loss from manually designed intermediate representations.
Tesla FSD v12 shifted to end-to-end neural networks, merging perception, planning, and control into a single network and discarding thousands of lines of hand-written code.
System DesignAutonomous DrivingAI ArchitectureEngineering Practice
Teaching Is Learning
Organizing knowledge into a form teachable to others is the most effective method for deepening one's own understanding.
While teaching CS231n at Stanford, Karpathy deepened his own understanding of CNNs through teaching; the course notes became one of the most widely cited deep learning educational resources globally.
Knowledge ManagementPersonal GrowthTechnical WritingOpen Source Contribution
Academic Research Phase
2011-2016
Stanford deep learning research and CS231n teaching
Completed doctoral research under Fei-Fei Li, focusing on image captioning and recurrent neural networks. Co-founded OpenAI (2015), taught CS231n, whose notes became a global benchmark for deep learning education.
Tesla Autonomous Driving Engineering Phase
2017-2022
Tesla Autopilot perception and FSD neural network architecture
As Tesla's Director of AI, led the transition of Autopilot perception from traditional computer vision to pure neural network architecture. Built Tesla's data engine and drove the end-to-end FSD architecture. His Tesla AI Day presentations became landmark public showcases of autonomous driving engineering.
OpenAI Return Phase
2022-2023
Large language models and GPT series research
Returned to OpenAI in 2022, participating in large language model research including GPT-4. Released nanoGPT the same year, implementing GPT training in minimal code, which became the most widely used GPT educational tool globally. Left OpenAI in 2023 to focus on independent AI education.
Independent AI Education Phase
2023-至今
AI education democratization and Eureka Labs founding
After leaving OpenAI in 2023, focused on YouTube educational series (millions of subscribers), releasing courses that implement AI models from scratch. Founded Eureka Labs in 2024 to build AI-native education platforms. Coined Vibe Coding (2025), influencing global discussions about AI-assisted programming.