Blog radlak.com

…what’s there in the world

The AI Psychosis: Andrej Karpathy on How Agents are Rewriting Software Engineering and Research

The provided transcript features a deep-dive conversation with Andrej Karpathy on the profound paradigm shift currently happening in software engineering and artificial intelligence. Karpathy argues that the traditional concept of “writing code” is becoming obsolete, replaced by a reality where human developers act as directors expressing their will to swarms of highly capable, autonomous AI agents. This transition is accelerating digital transformation, reshaping the future of work, and changing how we approach AI research and education.

The Rise of AI Agents and “Macro Actions”

Karpathy describes experiencing a state of “AI psychosis” after realizing the massive unlock in individual capability provided by coding agents. He notes that he has barely typed a line of code manually since late 2023, instead delegating tasks to agents like Claude and Codex. Engineering has shifted from micro-managing code syntax to orchestrating “macro actions” across code repositories. The new bottleneck in productivity is no longer typing speed or access to compute, but a human’s “token throughput”—their skill in instructing, parallelizing, and managing multiple autonomous agents.

Real-World Agency: From “Dobby” to the Demise of Apps

Highlighting the power of these agents, Karpathy shares a personal project where he deployed a Claude agent (named “Dobby”) to autonomously discover and control his home network’s IoT devices (Sonos, HVAC, lights, security cameras). Dobby reverse-engineered local APIs and built a unified WhatsApp control center. This demonstrates a coming shift in software UX: bespoke, fragmented mobile apps will likely become obsolete, replaced by raw API endpoints seamlessly navigated by AI agents acting on our behalf.

Auto Research and Removing the Human Bottleneck

In the realm of AI research, Karpathy introduced “Auto Research,” an autonomous loop designed to optimize LLM training parameters. For tasks with clear, verifiable objective metrics, humans are currently holding the system back. By giving an agent a metric and boundaries, Karpathy’s system ran overnight and found hyperparameter improvements he had missed despite his decades of experience. He envisions a future where untrusted swarms of internet compute could collaboratively verify AI research, potentially matching or exceeding the output of centralized frontier labs.

The “Jaggedness” of Current AI

Despite these massive capabilities, Karpathy points out that current models exhibit a “jagged” intelligence. They function like a brilliant systems programmer crossed with a 10-year-old. Because they are optimized via reinforcement learning (RL) on verifiable tasks (like code compilation or math), they excel there but stumble on nuanced, “softer” tasks—evidenced by the fact that state-of-the-art models still tell the same rigid, repetitive jokes they did five years ago.

The Future of Work, Open Source, and Education

Looking ahead, Karpathy makes several key predictions:

  • Job Market: Due to the Jevons Paradox, as the cost of software creation plummets, the demand for it will skyrocket. He remains cautiously optimistic about the continued demand for software engineers, though their daily workflows will change drastically.
  • Open Source vs. Closed Labs: Open-source AI currently lags frontier labs by 6 to 8 months. Karpathy views this as a highly optimal and healthy dynamic that prevents dangerous centralization while still democratizing powerful capabilities for the vast majority of use cases.
  • Education: With projects like “Micro GPT,” Karpathy realized that humans no longer need to write exhaustive tutorials. Experts should focus on distilling concepts to their absolute simplest, purest code form (the “few bits” of true value), and rely on AI agents to personalize the explanation and teach the end user.

Mentoring question

How can you shift your daily workflow from being the manual ‘executor’ of tasks to the ‘director’ of autonomous agents, and what verifiable processes in your role could you completely automate today?

Source: https://youtube.com/watch?v=kwSVtQ7dziU&is=8E5xpYMCfVonl4DO


Posted

in

by

Tags: