Today’s editorial: Satya Nadella’s token capital vs human capital argument, why senior talent may become both an asset and an anchor, and what corporate America has to change in the AI age.
→ What Satya Nadella Means by Token Capital and Human Capital, and Why the Problem Runs Deeper
The funny thing about this age of AI is that there are barely any stars who graduated from old software into this new AI world.
This thought came to me when I was reading Satya Nadella’s recent post that, at the moment of writing, had been seen by 56 million people. It’s a crucially important post, but maybe not for what’s visible on the surface. Though even that is important. It has many layers, and they may be seen better from afar.
Satya talks about token capital and human capital. Token capital is the AI capability a firm builds and owns: its models, agents, traces, evals, workflow memory, internal loops. Human capital is the judgment, taste, relationships, context, and pattern recognition of its people. His whole argument is that the two should compound, that humans grow more valuable, not less, as the machines do.
And here you can see Satya the Warrior, who can crush your skull if he needs to (in a battle, of course), suddenly sit down on a stone and turn into The Thinker. He is at the edge.

Satya Nadella as a Warrior and as The Thinker
His company is enormous, dominant, the default choice for almost every enterprise on the planet. It has Azure, Office, GitHub, the OpenAI relationship, distribution that most companies can only dream about. And still, his core warriors, maybe even the whole company, are too stiff for the shape of this moment.
That opens up the current problem: human capital is really important, but seniority no longer translates cleanly into success. If you are a star at Microsoft, inside that particular corporate structure, you are limited by so many things. Incentives. Committees. Procurement logic. Internal kingdoms. The need to sound correct before you know what is true. In a sense, human capital becomes his burden, because it wastes token capital in the wrong way.
Who made the best current models? Bold, uncorporate people, both in the US and China. And if you think about Gemini, it’s Demis, who is Ender Wiggin, playing and fighting Google’s corporate game, and Jeff Dean, who, like Chuck Norris, just looks at the model until it starts converging. They succeed not because of Google but almost in spite of it. (If you read the book The Infinity Machine, you know how long it took to merge Google Brain and DeepMind, how much brain power it wasted).
Notice what these names have in common. They are not ladder creatures. They did not become useful because they mastered the internal language of quarterly planning. They are researchers, obsessives, game players, quants, builders. Liang Wenfeng ran a hedge fund before DeepSeek. Hassabis built games and studied the brain before he taught machines to play and imagine. They came in sideways. The corporate ladder did not make them; in many places it would have sanded them down.
That is the layer underneath Satya’s post. He says human and token capital compound, and he is right. But he is too polite, or too strategic, to say which human capital. Because the kind a big company manufactures best, the senior, reliable, politically fluent veteran who knows how to move a proposal through six committees, is almost exactly the kind that can waste token capital. That person optimizes the org chart, not the loss curve.
And the kind that actually moves the frontier, irreverent, allergic to permission, willing to be loudly wrong in public, is the kind a structure like Microsoft selects against and, eventually, pushes out.
So this is my question, for corporate America: under what conditions can they hold those people without domesticating or exhausting them, Or does the age of AI reward a shape that looks less like a company and more like a band, a lab, a small huddle of fanatics pointed at one problem with infinite compute behind them?
The answer is uncomfortable because Corporate America employs more than 70 million people inside large firms alone: firms with 500 or more employees account for roughly 54% of U.S. private-sector employment. The firms with the most human capital by the old scoreboard, the most veterans, the deepest benches, the proudest org charts, may be badly positioned, because every one of those assets is also a reason not to move.
And there is another thing that I didn’t see people talking about: senior human capital is often too slow, while junior human capital is not yet deep enough. So the firm gets trapped between slow knowledge and fast ignorance.
Satya knows this. That is why he is sitting on the stone. He is not admiring his human capital. He is staring at the possibility that his greatest asset and his heaviest anchor are the same people. And if that is true, the work of the next decade is not simply accumulating human capital or token capital. It is building a structure porous enough to let uncorporate people stay uncorporate while giving them corporate-scale compute, data, distribution, and responsibility.
To keep the Demises and the Jeff Deans from being either crushed or domesticated.
The companies that learn to be that porous will own the age. The ones that keep promoting their best ladder-climbers will keep wondering why their token capital runs in circles. It might be a beginning of change for the whole corporate world. The layoffs we see are part of that.
That’s what I want to leave you with: in the age of intelligence, the scarcest capital is neither the human nor the machine. It is the structure brave enough to hold both without flattening either.
If any of those thoughts resonate with you – share them across your social networks. Let’s keep the conversation going.
Follow us on 🎥 YouTube Twitter Hugging Face 🤗
We are reading / watching
Vivek's article is an excellent example of something written with heavy AI assistance and is an absolute must-read. how to be good at research
First Steps Toward Automated AI Research by Recursive Superintelligence
When AI builds itself by Anthropic
and our reaction to this development in RSI
Twitter Library
News from the usual suspects ™
OpenAI joined the AI IPO conveyor belt. On Jun 8, it confidentially filed for a US IPO, with a possible valuation target of up to $1 trillion and a debut that could come as early as September, though OpenAI says timing is not decided.
xAI and SpaceX turned AI capital formation into rocket theater. SpaceX’s IPO expanded to $85.7 billion after underwriters exercised the greenshoe option, setting a new public-market benchmark for AI-adjacent infrastructure plays. At the same time, xAI faced a lawsuit from a former engineer alleging retaliation after he raised Grok safety concerns.
Anthropic compressed an entire model-governance season into one week. On Jun 9, it released Claude Fable 5, a public Mythos-class model with high-risk cyber and bio safeguards, while keeping full Mythos 5 access for vetted Project Glasswing users. By Jun 13-15, the US government had pushed Anthropic to restrict or disable Fable/Mythos access for foreign nationals over national-security concerns, and more than 50 cyber leaders urged Washington to lift the curbs, arguing that defenders were being hurt too.
NVIDIA and Amazon also backed Neura Robotics in a raise of up to $1.4 billion, another loud signal that physical AI and humanoids are no longer side quests.
Amazon showed both sides of the agent economy. It secured a $17.5 billion loan facility as AI capex keeps ballooning, while a US appeals court weighed whether Perplexity’s Comet agent could violate anti-hacking law by navigating Amazon customer accounts and placing orders. That is the legal edge of agents: when software takes action, the line between user, agent, and platform gets very expensive.
Arcee AI moved deeper into the Hugging Face stack. The two companies announced a multi-million-dollar partnership making Hugging Face the home for Arcee’s public and private models, datasets, and agent traces, using Hugging Face Buckets as private storage. That’s strategically interesting: agent traces are becoming assets, and model hubs are becoming operational infrastructure.
Under the radar, workplace-agent research kept moving fast. WorkBench Revisited reported a jump from GPT-4’s 43% task-completion rate and 26% unintended harmful-action rate in 2024 to Claude Opus 4.8’s 89% completion and 2.5% harmful-action rate in June 2026. Agents are getting less slapstick. Not harmless, but less slapstick
Research highlight
Models
Cohere launched North Mini Code, its first open-source model for developers. It is a 30B-parameter MoE with 3B active parameters, Apache 2.0 weights, 256K context, and a focus on code generation, terminal work, code review, and agentic software-engineering workflows.
Embodied-R1.5: Evolving Physical Intelligence via Embodied Foundation Models – extends reasoning-style training ideas into embodied agents.
LabVLA: Grounding Vision-Language-Action Models in Scientific Laboratories – adapts embodied systems to laboratory environments.
Light-WAM: Efficient World Action Models with State-Fusion Action Decoding – makes world-action models substantially more efficient.
ARM: An AutoRegressive Large Multimodal Model with Unified Discrete Representations – unifies modalities through a shared discrete representation space.
Research
Trends we see looking at every paper related to AI and ML published last week:
Agent infrastructure
Environment engineering
Memory as architecture
World models as reasoning tools
Verification bottleneck
Self-improving agents
Autonomous research, self-improving agents, and agent infrastructure
Toward Generalist Autonomous Research via Hypothesis-Tree Refinement – structures research as an iterative hypothesis-building process that accumulates evidence and lessons across attempts.
EurekAgent: Agent Environment Engineering is All You Need For Autonomous Scientific Discovery – argues that better environments, tools, and feedback loops may matter more than better agents themselves.
Role-Agent: Bootstrapping LLM Agents via Dual-Role Evolution – trains agents by letting models alternately play the role of agent and environment.
EvoTrainer: Co-Evolving LLM Policies and Training Harnesses for Autonomous Agentic Reinforcement Learning – co-evolves the agent and its training process rather than optimizing only the policy.
🌟 HarnessBridge: Learnable Bidirectional Controller for LLM Agent Harness – turns the harness itself into a trainable component that mediates interactions between agents and environments.
🌟 Retrospective Harness Optimization – improves agents by learning from successful and unsuccessful trajectory histories.
EEVEE: Towards Test-time Prompt Learning in the Real World for Self-Improving Agents – adapts prompting strategies during deployment using experience gathered from previous runs.
🌟 Bayesian-Agent: Posterior-Guided Skill Evolution for LLM Agent Harnesses – guides skill evolution using uncertainty estimates instead of simple trial-and-error updates.
Deep research, search, and long-horizon reasoning
SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research – explores how agents can distribute research tasks across specialized sub-agents.
🌟 FORT-Searcher: Synthesizing Shortcut-Resistant Search Tasks for Training Deep Search Agents – creates harder search environments that discourage superficial retrieval strategies.
InterleaveThinker: Reinforcing Agentic Interleaved Generation – alternates reasoning and action more tightly to improve multi-step problem solving.
Memory, long context, and continual information management
🌟 MiniMax Sparse Attention – redesigns attention to make extremely long context windows practical.
🌟 FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention – predicts future retrieval needs before decoding begins.
End-to-End Context Compression at Scale – compresses long context into compact representations that remain useful for downstream reasoning.
Echo-Memory: A Controlled Study of Memory in Action World Models – investigates how memory contributes to action prediction and planning.
Attention Amnesia in Hybrid LLMs – identifies a surprising tradeoff where chain-of-thought tuning can damage long-range recall.
One Token per Multimodal Evidence – compresses multimodal memories into extremely compact latent representations.
World models, embodied intelligence, and spatial reasoning
🌟 Latent Spatial Memory for Video World Models – stores persistent spatial memory directly in latent space.
World Pilot: Steering Vision-Language-Action Models with World-Action Priors – injects predictive world knowledge into robotic action generation.
World Model Self-Distillation – trains world models to improve themselves through self-generated supervision.
MoVerse: Real-Time Video World Modeling with Panoramic Gaussian Scaffold – pushes world modeling toward real-time operation.
🌟 SpatialClaw: Rethinking Action Interface for Agentic Spatial Reasoning – replaces fixed action spaces with code-like interactions for spatial tasks.
Reinforcement learning, verification, and reasoning optimization
🌟 MaxProof: Scaling Mathematical Proof with Generative-Verifier RL and Population-Level Test-Time Scaling – combines generation, verification, and refinement loops for theorem proving.
Beyond Scalar Rewards by Internalizing Reasoning into Score Distributions – replaces single reward signals with richer evaluation distributions.
🌟Rethinking the Divergence Regularization in LLM RL – revisits one of the core stabilization mechanisms used in RL training.
N-GRPO: Embedding-Level Neighbor Mixing for Enhanced Policy Optimization – improves exploration by operating in representation space.
TRACE: A Unified Rollout Budget Allocation Framework for Efficient Agentic Reinforcement Learning – allocates compute dynamically across trajectories during training.
Demystifying Hidden-State Recurrence: Switchable Latent Reasoning with On-Policy Reinforcement Learning – explores recurrent latent reasoning as an alternative scaling direction.
Model architecture and scaling
🌟 Redesign Mixture-of-Experts Routers with Manifold Power Iteration – improves how MoE models decide which experts to activate.
On Subquadratic Architectures: From Applications to Principles – synthesizes emerging approaches for breaking quadratic scaling bottlenecks.
HYDRA-X: Native Unified Multimodal Models with Holistic Visual Tokenizers – proposes a more unified multimodal architecture.
That’s all for today. Thank you for reading! Please send this newsletter to colleagues if it can help them enhance their understanding of AI and stay ahead of the curve.
How did you like it?
FAQ
What does token capital mean?
Token capital is the AI capability a company builds and controls: models, agents, traces, evals, workflow memory, and internal loops.
What does human capital mean in AI?
Human capital means the judgment, taste, relationships, context, and pattern recognition people bring to work. In AI, the question is which kind of human capital helps machines compound instead of slowing them down.
Why is senior human capital a problem in AI?
Senior people often hold deep context, but they may also move slowly inside large organizations. The risk is that corporate veterans optimize process, politics, and safety while token capital needs speed, experimentation, and direct contact with the frontier.







