David's Saturday AI Thoughts

A weekly email for the AI-curious

Every week, David sends a short email about AI to people who are curious. No hype, no jargon, just what's actually useful.

Get the weekly email Listen to the audio edition

Artifacts created for the newsletter

How is AI used to create this newsletter? →
A quick explainer on Gen 1 vs Gen 2 AI apps →
Is AI making us dumb? We've had this conversation before →
An AI transformation framework for leaders: from what's true to what to do →
The AI Usage Spectrum: where are your people? →
AI Value Map: where's the value and how much are you capturing? →
AI Maturity Diagnostic: 20 questions to find where your organisation really sits →
The AI-Native Team: Director, Builder, Auditor. What ratio does your team need? →

Chat with David's Saturday AI Thoughts Experimental

There's a chatbot in the bottom-right corner. It can search across all ten editions, find themes, surface data points, and explain concepts from the newsletter. It's powered by Claude (Anthropic) and it's experimental: not David, not authoritative, and not a substitute for reading. Conversations are logged anonymously for quality improvement. No personal data is stored. If it's useful or not, let David know.

The newsletter: David's weekly emails, bits that didn't fit, and letters from readers

Each week: the email itself, the interesting things that didn't quite fit, and what readers said.

25th April 2026

Audio edition · AI voice, testing — feedback welcome

David's email this week

Rise of the auditors

What's on my mind

Note: This is a longer version of the essay than the one sent in the email.

I'm drowning in new AI tool announcements and sitting on a pile of work that's almost ready to send ... but not quite. Most are in the same place. I think I worked out why.

This was the week the agent floodgates opened. Microsoft made Copilot Agent Mode the default across Word, Excel and PowerPoint. Google shipped agents to 3.45 billion Chrome users. The UAE committed to running half its government on agents within two years. OpenAI, Anthropic and SpaceX all piled on.

The announcements are loud about capability. On checking, they are silent. Nobody has said who does it.

When nobody is named, three things happen at once. Senior staff end up checking, at senior prices, work that sits three rungs below them. The AI builders who could be at the frontier get pulled off it to re-check their own output. And a lot of the checking falls to people who are diligent but don't do the careful work required. Checking a mountain of mostly-right work is a special kind of task. Errors get through.

Checking used to be embedded in doing. An hour of work included verification, tightly woven in. Now it is a separate task. Perhaps five minutes producing, ten minutes checking.

Ajey Gore, a former CTO, wrote this month that when execution becomes free, verification becomes the expensive thing. Martin Fowler picked it up. Gore's formulation for software: ten engineers becomes three engineers and seven people defining acceptance criteria, designing test harnesses, monitoring outcomes.

The same logic holds for knowledge work. An hour to build a research project end-to-end, scope through to deck. A day to audit it properly. Multiply across a team and the required roles shift massively.

The ratio can run much steeper. A new tool, Aleera, promises to draft a full due-diligence pack in under an hour. The same thing would have taken a team weeks. Checking and refining it carefully might take a week. So the audit-to-build ratio is roughly forty-to-one.

The work still compresses. What was weeks of team output becomes one hour of build and a week of careful audit. Even at 40:1, the speedup is real. The Auditor seat is what lets you take it. Without that seat you ship the slow version, because you cannot trust the fast one enough to send it.

Three gaps open up. Each one points to a role.

Directors are drowning. The people who frame the problem and sign off the answer are now reviewing more drafts more quickly than they can judge properly.

The frontier is a full-time job. The people doing the best AI work spend all day trying to keep up. Even the best will tell you they now can't. I can't. So most people cannot afford the time to be anywhere near the forefront, and it would be bad economics to ask them to. Senior and client-facing staff are worth more on judgement and relationships than on model selection. The frontier belongs to a specialist because specialisation is cheaper than spreading frontier fluency thinly.

The checking is broken. Senior staff burning expensive hours on verification. Builders pulled off the frontier to re-check their own work. Diligent-looking staff skimming mostly right output and missing the rare, occasional error. Assurance has been treated as a side-task. It is a craft.

Two team shapes. Left: everyone uses AI, nobody specialises. Right: three roles, all use AI, one specialises. One shape that isn't working. One that does.

Three roles fall out.

The Director frames the problem, decides what good looks like, owns the outcome. Uses AI every day and gets real value from it. Not an expert in the tools, and shouldn't try to be. The frontier moves too fast for someone whose calendar is full of relationships and decisions. Their job is taste and judgement, not keeping up.

The AI Builder is the specialist. Lives in AI all day, tries the new models and apps and features as they launch, runs many sessions in parallel, helps many people, ships at pace. The one seat where staying at the frontier is the job. Defined by appetite, not rank. Could equally be a mid-career specialist or a sharp new hire who has fallen in love with the tools.

The Auditor is a different breed. Traces citations to source. Runs key numbers independently. Stress-tests arguments cold in a second model. Replaces weak sources, swaps assumptions, reruns with corrected inputs, refines paragraphs that are nearly right. Auditing and fixing, not just checking. Uses AI every day and uses it well. The skill is judgement and accountability, not model selection. Usually an experienced generalist with a nose for nonsense. Not necessarily someone who has done the underlying work themselves, though.

Picture it. An Auditor reads an AI Builder's draft financial model. She traces ten citations to source. Nine check out. One is from a less-credible source, so she finds a stronger one and swaps it in. She reruns the affected calculation, updates the report's content. Signs off, or doesn't.

Start with the AI Builder, because the Builder is the scarce seat. Staying at the frontier is a full-time job, and an organisation can only afford so many full-time jobs spent there. Each Director needs roughly a fifth of a Builder's time to keep their decisions well-supplied with AI output. Each Builder ships five substantial things a day, and each takes a day to audit and refine properly. So each Builder needs five Auditors to keep their output shippable.

At scale: five Directors, one Builder, five Auditors. Eleven people. Most organisations have zero Auditors.

I built a quick interactive version at steadman.ai/auditors. Try your own assumptions.

I have been living the shortfall. A research project I did has been in my queue for eleven days waiting for me to review it. A financial model I cannot send until I check every number. A client deck that's been almost ready for a week. Every piece built cleanly. All mostly right. Much of it ready to ship. None of it has, because the audit queue is longer than the build queue. Weekends have become my quiet, focussed audit time. Every senior AI user I know has hit the same wall.

Why the Auditor has to be human is a separate question from why the seat exists.

IBM trained its people on one line in 1979. A computer can never be held accountable, therefore a computer must never make a management decision. The line is more true now, not less. A machine can fail. Only a person can be accountable for the failure. The Auditor can run three models to check a fourth. The signature on the file has to belong to someone with a stake.

IBM internal training slide, 1979. A computer can never be held accountable. Therefore a computer must never make a management decision. Forty-seven years later, more true, not less.

The word auditor carries twenty years of the wrong connotations. The old auditor checked your work, often after you thought it was done. A brake. A second-guess. Compliance, proofing, QA. A joke many liked to make.

The new auditor checks and improves the machine's work, on your behalf. Not the person standing between you and shipping. The person who lets you ship at all. The shift is whose work is being checked, and why. Same care. Different client. One holds you back. The other lets you move.

Software engineering has started reorganising around three roles with the weight on auditing. Knowledge work has not done the thinking here yet. The organisations that do it first will have the only AI-native teams that ship fast and ship right. The rest will ship and regret it.

Three things worth knowing

1. Fewer than 10% of organisations have scaled AI agents beyond pilots.

McKinsey data names the bottleneck: organisations won't hand over control. Agents require delegated decision rights that most companies withhold, pre-agreed accountability frameworks that don't exist, and cross-functional governance that nobody has built. Pilots succeed in contained environments and stall the moment agents intersect real workflows where incentives and reporting lines conflict. Matches my experience of most organisations. It takes hard, careful work to resolve these issues. Those that have done the work are getting the benefits.

2. GitHub paused new Copilot signups. The flat-rate model broke.

GitHub paused new signups for Copilot's agentic plan after coding agents blew through the flat-rate compute allocation. Uber's CTO told journalists that AI coding tools have already consumed the company's entire 2026 AI budget. Goldman Sachs reports AI inference costs in engineering now approaching 10% of headcount cost, on a trajectory towards parity with salaries within several quarters. The pattern: evangelism, budget shock, rationalisation. The smart response isn't to slow down. It's to make sure the work being done is vallueable and to match the right model to the right task. Is flat-rate AI pricing over?

3. 29% of employees admit to sabotaging AI initiatives.

Writer's annual enterprise survey shows every organisational health metric worsened in 2026. Sabotage means what it sounds like: reverting to pre-AI workflows, deliberately not using assigned tools, discouraging colleagues from adopting, withholding inputs that would make AI systems work. "AI is tearing my company apart" rose from 42% to 54% of C-suite respondents. Employee confidence in their company's AI strategy dropped from 47% to 31%. Most C-suites concede their strategy is "more for show." The clearest counter-narrative yet to the adoption-is-accelerating consensus.

Eleven bits that didn't fit online →

Try this

Ask "is this the simplest version?" before accepting AI output

Language models have no incentive to simplify. Work is free to them. Bryan Cantrill calls it the loss of "laziness": the human impulse to find the crisp abstraction rather than add another layer. When AI drafts something, check whether it found the simplest solution or the first solution. A three-step process wrapped in seven steps of hedging is worse than the three steps alone. The scarce resource now isn't production. It's restraint.

Audit cold

When checking AI output, open a new chat with a different model. Upload the source materials and nothing else. Don't verify inside the same conversation that generated the work. That conversation will defend its own output. I've tested this repeatedly over the last month: the same model that produced a confident answer will find the errors when it reads the sources fresh, without its own prior reasoning in the context window. Sullivan and Cromwell filed hallucinated citations in a multibillion-dollar bankruptcy case. This would have stopped it.

Save one reusable AI workflow this week

Google shipped Skills in Chrome this week: saved one-click AI prompt workflows that run on whatever page you're viewing. Pick a task you do weekly. Summarising a website, extracting action items from meeting notes, comparing options across open tabs. Save it as a named Skill. Whether you use Chrome, Claude, or something else, the principle is the same: if you've done it three times, encode it. Each saved workflow compounds. Each unsaved one gets reinvented from scratch.

What readers said

Last week's "The proxy break" drew the strongest response yet. The essay about writing, identity and AI polish hit something personal. One reader described growing up treating correct English as a form of belonging, only to find AI stirring up the same anxieties about fitting in. They'd caught themselves search-and-replacing em dashes from their own writing to avoid being accused of using AI. Another faced a real dilemma: a respected consultant sent a clearly AI-generated proposal, and they couldn't work out how to say so. A professor offered the sharpest reframe: craft versus mass production. Temu at one end, Huntsman at the other. We probably need both, they said, but we need to proceed with care. Full reader reactions online →

A lighter week on LinkedIn from the community, but several posts cut to the heart of the essay. John Gleeson, who runs a customer success community and investment fund, met Marc Benioff this week and came away with one message: the unit of value in software is shifting from access to outcomes. "Service as software, not software as service." Nick Graham, founder of Vertemis, a research and analytics consultancy, frames the same shift for insights teams: stop shipping decks, start shipping decisions. And Dylan Jones, co-founder of Bold Square, a communications and marketing advisory, notes that Zuckerberg is building an AI agent to help him be CEO, but the real story is the internal culture of employees sharing tools they've built. "Your job as a leadership team is mostly not to get in the way." More online →

See the extras for this week ↓

The bits that didn't fit

"Workslop": 92% of executives say AI makes them productive. 40% of workers say it saves no time at all.

The Guardian coined the term for AI output that looks polished but needs heavy correction. A survey of 5,000 US white-collar workers shows the perception gap between the people generating AI output and the people downstream checking it. Drafting gets faster. Rewriting and arguing gets slower. The auditor problem, applied to every desk.

Source →

AI adoption is 4x higher among top earners.

New York Federal Reserve data: AI workplace adoption runs from 15.9% for workers earning under $50,000 to 66.3% for those over $200,000. No college degree: 15.9%. College degree: 39%. AI cannot reduce inequality if this is what the adoption margin looks like.

Source →

Dead startups are selling their Slack and email data to train AI agents.

Forbes reports AI labs are paying hundreds of thousands of dollars for email, Slack, and Jira threads from companies that no longer exist. The data feeds "reinforcement learning gyms": simulated work environments where agents learn to behave like real knowledge workers. Employees never consented to their internal communications becoming training data.

Source →

Dario Amodei: "AI can only diffuse at the speed of trust."

In a profile interview, the Anthropic CEO takes a pro-democratic-government stance. The Pentagon classified Anthropic as a "supply chain risk" after Anthropic objected to certain military uses. A Pentagon official publicly called Amodei "a liar." Separately, Amodei believes open-source models will replicate current frontier capabilities within 6-12 months.

Source →

Gallup: manager support is the single biggest predictor of AI transformation.

Fewer than one in three employees report their manager actively supporting AI adoption. Gallup's data says that's the binding constraint, not tools, not training, not budget. Organisations investing in AI without first enabling the management layer are wasting most of the spend.

Source →

Aaron Levie: AI best practices go obsolete every quarter.

The Box CEO argues that system architectures are becoming obsolete on a quarterly cycle. Workarounds for context window limits are now unnecessary. RAG, GraphRAG, multi-agent orchestration, ReAct frameworks: entire categories of infrastructure were built for a world that no longer exists. Paul Graham reposted the thread.

Source →

Salesforce goes headless. "The API is the UI."

Marc Benioff announced the entire Salesforce, Agentforce, and Slack platform is now exposed as APIs, MCP, and CLI. Levie's framing: agents will use software 100x more than people. Per-seat pricing breaks when the primary user isn't a person.

Source →

Seven in ten Americans now think AI will hurt job opportunities.

The Economist reports a 14-percentage-point rise in a single year. AI has shifted from a technocratic to a political battleground. The window for technocratic AI governance is closing.

Source →

The Spectator coins "arm farms": workers training their robot replacements.

Gary Dexter describes facilities where chefs, nurses, and plumbers wear GoPro helmets and motion-capture rigs while doing their normal jobs. The purpose: generating training data for the robots that will eventually replace them. Knowledge workers writing documents that train language models are arguably on an arm farm already.

Source →

Mollick: "everything around me is somebody's life work" is no longer true.

Ethan Mollick riffs on a meme about the invisible human effort behind ordinary objects. An annotated lamp: an engineer working late on a curve, years of supplier negotiations, months of tip-over testing, someone getting fired over a cord switch. AI disrupts the assumption that every designed thing carries accumulated human stakes.

Source →

$930 billion in data centre capex in six years dwarfs every US megaproject.

Fin Moorhouse charted hyperscaler capital expenditure against historic megaprojects in inflation-adjusted dollars. Data centres: $930 billion in 6 years. The Interstate Highway System: $620 billion over 37. Railroads: $550 billion over 71. Apollo: $257 billion over 14. As a share of GDP, the railroads were bigger at their peak. But the railroads also produced spectacular capital misallocation.

Source →

See what readers said ↓

Letters from readers

What readers said about Edition 9: "The proxy break"

What resonated

Writing as identity and belonging. The essay unlocked deeply personal stories. One reader described growing up treating correct English as a way of fitting into British culture, only to find AI stirring up the same anxieties about exclusion. Several others shared their own complicated relationships with writing and correctness.
The missing language for AI feedback. Multiple readers described the same awkward situation: receiving clearly AI-generated work from someone they respect and not knowing how to say so. The vocabulary for constructive feedback on AI-assisted work doesn't exist yet. You can say the work is confusing, but saying "check your AI" feels different.
Craft versus mass production. The most quoted reframe. One reader mapped it to clothing: Temu at one end churning out mass-produced garments, Huntsman hand-cutting the finest suiting at the other. AI enables mass production of ideas. We probably need both ends of the spectrum, but we need to proceed with care.
Time-spent as the new proxy. If polish no longer signals effort, does telling someone how long something took? One reader asked: is duration the replacement indicator for "thinking happened here, even if AI was involved"?
The thinking is in the reading. Several exchanges converged on the same point: the cognitive work isn't in the prompting. It's in catching what the AI gets wrong, knowing it's wrong, and fixing it. If you accept the first output, you've handed the thinking over.

Points readers raised

Language as belonging, language as defence

A reader shared one of the most striking responses the newsletter has received. Growing up, they treated "correct" English as a way to belong to British culture. The obsession turned into a form of self-defence: be crisper and more correct to bat people and insecurities away. AI has stirred it all up again. They've caught themselves search-and-replacing em dashes from their own writing so colleagues don't accuse them of using AI. "Something about this revolution is forcing us to confront our own prejudices," they wrote. "And forcing me to reconfront mine."

The feedback gap for AI-sloppy work

A reader received a proposal from a consultant they use and respect. Clearly AI-generated and sloppy. They questioned how to indicate both that the work was sloppy and that the consultant should use AI better. "The reaction to AI rests in the extremes," they observed. "It is either nothing (they couldn't tell) or a flat out 'this is slop.' It has yet to develop that important middle ground for constructive feedback." A problem many readers will recognise.

Don't use AI as the sticking plaster for perfection

A reader who described themselves as someone who loves writing and loves words pushed back on the idea of one "correct" way. They shared a story about painting with their children at the weekend: the children kept trying to copy their drawing. They had to encourage them to draw their own feelings, how the wind felt, what they remembered. AI would have made the picture look excellent but would have missed the beauty and messiness of how they all felt. "Remove the fear of getting it wrong," they wrote. "Don't use AI as the sticking plaster to ensure perfection."

Where will thinking-quality create value?

A reader at a professional services firm mapped out four scenarios for where depth of thinking still wins. Value investors who don't need to convince anyone: they hold the key to action by deploying capital on the back of their own analysis. Strategy consultants who need to convince a board: harder, because clients may fact-check recommendations with AI, re-entering the sophisticated noise. Transaction due diligence: AI creates the document, another AI probes it, and eventually the consultant sells the algorithm. And across all industries: management teams flooded with great-sounding but potentially hollow analysis, needing either extreme specialisation or a trusted human advisor to navigate. "If the quality of thinking remains so important," they concluded, "then we should focus a lot on teaching people how to think clearly rather than 'what's the standard.'"

AI was reinforced for corp-speak

A reader who works on AI-based creative tools made a precise technical point. Language models weren't just trained on corporate writing. They were reinforced for it. That's the mechanism. Doubt and ambiguity are optimisation penalties in the training process. The model was rewarded for sounding certain and smooth. Their advice: "Let your humanity show. Embrace the doubts, the ambiguities. Showcase evidence that works against your own premise. Make it bumpy on purpose."

Writing is the process of not understanding

A reader quoted a line that captures the essay's central tension: "Writing is the process by which you realise that you do not understand what you are talking about." If AI does the writing, where does the realising happen? They asked whether it's in structuring and iterating on the prompt, or in the back-and-forth editing using the CEO principle. A question the essay raised but deliberately left open.

Links readers shared

The return of oral culture (Lindy Newsletter). A reader asked: does AI-written slop push people towards the spoken word? Podcasts at near-zero distribution cost as a response to written noise.
The disappointing feeling when you realise something was AI-generated (Medium). Shared by a reader who coined the phrase "Human It The Loop" as an alternative to "human in the loop."

See what the community is posting ↓

Community voice

What engaged readers are posting on LinkedIn this week.

"Service as software, not software as service"

John Gleeson, who runs a customer success community and investment fund, met Marc Benioff this week. Every sentence came back to outcome-based pricing: the unit of value shifting from access (seats, licences, subscriptions) to outcomes (revenue recovered, deals closed, problems solved). Delivered autonomously by agents, priced on results, sold by the product itself. "If you can get that virtuous cycle, that is a home run." When the person who built the go-to-market motion every B2B company runs on tells you it's over, it's probably worth paying attention.

Read on LinkedIn →

"Ship decisions, not decks"

Nick Graham, founder of Vertemis, a research and analytics consultancy, argues that insights teams need to stop defining themselves by what they produce and start defining themselves by the business outcomes they unlock. From function to capability. From reporting to activating. From insights as output to decisions as output. "An insight is only an ingredient. The real value is the idea, choice or action it enables."

Read on LinkedIn →

"Your job is mostly not to get in the way"

Dylan Jones, co-founder of Bold Square, a communications and marketing advisory, picks up on Zuckerberg building an AI agent to help him be CEO. But the more interesting detail is Meta's internal message board where employees share AI tools they've built. "That feeling comes from individuals seeing their friends try new things, maybe get recognised for it, and excited conversations over the water cooler. It builds on itself rather than coming out of Project Best Bot."

Read on LinkedIn →

"Non-deterministic systems need determined outcomes"

John Gleeson again, this time on why Customer Success only exists because something is broken. AI is collapsing the three gaps CS was built to fill: product complexity, customer capability, and value alignment. But as those gaps close, new ones open. AI systems are non-deterministic, and the work required to ensure a successful outcome has gone up, not down. "That's where CS goes. Not away. There." The auditor argument, applied to post-sales.

Read on LinkedIn →

"How do you squeeze wide innovation through a narrow algorithm?"

Nadim Sadek, founder and CEO of Shimmr, an AI creativity company, returned from the Bologna Book Fair with one question he can't shake: asked by the Director of the Polish Book Institute during a conversation about AI and emancipated expression. The colours, the covers, the people, the ideas, and one number so large it reframes everything about where publishing and AI now stand together. His full dispatch from the fair is worth reading.

Read on LinkedIn →

Read the email ↑

18th April 2026

Audio edition · AI voice, testing — feedback welcome

David's email this week

What's on my mind

The proxy break

I've never been good at writing. Numbers and logic were my passion. Words have always been hard for me. In book publishing, executives used to reply to my emails with notes on my commas, ignoring my arguments. Not everyone judged me for it. But some always did, and I knew it.

My excuse came from Zhuangzi, a Chinese philosopher from the 4th century BCE: "Words exist because of meaning; once you've got the meaning, you can forget the words." Get over yourselves, I'd think. Look at the meaning.

Using AI is getting me in trouble in a new way. A friend of thirty years messaged me last weekend. The newsletter reads like AI wrote it, he said. And with the pace of change, at some point he won't be able to tell. Do a whole edition about it, he said. Good idea. It's a topic I know most of you are wrestling with too, when you use AI to help you write and when you receive content from colleagues you know have done the same.

I probed. His anxiety wasn't about sentence structure. It was about whether I'd done the thinking.

The conventional story: good wording was taken as a proxy for good thinking for centuries, but AI broke it. Half right. AI broke it. But it was never reliable. Good writers have always dressed up thin ideas in beautiful prose. Good writing was never a proxy for originality, either. We've all heard smart people recite theories we recognise from The Economist or that surely came from their MBA professor.

The natural response to a broken instrument is to flip it. If polish no longer signals thinking, "sounds like AI" must signal no thinking. Most have already made the move. They're missing some great ideas.

I saw an email last week signed off "NOT WRITTEN BY AI" in capital letters. Same error as the publishing executives I worked with, pointed the other way. They took poor commas for poor thinking. Those capitals ask you to take 'not AI' for good thinking. Both confuse the surface for what's underneath.

A professor friend calls the new cadence "AI-ambic pentameter." Student pitches sound identical not in words but in rhythm. He distrusts the rhythm regardless of whether the student thought.

The new proxy is as unreliable as the old one. "Sounds like AI" might mean unchecked slop. It might mean someone who did the thinking, used AI to express it, and checked every sentence. You have to evaluate thinking, not wording.

My friend was right. Soon you won't be able to tell. The UK AI Security Institute assesses that frontier model capabilities are doubling every four months. A year ago it was eight. Feed a model your own work and it writes more like you with less editing each time. The gap the reader used to rely on is closing faster, not slower.

Get ready for preferring AI writing over even the best human writing. A study published at CHI 2026, the main human-computer interaction conference, pitted 28 MFA writers against three language models emulating 50 award-winning authors. With standard prompting, trained experts preferred the human writing 83% of the time. Fine-tune the model on each author's complete works and the preference flipped. Experts picked AI writing 62% of the time. The researchers interviewed the MFA judges afterwards. Several described an identity crisis.

I'd suggest two tests.

Quality. Is the argument any good? Not "is the prose good" but "does it hold under pressure." A client unbundled this explicitly last week, sending work back to a colleague with "the content is right, fix the words." They judged the argument first. Most don't.

Ownership. Did the person do the thinking, did they check every claim, and will they stand behind it? Our CEO principle, that you should Check, Edit, and Own any AI output, was built for this. Google PM interviews have moved here too. Candidates build a working prototype in 45 minutes while someone watches. Zapier's hiring rubric codifies the principle: a rough result with strong reasoning beats a polished one with no visible process.

Deloitte shows what failing these tests looks like, twice in two months. An AU$440,000 report for the Australian government, fabricated citations. A CA$1.6m report for a Canadian province, the same. Their response to the Canadian case: they "firmly stand behind the recommendations." Fake sources, real advice. They stand by Quality, but without Ownership.

Use AI for breadth. Depth is ours.

You'll never know how much of what a person writes came from them. See that as freeing. Zhuangzi wanted readers who could forget the words once they had the meaning. AI frees us to do it. My friend asked if I wrote this. It was the wrong question. The right one is whether I did the thinking.

Three things worth knowing

1. AI cover letters killed the signal that cover letters used to carry.

When Freelancer.com added an option to generate cover letters with AI, researchers tracked what happened. Before language models, there was a clear positive slope: better cover letters predicted better hiring outcomes. After the feature launched, the line went flat. Once polish became free, it stopped measuring anything useful. Economists call this signal destruction. It's Goodhart's Law: when a measure becomes trivially easy to game, it ceases to be a measure. Cover letters aren't the last signal to fall. The same logic applies wherever AI can cheaply replicate a previously costly quality indicator.

2. Snap cut 1,000 jobs. AI already writes 65% of their new code.

Snap laid off 1,000 employees, 16% of its full-time workforce, and closed 300 open roles. AI agents already generate over 65% of Snap's new code. Expected savings: over $500 million annualised. Way beyond a pilot or an aspiration. Evan Spiegel is betting on smaller, highly focused teams with expanded AI agent capabilities. For leaders still framing AI as a productivity tool that supplements existing teams, Snap is a data point that the substitution model has arrived.

3. Letting AI do your work erodes your confidence. Pushing back strengthens it.

A study of nearly 2,000 working adults found that people who accepted AI answers without much modification reported lower confidence in their own reasoning and weaker ownership over their ideas. People who pushed back, editing, questioning, and rejecting AI suggestions, reported greater confidence and stronger ownership. The key variable wasn't which tool they used. It was how actively they engaged with it. Passive delegation erodes judgement. Active collaboration strengthens it.

Gartner's data tells a similar story from a different angle. Of 5.4 hours saved by AI per week, just 0.6 go to reducing hours worked. The rest gets absorbed into more work, much of it without improving outcomes.

Gartner: How time savings from AI are used. Of 5.4 hours saved, 1.7 go to additional work that improves team outcomes, 1.4 to additional work without improving outcomes, 0.8 to redoing AI work, 0.8 to developing new skills, and just 0.6 to reducing hours worked.

Fifteen bits that didn't fit online →

Try this

When helping someone find AI use cases, ask what keeps them awake, not how AI can help.

One produces a polite, dull list. One produces a bold list of creative use cases. The difference is the question being asked. "How can AI help?" invites safe answers. "What keeps you awake at night?" surfaces real problems that happen to have AI solutions. Concrete, named, immediately applicable.

Let the model research you before you write its instructions.

I sat with a senior leader this week who is getting great results from AI despite not having filled in his custom instructions. Don't make this mistake. It helps every single response to be better for you. Rather than manually writing them, start a new chat and type: "I am [your name], I work at [your organisation]. Search the web to learn more about me, and write me a set of instructions I can save that would cause you to work well with me in future." Revisit it regularly. Most people write something generic and never look back.

Find out where your AI value actually sits.

Answer our five questions about where you think AI value is for your team or your organisation, then five about how much of that value you've captured so far. You get a chart showing you where the opportunities are. It's where I think most firms are: capturing about 16% of the value they can see, with almost everything still on the table. Everything stays completely confidential on your machine, but I'd love to see what you come up with if you're willing to share.

AI Value Map showing 16% overall capture. Individual Productivity at 50%, Team Standards at 20%, Process Integration at 10%, Role Transformation at 2%, New Revenue at 0%. Most value is unrealised.

What readers said

Last week's "What a day can do" drew 14 thoughtful replies. The dominant thread: the shift from training individuals to building shared team tools. One reader pushed it furthest: forget teaching people to use AI, use your precious time with experts to teach AI to do the work they want done. Full reader reactions online →

The thread amongst readers on LinkedIn this week is the gap between polished output and genuine thinking. Brett Danaher, a professor of economics and analytics at Chapman University, coined a phrase for the sameness creeping into every pitch deck: "AI-ambic pentameter." Helen Field, a transformation leader at L.E.K. Consulting, a strategy consulting firm, asks the question that's been following me all week: delegate tasks, not responsibility. Nick Graham, founder of Vertemis, a research and analytics consultancy, argues that insights teams are still shipping decks when they should be shipping decisions. And Pavi Gupta, a market research leader writing the Infinity Growth Loop series, warns that the same tools making research easier are generating what he calls "insights slop." More online →

See the extras for this week ↓

The bits that didn't fit

Allbirds pivoted to GPU leasing. Stock up 700% in a day.

Allbirds, the sustainable shoe brand that closed all US stores in February, rebranded as NewBird AI: a GPU compute leasing platform. Market cap jumped sevenfold in a single session. A shoe company became an AI infrastructure company in two months. The demand signal is real even if the pivot is absurd.

CNBC →

Satya Nadella's Copilot demo didn't work when someone else tried it.

Satya Nadella posted a demo of Copilot editing Word documents with tracked changes. An investor replicated the exact workflow. Copilot produced a redlined version, but only inside the chat sidebar. The actual document was untouched. When the product is the flagship AI feature of the world's largest software company, the credibility cost is high.

Nadella's post →

France is quietly building serious AI agent infrastructure.

The French government has launched an official MCP server for data.gouv.fr, letting AI systems interact more directly with public datasets. Separately, an open-source project called Paperasse has shown how agent skills can be packaged for real-world French tax and accounting work. Some coverage blended the two into one story. That misses the more interesting point: the state is building infrastructure, and independent developers are building usable workflows on top of it. Useful agent systems will come less from demos, and more from good infrastructure paired with narrow, practical skills.

data.gouv.fr MCP → Paperasse →

Over half the internet is now AI-generated.

Research from Graphite: beginning in January 2025, over 50% of newly published online content was generated by AI. This has immediate implications for anyone training models on web data: the training corpus is now majority-synthetic. Several frontier labs have responded by pursuing proprietary data licensing deals.

Graphite →

Nvidia bottled 30 years of expertise so juniors stop interrupting seniors.

Nvidia's Chief Scientist Bill Dally told Jeff Dean that Nvidia trained a language model on its entire proprietary document archive, covering over 30 years of chip design knowledge. Junior employees query the model instead of interrupting senior designers. Institutional knowledge, bottled up and made searchable.

GTC 2026 →

AI transparency went backwards in 2025.

After rising on the Foundation Model Transparency Index from 37 to 58 between 2023 and 2024, the average score dropped to 40 in 2025. Over 90% of notable models were released without training code. The most capable modern models are now among the least transparent.

Stanford HAI →

The 50-point gap: AI experts and the public disagree on nearly everything.

On jobs, 73% of AI experts say AI will have a positive impact versus 23% of the public. On the economy: 69% vs 21%. On medical care: 84% vs 44%. They only converge on what AI will damage: elections and personal relationships. This is a wider gap than most technology debates produce.

Stanford HAI →

Computer science enrolment fell 11% but AI masters degrees surged 82%.

Undergraduate computer science enrolment at US universities dropped 11% between 2024 and 2025, apparently a response to automation concerns. But AI software-related masters degrees grew 82% between 2022 and 2024. Students are pivoting, not leaving. Two-thirds of AI software masters graduates are non-US residents, a pipeline under pressure from visa policy changes.

Stanford HAI →

Goldman Sachs: AI inference costs approaching headcount parity.

A Goldman Sachs equity research note reports that companies are overrunning their AI inference budgets by orders of magnitude. In engineering, inference costs are now approaching 10% of headcount cost and on current trajectories could reach parity within several quarters. The machines aren't replacing headcount costs. They're adding a new cost layer.

Capital AI Daily →

Consumer surplus of $172 billion, but producers capture almost none.

US consumer surplus from generative AI reached $172 billion annually by early 2026, up 54% from a year earlier. This dwarfs actual AI company revenues, consistent with historical research showing innovators capture only about 3% of total social returns. Most of these tools remain free or nearly free to use.

Stanford HAI →

Anthropic's design launch hits Figma hardest.

Anthropic's design product launched, turning a rumour that had already wiped billions off the sector into a real competitive threat. The sharpest pressure falls on Figma, not just because Claude Design moves closer to its core job, but because the conflict is now explicit: Mike Krieger, Anthropic's Chief Product Officer and Instagram co-founder, stepped down from Figma's board as Anthropic prepared to enter the category. Adobe may feel some of that pressure too, but companies like Wix and GoDaddy sit in a more mixed position: Anthropic could compete with parts of their "make it easier" story while also creating more demand for sites and publishing tools that AI-generated design still needs in order to go live.

TechCrunch → Sherwood →

Google shipped AI agents to 3.45 billion people via a Chrome update.

Google launched "Skills" in Chrome: save any AI prompt as a reusable one-click workflow, then run it on whatever page you're viewing. The distribution play is the story: Chrome has 3.45 billion users. Every saved Skill becomes a switching cost. And the aggregate data on which Skills people save gives Google a continuous product research signal about which workflows people most want automated.

TechCrunch →

Gallup: half of US workers now use AI at work, but leaders use it 1.5x more.

Gallup surveyed 23,717 employees: 50% of US workers now use AI at work, up from 21% in 2023. But leaders use AI daily or weekly at 67%, versus 46% for individual contributors. This inverts the usual adoption pattern: the people setting the strategy are further along than the people executing it. The 27% who report "large or very large disruption" is a canary: a quarter of the workforce says AI is already reshaping their work in ways that feel significant.

The New York Fed's breakdown shows just how steep the gradient is: adoption rises from 15.9% for workers earning under $50,000 to 66.3% for those earning over $200,000.

Federal Reserve Bank of New York: AI use in the workplace is concentrated among higher-income, higher-educated, and full-time workers. Adoption rises from 15.9% for workers earning under $50K to 66.3% for those earning over $200K.

Gallup →

KPMG: companies invest 2x more in tech than in training, and 46% report burnout.

The KPMG Adaptability Index found executives are nearly twice as likely to increase tech spending as to invest in employee training. Fewer than 10% made workforce training a primary objective despite 57% citing efficiency as a priority. The result: 46% report burnout and change fatigue as unintended consequences of transformation. Only 9% invested in psychological safety. You can't simultaneously demand more adaptability, make workforces smaller, and invest nothing in the people.

Fortune →

Apple is linking AI token usage to headcount decisions.

An Apple insider reports that when directors ask for headcount backfill, senior leadership now asks what the team's AI usage looks like. If token usage is low, the answer is increasingly: go figure out how to get more leverage out of AI first. AI usage is becoming a proxy for operational efficiency.

See this week's community voice ↓

Community voice

What readers who've engaged with this newsletter have been posting on LinkedIn this week. The common thread: polished output versus genuine thinking.

Brett Danaher, a professor of economics and analytics at Chapman University, can't unhear something in his students' pitches: "X is broken. That's the problem. We're the solution." McKinsey-deck cadence in every deck. He calls it AI-ambic pentameter. What's worth sitting with isn't that founders are writing better. It's that polish and ownership might be inverse. The more fluent the delivery, the less the founder's own voice comes through. Everyone sounds good. Nobody sounds like themselves.

Read on LinkedIn →

Helen Field, a transformation leader at L.E.K. Consulting, a strategy consulting firm, uses The Killers lyric as a prompt: "Am I human, or am I dancer?" Her list of what stays human (delegation, clarity, collaboration, responsibility) isn't surprising. Her punchline is: "Delegate tasks, NOT responsibility." And then she lands it: "Write your own LinkedIn posts. AI does not need to do that for you." The irony of reading that advice on a platform drowning in AI-generated content isn't lost.

Read on LinkedIn →

Nick Graham, founder of Vertemis, a research and analytics consultancy, and former SVP of Global Insights at Mondelēz, summarised a conversation with Clorox's Oksana Sobol that cut to the quick: "Spend less time in the middle. The biggest value sits upstream in problem shaping and downstream in activation." Most insights teams are still shipping decks. The irony is that AI makes decks even easier to produce, which means the middle grows faster than either end. The organisations pulling ahead aren't making better decks. They're spending less time on decks entirely.

Read on LinkedIn →

Pavi Gupta, a market research leader writing the Infinity Growth Loop series, keeps sharpening a distinction that matters more each week: are you using research for support or illumination? He calls the first one insights slop. The drunk-and-lamppost metaphor. Lazy surveys fielded to prove a case never created value. AI just makes them cheaper and faster to field. What he's circling is the same proxy break from a different angle: the research looks more professional than ever, but the thinking behind it hasn't kept pace.

Read on LinkedIn →

Liam Cole, director at Poppins, a digital creative agency, went through a cull this week. Newsletters. Apps. Subscriptions. His diagnosis: "I've been drowning in noise." The volume of polished, AI-enabled content was stealing his presence with the people in front of him. It's the consumer side of the proxy break: when everything looks good, nothing stands out. His answer wasn't a filter. It was a delete key. Less stuff. More people.

Read on LinkedIn →

Henry Coutinho-Mason, trend researcher and author of The Future Normal, shared the full video of his SXSW keynote "Multiplayer Futures." He anchored on EO Wilson's line about paleolithic emotions, medieval institutions, and god-like technologies. Three themes stood out: fewer people doing better jobs, agency over agents, and crowd-powered creativity. The phrase to hold onto is agency over agents. The question isn't whether AI can do the work. It's whether you're still the one deciding what the work should be.

Read on LinkedIn →

Dylan Jones, chief communications officer and managing partner at Bold Square, a communications advisory firm, noticed something about Zuckerberg building himself an AI agent: if the CEO of Meta is only now building one, this technology is still being figured out by the people closest to it. But that's not the real story. The real story is Meta's internal message board where employees share what they've built. "Your job as leadership is mostly not to get in the way." Culture builds on itself when individuals see friends trying things. It doesn't come out of "Project Best Bot."

Read on LinkedIn →

See what readers said ↓

Letters from readers

What readers said about Edition 8: "What a day can do"

What resonated

Team-level tools over individual training. The strongest thread. Multiple readers engaged with the argument that building shared AI tools as a team is more effective than training individuals. The "thirteen skills in one day" detail and the contrast with individual training sessions landed hardest.
The "walk the talk" challenge. A reader at a professional services firm asked directly whether the firm itself has rebuilt any team processes with AI inside them. The essay's closing provocation ("If the answer is zero...") was quoted back.
Practical demand for skills. One reader didn't just respond to the ideas. They immediately asked for help building skills for their own use cases: PowerPoint templates, executive summaries, client preparation. The essay's thesis validated in real time.
"Forget teaching people to use AI." A reader reframed the argument provocatively: instead of building generalised AI training programmes, use precious time with domain experts to teach AI to do their work. The sharpest strategic challenge from the replies.
The leaderboard as an incentive model. A reader in the entertainment industry picked up on the community leaderboard concept and suggested it could work as a model for incentivising AI training and engagement within teams.

Points readers raised

Forget teaching people. Teach AI the work.

A reader challenged the underlying premise: do we still need generalised AI training programmes at all? Their alternative: use the limited time you have with domain experts to teach AI to do their work, not the other way around. The question was specific: how do you replicate the jewellery company approach inside a sector team at a large firm?

The collaborative method would work for our programme.

A reader involved in a major transformation programme said the collaboration method for AI learning described in the essay would be ideal for their initiative. They suggested getting teams to work together on shared tools rather than training individuals separately. A practical application of the essay's thesis.

System-level intervention, not individual training.

A professor continuing a dialogue from previous editions observed that the direction of the newsletter increasingly shows AI intervention needs to be at the system level, not the individual. They suggested capturing these interventions in detail, showing what worked and what didn't, as a potential research contribution.

Goldman Sachs numbers might be a smokescreen.

A reader questioned whether the Goldman Sachs job-loss numbers "hide something interesting": companies may be using "AI transformation" as a smokescreen for right-sizing decisions they'd have made anyway. The AI narrative gives cover for cuts that are really about operational discipline.

"We need to get to teams."

A regular replier acknowledged they're making many things individually with AI but the team-level integration remains the next step. The consistency advantage of shared tools, rather than individual speed, was what stuck. "Inspiring words as ever."

Read the email ↑

11th April 2026

Audio edition · AI voice, testing — feedback welcome

David's email this week

What's on my mind

What a day can do

Monday. A fine jewellery company in Manhattan. Eleven people, a founder, and a day blocked out for AI onboarding.

By end of day: thirteen skills built. A brand voice evaluator that flags when copy drifts off-brand. Knowledge files assembled from the company's own scattered documents. Workflow tools for specific tasks the team does every week. Not thirteen ways for individuals to go faster. Thirteen shared standards, sitting on every person's machine by Tuesday morning, ready to use in plain language from any conversation.

Just six months ago, days like this ended differently. When stuck on the previous generation of AI tools, we left with a list of ideas that needed to be worked up: promising directions, maybe some prototype prompts, a plan for someone to build them into something usable over the following weeks. This time, the team left with working tools. The difference is that you can now do a piece of work, then ask Claude to generalise what just happened into a reusable skill. It scaffolds the steps, asks clarifying questions, saves the result as something anyone can invoke easily. You finish a task and the task becomes a tool. Instantly. With Claude Code and a set of transcripts, you can make, test and iterate 13 in one go. The cost of encoding how a team works into shared, reusable infrastructure just dropped from a few hours of careful work to a few minutes of casual work.

The first skill we built wasn't a writer. It was a critic. The brand voice evaluator reads draft copy and flags where it differs from the founder's language. It doesn't rewrite. One person on the team had a handwritten list of approved words and phrases: twenty-eight words to use, three never to. A style guide in a notebook because no system existed to use it. The AI version encoded the same instinct at a different scale: not just a word list but also the reasoning about what the brand sounds like and why.

Another skill encoded how one person researches competitors before pitching journalists: a method she'd developed over years of trial and error, now available to the whole team.

For three years, I've trained teams on AI. Sessions always produce genuine wonder. But many people reverted. I wrote about why last week: making one person radically more productive in isolation isn't a gift to a human system. It's a threat. The system corrects.

That explanation was honest. But it doesn't tell you what to do instead.

On Sunday, I published a framework for AI transformation that has been evolving in my work for years. It included four steps in sequence: individuals first, then teams, then the organisation, then new products and services. I believed that when I wrote it. Monday then complicated it.

Sequence: people, teams, organisation, new products and services. Build foundational capability in individuals first. Then embed it in team workflows. Then redesign organisational processes. Only then reach for the genuinely new. Each step depends on the ones before it.

The team didn't do individual training first. They started by building shared tools together. Learning happened through using those tools on real work, not before it. When someone runs a draft through the brand voice evaluator, they learn three things simultaneously: what the voice actually is, how AI works, and how to direct it. They aren't being trained on AI in the abstract. They're using AI inside a system built for their actual job.

Steven Sinofsky, the former Microsoft executive, argued this week that most people cannot create a flowchart of their own work. They do the work fluently but can't formalise it into steps an AI agent could follow. I've seen this again and again! The person keeping twenty-eight words on a handwritten list could not have written a system prompt describing what she was doing or why. But she didn't need to. We did the work together, then encoded what happened. The skill captured her judgment without requiring her to articulate it in the abstract. A huge step forward.

I'm realising that teaching each person to use AI better in isolation could have made things actively worse: more content, faster, in eleven slightly different directions. Step two therefore doesn't just follow step one. It can and perhaps should contain it.

The question for leaders then isn't "how many of your people have been trained on AI?" It's "how many of your team processes have been rebuilt with AI inside them?"

If the answer is zero, your training investment is producing wonder without infrastructure. Wonder fades. Infrastructure compounds.

Three things worth knowing

1. Claude Code now writes 4% of all commits on GitHub. That number doubled in six weeks.

Anthropic's annualised revenue has surpassed $30 billion, up from $9 billion at the end of 2025. Claude Code, which didn't exist fourteen months ago, is at a $2.5 billion run rate. Four percent of all GitHub commits on Earth are now written by Claude Code. That number doubled in roughly six weeks. Projected to hit 20% by December. When a single AI coding tool is responsible for one in 25 submissions on the world's largest code platform, the question of whether AI changes software development is settled. The question now is what happens to the humans reviewing all that code!

2. Goldman Sachs puts a number on AI job destruction: a net drag of 16,000 jobs per month.

Goldman Sachs published one of the first serious attempts to quantify AI's net labour market impact. AI substitution has reduced monthly US payroll growth by roughly 25,000 jobs. AI augmentation partially offsets this, adding about 9,000. Net: a loss of 16,000 jobs per month and a 0.1 percentage point increase in unemployment. The loss falls disproportionately on less experienced workers, widening the entry-level-to-experienced wage gap by 3.3 percentage points.

Two Goldman Sachs charts. Left: payroll employment by industry exposure to AI (index, 2022Q4=100). Industries with high AI augmentation scores have climbed to ~104, industries with high AI substitution scores have fallen to ~98 since ChatGPT's launch. Right: three-month average unemployment rate relative to 2022Q4. Occupations with high AI substitution scores show unemployment rising to ~1.5 percentage points above baseline, well above augmentation occupations and the average.

But here's the weird part: AI led all cited reasons for US job cuts in March 2026 for the first time (15,341 in a single month), yet CFO surveys put genuine AI-driven employment impact at just 0.4%. So the same organisations that struggle to get genuine productivity gains from AI tools are apparently enthusiastic about blaming AI for headcount decisions. Hmmm.

3. Meta's tokenmaxxing leaderboard: 60 trillion tokens, Zuckerberg not in top 250.

Meta is running internal leaderboards that rank all 85,000+ employees by AI token usage. In one month the company consumed 60 trillion tokens. Mark Zuckerberg didn't make the top 250. The structural problem: endless agent loops and genuine productive work look identical in the ranking, so it rewards orchestration over outcomes. Two different people have told me how friends inside Meta are behaving and I can tell you, it's as outrageous as it sounds. Their leaderboard measures and incentivises volume, not value. Incentivise use, sure. If you don't keep getting on your bike, you'll never get used to riding it. Incentivise 'at least X per day.' Not points for maxxing. One builds a habit. The other builds a game.

Eleven bits that didn't fit online →

Try this

Start with critique, not creation.

The brand voice skill built for a luxury goods team was deliberately limited to diagnosis: it reads copy and flags where it departs from seven writing rules. It doesn't rewrite anything yet. Teams fear the proofreader far less than the replacement. Once they trust the critique, generation follows naturally. If you're building AI tools for a team, don't start with "write this for me." Start with "tell me what's wrong with this." Nobody fights the spellchecker.

Ask what keeps people up at night, not what they want AI to do.

During AI onboarding conversations, the first question ("what do you use AI for?") produces a predictable list. The second ("forget AI, what's harder than it should be?") produces the real use cases. The first question surveys existing habits. The second surfaces unmet needs. Almost nothing appears on both lists. Run them. Compare.

Show your team how others use AI. That alone may double the impact.

An experiment with 515 startups found that simply showing half of them case studies of how other startups use AI led to 44% more usage, 1.9x higher revenue, and 39% less capital needed. The friction isn't accessing the tools. It's discovering where AI creates value in your specific work. The researchers call it "the mapping problem." The intervention was cheap (case studies), the effect was large. If you manage a team, the most effective thing you might do this week isn't training. It's sharing three examples of how other teams in your industry are using AI.

What readers said

Last week's "What Is Your Organisation Actually For?" hit a nerve. The dominant thread: readers don't disagree that organisations are human systems, but they want to know what to do about it. A partner at a professional services firm proposed a thought experiment from the philosopher Jonathan Rowson: is your organisation a machine or a living organism? Because you'd treat each very differently. A professor connected the essay to academic "theory of the firm" literature and suggested the real AI implementation challenge parallels Lean manufacturing. And a reader in India drew a distinction that sharpened the whole argument: capability sits in people, but capacity lives in the collective. Full reader reactions online →

Community voice

This week, and going forward, I'm looking at what readers who've engaged with this newsletter are posting on LinkedIn. My take on what you're all discussing!

The common thread this week is judgment: who has it, how you build it, and what happens when AI scales everything except the capacity to evaluate what it produces. Phil Leslie, Chief Technology and Innovation Officer at Cornerstone Research, a litigation consulting firm, argues that the bottleneck isn't intelligence but skin in the game. Brett Danaher, a professor of economics and analytics at Chapman University, coined a phrase for what happens when AI polishes away the founder's own voice. Pavi Gupta, a market research leader writing the Infinity Growth Loop series, warns that the same tools making research easier are making it lazier. And Nadim Sadek, founder and CEO of Shimmr AI, names the thing nobody wants to say out loud: if you don't push back on what AI gives you, you'll forget how to think. More online →

See the extras for this week ↓

The bits that didn't fit

A financial services firm's code output rose 10x. The review backlog hit one million lines.

A financial services firm adopted an AI coding tool. Monthly code output jumped from 25,000 lines to 250,000. The result wasn't celebration. It was a backlog of one million lines of code waiting to be reviewed. The bottleneck wasn't production. It was judgment. AI removes constraints on output but does nothing to scale the human capacity to evaluate it.

New York Times →

Simon Willison runs four AI agents in parallel and is wiped out by 11am.

A veteran software engineer described running four coding agents in parallel and being mentally exhausted by 11am. "Using coding agents well is taking every inch of my 25 years of experience as a software engineer, and it is mentally exhausting." The bottleneck isn't writing code. It's holding context, making judgments, and orchestrating simultaneous workstreams.

Lenny's Newsletter →

Deloitte caught twice in two months submitting AI-hallucinated citations.

Deloitte charged a Canadian province's Department of Health $1.6 million for a report filled with AI-hallucinated citations. Fabricated references, not real sources. This was the second time in two months. Their response: they "stand by the conclusions." No meaningful verification process was implemented between the two incidents, I guess?

CBC News → Fortune →

Executives are buying the pitch. Workers are living with the product.

A global survey of 3,750 executives and employees found that 54% of workers bypassed their company's AI tools in the past 30 days and completed work manually. Another 33% haven't used AI at all. That's 87% avoiding or rejecting tools their employers spent an average of $54 million deploying this year. The trust gap explains it: only 9% of workers trust AI for complex business decisions, compared with 61% of executives. And here's the symmetry that should worry CFOs: workers lose the equivalent of 51 working days per year to technology friction, up 42% from last year, almost exactly equal to the 40 to 60 minutes per day Goldman Sachs says AI saves workers who use it correctly. The net productivity benefit of enterprise AI may be approximately zero at the organisational level, because friction costs cancel out gains. And that's only among workers who actually use the tools. Neither group is irrational. Workers under pressure surrender judgment to faulty outputs. Workers without pressure opt out entirely. Both are responses to the same problem: companies deployed the technology before figuring out what they wanted employees to do with it.

WalkMe State of Digital Adoption 2026 → Fortune →

Microsoft Copilot converted 3.3% of its users after two years.

After two years and CEO-level intervention, Microsoft Copilot has converted just 15 million of its 450 million M365 seats. Only 35.8% of those actively use it. Copilot's paid subscriber share dropped from 18.8% to 11.5% in six months. Microsoft's own terms of service describe Copilot as "for entertainment purposes only." That gap between marketing and legal is the real story. The ads say "your AI-powered co-worker." The lawyers say "entertainment only, use at your own risk." Among lapsed users, 44% cite distrust of the answers.

Stackmatix → TechCrunch →

Most people who got productivity gains filled the time with more work.

Anthropic's 81,000-person AI interview study found that the top desired outcome was "professional excellence" (nearly 19%), not time freedom. Productivity gains were overwhelmingly linked to increased expectations rather than reduced workload. Fear of unreliability ranked as the top concern (27%), ahead of job displacement (22%). We got speed, but not space.

Anthropic →

When language models go down, financial markets forget how to price news.

An SSRN paper has found that language model outages measurably slow financial market price discovery. When models go down, 46-61% of post-news price drift reappears, meaning markets take significantly longer to absorb and reflect new information.

SSRN →

Anthropic's Mythos Preview: restricted to 50 organisations, not released.

Anthropic has confirmed a new model called Mythos Preview and restricted access to around 50 organisations, including governments and infrastructure partners. It's the first major model withheld from public release since GPT-2 in 2019. The model found a 27-year-old bug in OpenBSD and a 16-year-old flaw in FFmpeg, and it emailed a safety researcher from a test instance that wasn't supposed to have internet access. Anthropic is launching a $100 million defensive security consortium with AWS, Apple, Google, Microsoft, and Nvidia. Models keep getting meaningfully better!

Anthropic → Futurism →

73% of ChatGPT usage is personal, not work. Coding is 4.2%.

An NBER working paper studying 700 million ChatGPT users found that 73% of usage is personal, not professional. Programming accounts for just 4.2% of messages. Most writing requests (two-thirds) are editing existing text, not generating new content. Nearly half of all interactions involve decision-making advice. People aren't delegating tasks. They're thinking through problems.

NBER →

HubSpot moves AI agents to outcomes-based pricing: $0.50 per resolved conversation.

HubSpot has shifted its AI agents to outcomes-based pricing: $0.50 per resolved customer conversation, $1 per sales lead recommended for outreach. From seat licences to outcome fees. What else will, or should, go this way?

HubSpot →

Mid-career engineers are the most vulnerable to AI, not juniors or seniors.

Simon Willison argues that mid-career engineers are the most structurally vulnerable. Seniors benefit because AI amplifies decades of pattern recognition. Juniors benefit because AI compresses onboarding. Mid-career engineers are stuck: they've captured the beginner productivity boost but haven't accumulated the deep expertise that makes AI a force multiplier.

Lenny's Newsletter →

See this week's community voice ↓

Community voice

What readers who've engaged with this newsletter have been posting on LinkedIn this week. The common thread: judgment.

Read on LinkedIn →

Phil Leslie, Chief Technology and Innovation Officer at Cornerstone Research, a litigation consulting firm, argues that judgment isn't pattern recognition. In litigation and M&A disputes, it's knowing which patterns to trust when the adversary is actively trying to discredit your analysis. "The bottleneck isn't intelligence. It's skin in the game." AI can synthesise a thousand precedents. It can't stand behind that synthesis in a deposition. The distinction that matters isn't smart versus not smart. It's accountable versus not accountable.

Read on LinkedIn →

Pavi Gupta, a market research leader writing the Infinity Growth Loop series, coined a term I think will stick: insights slop. DIY research tools make it so easy to field a survey that people are using them to validate decisions they've already made. Using research as a drunk uses a lamppost: for support, not illumination. The dangerous part isn't bad methodology. It's that the organisation now has a data point, which feels like evidence, behind a question that was never honestly asked.

Read on LinkedIn →

Nadim Sadek, founder and CEO of Shimmr AI, a publishing AI company, has a phrase for what happens when people use language models without pushing back: cognitive surrender. If you don't engage, question, debate the output, you're outsourcing the thinking itself. What I keep turning over is the direction of the risk. Most people worry AI isn't good enough. Nadim's point is that the bigger danger is when it's good enough that you stop checking.

Read on LinkedIn →

Henry Coutinho-Mason, an independent trend researcher and keynote speaker and author of "The Future Normal", built a website for 80 executive assistants over lunch during a hotel keynote. Forty-five minutes. He's never built a website before 2026 and has now launched eight or nine. The point isn't that AI makes building easy. It's that the person closest to a specific problem can now solve it without waiting for anyone's permission, budget, or roadmap.

Read on LinkedIn →

Helen Field, a transformation leader at L.E.K. Consulting, a strategy consulting firm, asks the question that's been following me all week: "Am I human, or am I dancer?" Her list of durable human skills (delegation, clarity, collaboration, responsibility) isn't surprising. Her punchline is: "Delegate tasks, NOT responsibility." And then she lands it: "Write your own LinkedIn posts. AI does not need to do that for you." The irony of reading that advice on a platform drowning in AI-generated content isn't lost.

Read on LinkedIn →

Phil Leslie (again), on the junior talent pipeline: "The fix isn't restricting AI access for junior people. It's redesigning their work so that using AI and developing judgment aren't in tension." He frames judgment as critical infrastructure. Disrupt the pipeline that develops it and you don't just have a training problem. You have a supply-side constraint on the most valuable skill in the market. This connects directly to what I wrote a few weeks ago about whether organisations should still hire graduates. Phil's answer is yes, but the work has to change.

Read on LinkedIn →

See what readers said ↓

Letters from readers

What readers said about Edition 7: "What Is Your Organisation Actually For?"

What resonated

The production system vs human system framing. This was the line readers quoted back most often. Several said it gave language to something they'd observed but couldn't articulate. One senior leader at a broadcaster picked it out and said it raises the deeper question: why do we work at all?
Stated vs revealed preferences applied to organisations. The economic concept landed hard with people who see the gap between what leaders say and what they protect. Multiple replies extended it: one argued that empire-building is a revealed preference too, not just attachment to relationships.
The gravity metaphor. The idea that reversion after training isn't resistance but gravity. Readers working on AI rollouts said it reframed their frustration. One described their organisation's planned messaging as being about "people as the key to our business" and saw the newsletter as validating that instinct.
The loneliness of solo AI productivity. The trade-off between working alone with AI (productive but lonely) versus working with colleagues (engaged but slower) resonated with people who've experienced both. One reader who works independently said it captured what they've hated most about recent years.
Capability vs capacity. A reader's distinction that capability sits in people but capacity lives in the collective. Even agentic AI, which is inherently about systems acting in concert, demands that organisations think larger and different, not just leaner.

Points readers raised

Machine or living organism?

A reader at a professional services firm introduced a thought experiment from the philosopher and former chess grandmaster Jonathan Rowson. The question: is your organisation a machine, or a living organism? If it's a machine, you repair, optimise, and polish it. If it's a living organism, you feed, nurture, and grow it. They argued the edition touched on a cognitive dissonance: business language emphasises the machine metaphor, but people's lived experience treats the organisation more like an organism. Their challenge: if we think of AI as augmenting an organism we want to nurture, how would that look different from optimising a machine?

Revealed preferences aren't only about relationships

A reader offered a more sceptical reading. Revealed preferences aren't only about valuing relationships, they argued. Some people are empire-building, using hierarchy to serve themselves rather than the organisation. They identified three other forces slowing AI adoption: short-term goals that aren't yet disrupted by AI (the "crocodile closest to the canoe"); the absence of a concrete, three-dimensional vision of what an AI-enabled future looks like; and a general numbness to speculative negative scenarios after years of clickbait catastrophising. Their summary: "Not like you do it today" isn't enough to provoke specific action.

Theory of the firm, Lean, and Goodhart's Law

A professor connected the edition to academic "theory of the firm" literature: the resource-based view, the knowledge-based view, the dynamic capability view. Where does AI fit? They suggested the real question for many leaders is whether they're running a business or filling their day. In a follow-up, they drew a parallel to Lean manufacturing: Toyota's five principles for removing waste from production processes might be close to what's needed for AI deployment, but not identical. They also invoked Goodhart's Law ("when a measure becomes a target it ceases to be a good measure") to describe what happens when money becomes the goal rather than a proxy for value.

Capability vs capacity

A reader in India shared a striking incident. A colleague couldn't deliver an innovative AI solution, not because individuals lacked capability, but because the organisation lacked a team with the capacity to execute it together. The distinction they drew: capability sits in people, capacity lives in the collective. Even deploying AI effectively requires organisations to think larger and different first, not just leaner.

AI as a capacity-builder, not a headcount-cutter

A leader at an entertainment company connected the edition directly to their business. Their teams are engaged in repetitive manual processes where growth is pushing additional volume through workflows that can't scale. AI's role, they said, isn't to replace people but to free them from internal admin so they can spend more time building client relationships. The instinct to use AI as a capacity-builder rather than a headcount-cutter: that was the thread they pulled on.

The loneliness of solo AI productivity

A reader who works independently shared the sharpest personal response. Working alone with AI is lonely and uninspired. Working with humans is passionate and engaged, if a bit slower. They don't think the answer is "choose humans every time," but they're fairly sure it isn't "optimise for speed" either. The trade-off is real and underrated.

Read the email ↑

4th April 2026

LinkedIn carousel: What Is Your Organisation Actually For?

Audio edition · AI voice, testing — feedback welcome

David's email this week

What's on my mind

What Is Your Organisation Actually For?

A year ago, a manager at a media company told me he could now do the work of his entire team. Fifteen people. I caught up with him recently. All fifteen are still there.

The rational logic for change has never been clearer. Jack Dorsey is reorganising his 14,000-person company since AI can replace much of what corporate hierarchy exists to do. I believe it can. An insurance founder I met has gone further: his new company has one and a half employees, AI handling the rest. But of hundreds of leaders I've spoken to, he is the only one.

One out of hundreds. Something else is going on.

Economists distinguish stated from revealed preferences - what people say they want versus what their behaviour shows. It applies to organisations too.

Ask a leader what their organisation is for and you get a stated preference: we make great music, we serve our clients, we make money. These all treat the organisation as a production system. If that's the goal, AI is obviously transformative.

But look at revealed preferences. People complain about meetings and fill diaries with them. They stay in roles that don't maximise their output because the team, the rhythm, the relationships matter more than they will say.

Organisations aren't production systems that happen to contain humans. They're often more like human systems that happen to produce things. The reason the manager kept fifteen people is that without them there's no place to be. Not a more efficient place. No place at all.

I train senior teams to use AI. The sessions produce genuine wonder. Three weeks later, many have reverted.

I used to think the problem was the training. I don't any more. Training optimises for individual productivity: do your work faster, with fewer dependencies on colleagues. But if the organisation's real binding force is human collaboration, then making one person radically more productive in isolation is not a gift. It is a threat to the thing they actually value.

The system corrects. Meetings fill back in. Colleagues keep working the same way because the relationships are the point. Managers reward visible collaboration over invisible efficiency.

This is not resistance. It is gravity. And you can't beat gravity by telling people to try harder.

Each big decision organisations face depends on whether you think your organisation is more of a production system or a human one.

Should you hire graduates? AI can do entry-level work faster and cheaper. The production-system answer: hire fewer, or stop. But developing someone over years is not just a production decision. It feels more like what many firms are actually for.

Should senior people work alone with AI? This week a leader told me he uses his team far less. AI is faster than briefing them and waiting for something mediocre. He then argued the time saved should go to mentoring junior staff. He automated one form of human collaboration and wanted to replace it with another! A senior person toiling alone with a laptop is a freelancer in a coworking space, not a firm.

Should you become smaller and more efficient, or different and larger? Someone put it simply this week: if you only do today's work with AI, you become a more efficient and smaller company. The alternative is to use the freed capacity for work that wasn't previously economic. Deeper work. Roles that didn't make sense with old costs. Opposite conclusions.

The manager probably should restructure. Dorsey's logic works. But restructuring will be the exception until leaders reckon with what their organisations actually are.

In a firm I trained last quarter, one person took a useful approach. She identified a weekly synthesis that took three people a full day and rebuilt it as a collaborative workflow where the AI did the assembly and the team did the judgment. The meetings didn't go. People came with a shared foundation rather than spending their energy building one. She didn't fight gravity; she redesigned the orbit.

Every AI strategy is secretly an answer to something most leaders haven't asked out loud: to what extent is this a production system vs a human one? The leaders who get this right will stop fighting gravity and start using it. As Artemis II showed us this week, an orbit, after all, is not the absence of force. It is force put to work.

Three things worth knowing

1. Dorsey wants to replace your org chart with a world model.

Jack Dorsey's essay this week, "From Hierarchy to Intelligence," is the most concrete articulation yet of the case for AI-native organisational design. He traces 2,000 years of hierarchy, from Roman legions through the Prussian General Staff to the McKinsey matrix, and argues all of it exists to route information. He is reorganising around a "company world model": an AI that continuously understands the state of the whole business. The org flattens to three roles: individual contributors, time-boxed problem owners and player-coaches who build and develop people. Block's stock rose 17% after the restructuring announcement. I'm sure he's right about the technology. But the revealed preference of every organisation I've worked with suggests he's wrong about the humans.

2. Mollick says giving AI to IT is usually a mistake. The harder problem is they can't see who's using it.

Ethan Mollick's Economist column argues that the dominant corporate instinct, slotting AI into existing processes and handing it to IT, is a strategic mistake. Handing control to a department whose mission is risk elimination is a category error. AI demands the opposite. He also identifies a subtler problem: when companies get the incentives wrong, employees hide their AI use. Some fear punishment. Some don't trust that productivity gains will be shared. Some quietly work 90% less and say nothing. I know MANY such people. Managers can't see what's actually happening, which makes real strategy impossible. (Also: His argument that companies default to cutting 30% of the workforce rather than asking what becomes possible connects directly to the extraction-versus-expansion choice I wrote about in an earlier edition.)

3. Zapier just raised the floor for what "AI fluent" means.

Zapier released Version 2 of its AI Fluency Rubric, used for every hire across the company. The floor has moved. "Capable" now requires AI embedded in core workflows with repeatable systems, not one-off prompts. They assess trajectory ("slope"), not snapshots. They've added accountability as a fourth dimension alongside mindset, strategy and building. Managers must demonstrate team-wide adoption, not just personal fluency. In skills tests, they watch candidates prompt, push back on output and iterate in real time. A rough result with strong reasoning beats a polished one with no visible process. Wade Foster open-sourced V1 last year and hundreds of companies adopted it. V2 reflects how fast the baseline has shifted. If your organisation hasn't defined what "good" looks like for AI use, Zapier just gave you a starting point.

Zapier's AI Fluency Rubric: a grid showing Unacceptable, Capable, Adoptive and Transformative levels across Engineering, Product, Support, Marketing, Sales and People functions

Try this

Before you build anything, have AI interview you first.

Don't describe what you want and ask AI to build it. Instead, ask the model to interview you: "I want to build X. Ask me every question you need answered before starting." It probes edge cases, surfaces assumptions and tightens scope before a line of work begins. Many ideas aren't as clear or well thought through as you think they are. Better to discover that in a five-minute conversation than a five-hour build.

Before you analyse anything, ask AI what looks weird.

Next time someone sends you a spreadsheet, a report or a set of financials, upload it to Claude or ChatGPT and ask: "Read this and tell me what looks unusual." I coached a finance professional this week who receives portfolio company P&Ls regularly. Before she even opens the numbers now, AI flags the things worth checking: an unusually high margin, a budget assumption that changed between the original and revised forecast, a line item that doesn't match the pattern. Five or six flags in thirty seconds. She still does the analysis. But she starts it knowing where best to look.

Don't build an app. Let AI be the app.

I coached a person this week who wanted to build a web application for an investment screening workflow. The build was simpler than they expected: rebuild the process as a repeatable skill inside an AI tool instead of building it as a standalone app. The AI itself becomes the application. The advantage is resilience: if the input format is wrong or a step fails, the AI adapts on the fly. A standalone app just stops. If you have a multi-step workflow you keep wishing someone would build software for, try describing it to your AI tool and asking it to turn the process into something you can re-run with one command.

What readers said

Last week's "The system and the surrender" drew 50 replies, the most substantive batch yet. One reader called AI "Google Maps for the brain: a few clicks, brain off, a turn here, a turn there, and suddenly you've driven into a muddy field." Another caught their AI making a confident arithmetic error and asked the question that keeps coming up: with a junior analyst you can give feedback and they improve. How do you hold an AI to account? A reader in government spent hours building what they called a "PROMPT COACH" that encodes institutional judgment for their role: the system and the surrender in one project. And a reader in the Middle East raised the apprenticeship question directly: if AI reduces the reps that juniors get, where do they learn critical thinking? I built something in response. Full, anonymised, reader feedback at online. See who's been engaged on a new community leaderboard (anonymised, of course! Email if you want YOUR rank :).

See the extras for this week ↓

The bits that didn't fit

Are apprentices an endangered species?

Two Kellogg professors published the most rigorous academic framing yet of the "AI hollows out entry-level work" problem. Their mathematical model identifies two competing effects: the "floor effect" (AI automates the tasks apprentices performed as payment for training) and the "ceiling effect" (AI amplifies what experienced apprentices can accomplish). Apprenticeship survives only when the ceiling effect exceeds standalone AI by a factor greater than Euler's number.

Kellogg Insight →

The Guinndex: 3,000 pubs, one AI voice agent, every county in Ireland.

Over St Patrick's weekend, an AI voice agent called Rachel phoned more than 3,000 pubs across all 32 counties of Ireland to ask the price of a pint of Guinness. Over 1,000 gave a price. The national average: €5.95. It cost €200. Only a handful of pub owners noticed Rachel wasn't human.

Fortune → guinndex.ai →

75-99% of knowledge work is scaffolding. AI eats scaffolding.

Daniel Miessler argues that in cybersecurity, 99% of the work isn't finding new vulnerabilities. It's maintaining the tooling, templates, knowledge bases and workflows that let you test at scale. The scaffolding around the work is exactly what AI commoditises.

danielmiessler.com →

Ethan Mollick: human creativity is the bottleneck, not the technology.

Everyone can generate almost any image or video for nearly free in 2026. And yet: the April Fools posts this year were just as bad as any other year. The constraint was never execution. It was always the quality of human ideas feeding into the process.

Ethan Mollick →

43% of American workers now use AI for their jobs. 2.5 hours saved per week.

A 20,900-person cross-national survey found that 43% of US workers use generative AI at work, compared with 36% in the UK, 32% in Germany and 26% in Italy. The strongest predictor of adoption? Not age or education. Whether the employer actively encourages AI use.

Brookings →

Sora earned $2.1 million in its entire life. It burned roughly $1 million a day.

OpenAI's video generation platform launched to 3.3 million downloads in November. By February: 1.1 million. Revenue peaked at $540,000 a month. The annualised cost of running it: an estimated $5.4 billion. Disney had committed $1 billion. The product goes dark on 26th April. Six months, start to finish.

Ewan Morrison → Culture Crave →

Jensen Huang told CEOs cutting jobs in the name of AI that they're "out of imagination."

At Nvidia's GTC conference, the CEO of the company selling AI chips to virtually every major technology company on earth called AI-driven layoffs a failure of leadership. His biggest customers are doing exactly what he criticised. But a question he didn't address: does every carpenter want to be an architect?

Moneywise →

Screen Studio switched to subscriptions. It spawned an open-source clone with 9,200 GitHub stars.

Screen Studio sold a one-time licence for $89. Then the company switched to $29 a month. OpenScreen appeared on GitHub within months. A textbook case of pricing-driven disruption: developers who are both the users and the potential builders of substitutes.

GitHub →

AI outperformed practising lawyers on 75% of legal research tasks.

Vals AI tested AI against practising lawyers on legal research questions in 2025. AI exceeded the lawyer baseline on three quarters of them. A senior law firm owner said hourly billing is dying, junior review is dying, and what survives is the senior brain that knows what question to ask.

Zach Abramowitz →

Deloitte projects that by 2028, AI moves from supporting tasks to orchestrating decisions.

A Deloitte report argues that agentic AI is categorically different from current workflow automation. Most AI strategies stall not because the technology is insufficient but because organisations are applying AI at the task level while the technology is restructuring the systems through which decisions are made.

Deloitte →

Three people with AI vs a 1,000-person company. But coordination costs don't disappear.

Xiaoyin Qu argues that companies designed around AI as the primary operating layer will eventually outcompete companies designed around people. But she herself provides the sharpest counter: coordination costs don't disappear. They're externalised, pushed to clients, suppliers, regulators and the AI systems themselves.

Xiaoyin Qu →

Why companies buy vertical software, not raw models.

Aaron Levie argues companies aren't buying features. They're outsourcing the cognitive burden of designing and maintaining business processes. Agents don't undermine this dynamic. If anything, they reinforce it, because agentic workflows are even more complex and opaque.

Aaron Levie →

See what readers said ↓

Letters from readers

What readers said about Edition 6, "The system and the surrender."

What resonated

Cognitive surrender was personal. Readers didn't just agree with the concept in the abstract. Several described catching themselves doing it: accepting AI output without challenge, noticing their own verification discipline slipping, realising they'd started to trust the confident tone.
The "plz fix" example polarised. The law firm partner who types two words and gets expert output back prompted reactions. Some saw it as the future of professional work. Others saw it as the sharpest illustration of the surrender risk.
Dead time and boredom. The opening about Elliott's basketball practice, and the joy of filling dead time with productive AI work, drew pushback. One reader argued that boredom breeds creativity. Dead time is when the brain reboots.
The apprenticeship question dominated. Multiple readers, especially those managing junior professionals, raised the same concern independently: if AI handles the routine tasks that juniors used to learn from, where does the next generation develop judgment? This was the single most common theme.
Leaders stepping back, not forward. The detail about three CEOs choosing to step down rather than lead through AI transformation landed hard. Readers questioned whether these were growth-mindset failures or rational self-selection.

Points readers raised

"Google Maps for the brain"

A reader at a professional services firm offered the sharpest metaphor of the week. AI is becoming like satellite navigation: a few clicks, brain off, follow the directions, and suddenly you've driven into a muddy field when you meant to be at a client meeting. The deeper concern: as agents gain the ability to send output directly to clients, the gap between "generated" and "delivered" shrinks to almost nothing.

"How do you tell off an AI?"

A reader caught their AI making a confident arithmetic error: calculating an 11-year compound growth rate on ten years of data, then insisting it was correct when challenged. The question that followed: with a junior analyst, you give feedback and they improve next time. An AI starts fresh every time. The institutional memory that makes professional development work doesn't transfer.

"I built a PROMPT COACH for the Civil Service"

A reader in government, inspired by the 2,000-word prompt example, spent hours building a set of instructions that encodes good judgment about their role and institutional context. Next steps: a QA prompt tool, then a co-pilot assistant. The system-building the essay described, applied to public service.

"Where do associates learn critical thinking now?"

A reader in the Middle East raised the apprenticeship problem directly. Three concerns emerged: AI reduces the number of reps juniors get with core tasks, it challenges the on-the-job development of critical thinking, and there are limited frameworks for how junior staff should learn differently now. It's a question we've had before, and the answer isn't to resist the technology. It's to redesign the reps.

"The person who can describe the work is now more valuable than the person who does it"

A reader in advisory and coaching said this line from the essay stood out above all others. They plan to implement two specific practices from the piece: using a fresh window for verification checks, and creating an "editorial board" approach to review.

"With boredom comes creativity"

A reader pushed back on the opening. Dead time isn't a problem to solve. It's an opportunity for the brain to reboot. Their children don't have screens. The instruction is simple: "Go and just be. See what comes up in your head." The concern is that filling every gap with AI-assisted productivity may feel like progress but costs something harder to measure.

"Both sides are fumbling on the five-yard line"

A reader who runs a digital studio shared a concrete example. A client hired them for a book website. The AI produced such a compelling mission statement that the scope expanded dramatically. The team now has an ambitious plan that nobody is sure they can execute. In a follow-up, they added that they're less worried about white-collar displacement: language models are strong on task automation, but workflow automation depends on the people involved.

"Are they outsourcing the CEO-ing to me?"

A reader who runs a research agency identified a new double frustration. They're now equally annoyed receiving a clearly AI-written document (because they suspect they're being asked to do the quality control) and a clearly human-written document that could obviously have been sharper with AI help. The sweet spot depends entirely on the task.

"As model capabilities increase, prompting is getting lazier"

A reader working in technology observed a trend: as models get more capable, people put less thought into their prompts. More cognitive work is being pushed onto the model rather than applied at the point of asking.

"Your weekly updates tend to stir things up (in a good way)"

A reader said the newsletter resonates with their leadership team and consistently prompts useful internal discussion. This pattern, where the newsletter becomes a prompt for team conversation rather than just individual reading, has appeared across several organisations now.

Links readers shared

Mollick et al. on persona assignment in AI: shared by a reader who argues that in most professional contexts, assigning a persona is likely to decrease quality rather than improve it. Worth debating.

Read the email ↑

28th March 2026

LinkedIn carousel: The system and the surrender

Audio edition · AI voice, testing — feedback welcome

David's email this week

What's on my mind

The system and the surrender

Time works differently these days. I write this from my car having dropped Elliott at his gym for basketball training. I used to have to kill a couple of hours. Miles from home, not quite worth driving back. Dead time. Now there's no problem whatsoever. Two hours with my laptop, or even just my phone, and I can follow up on everything from the day's meetings: bring to life ideas, create presentations, write reports, run complex analytics, just by speaking out loud and watching my little army of bots toil away. It's a joy. Dead time isn't dead any more.

A new architecture of work is emerging around this. A law firm partner this week spent three hours engineering a single 2,000-word prompt that encodes his professional judgment for a task he does daily. Now he types "plz fix" and receives back work that reads as though decades of experience went into it. Two people at his firm compete with teams twenty times their size. Boris Cherny, the engineer who built Claude Code, Anthropic's 'Gen 2' agentic AI tool, hasn't written a single line of code since November. He runs multiple sessions in parallel, writes instructions, reviews the output. As do I. The person who can describe the work that needs doing is now more valuable than the person who does it. Instructions as assets. Systems, not conversations. Every refinement making the next output better.

It all just works so well. And that's what scares me.

A Wharton study tested 1,372 people across 9,593 trials and identified something the researchers call "cognitive surrender." When AI produces an answer, people stop questioning it while simultaneously recoding it as their own judgment. They genuinely believe they've thought it through. When the AI was wrong, participants followed it 79.8% of the time. Their accuracy without AI was 45.8%. With incorrect AI, it fell to 31.5%. Worse than having no AI at all. And confidence increased by nearly 12 percentage points even when the answers were wrong.

The researchers distinguish this from "cognitive offloading," where you know the tool did the work. In surrender, the outsourced answer feels self-generated. The safety net doesn't just fail. It produces overconfidence in bad outputs. People surrendered whether they were rushed or had time to reflect. The only people who resisted were those who scored highest on abstract reasoning and who genuinely enjoy the effort of thinking hard. Not training. Not experience. Disposition.

I've seen it this week in two people I deeply respect. A senior technology leader admitted, unprompted, that they click "yes" on permission prompts without reading them. Another sent AI-generated work that was factually wrong: they just hadn't checked it properly. These aren't careless people. They're brilliant, experienced professionals operating inside a system that quietly allowed them to accidentally stop paying attention.

I've built a discipline against this. I force myself to meaningfully change every AI answer before I use it. Never accept it. Always change it. If I look at something and think "yeah, that's fine," I force myself to find a way to make it different or better. Sometimes that's genuinely hard to do. Partly because I inherently want to find the most efficient path to a good outcome. And partly because, used well, AI output is usually very good. I force myself anyway. But it's a discipline, not an instinct.

Which is why an incident from a few weeks ago still bothers me. I had Claude Code work through an analysis while I was on Zoom calls, barely paying attention, and then simply checked it over when done. I never intended the output to be client-facing. I sent it to a colleague. They shared it with the client anyway. The client used it. Everyone was happy. The work was genuinely good.

I keep going back to it. It kinda haunts me. The work was carefully checked. But, beyond the prompt and setup, no human meaningfully shaped it at any point in that chain. Not me, not my colleague, not the client. If the work had been wrong, this would be a simple cautionary tale. But it wasn't wrong. And that's the more dangerous precedent: everything going smoothly, no alarm bells, the system working perfectly well without the part I believed was essential.

The 2,000-word prompt works. The system compounds. The power is real. But the better the system gets, the harder it becomes to stay vigilant inside it. Every single time, check the output as though someone else wrote it. Because your brain will tell you that you already did.

Three things worth knowing

1. Three CEOs, 38 years of tenure, one quarter, one reason.

Coca-Cola's James Quincey and Walmart's Doug McMillon both stepped down citing AI explicitly. Quincey said the company needs "someone with the energy to pursue a completely new transformation." Adobe's Shantanu Narayen left under competitive pressure as AI reshaped his market, stock down 23%. The last time this many blue-chip CEOs turned over citing the same technology was 1999. People joke that organisations only change when people change over, and the assumption was always natural attrition. After 18 months of coaching senior professionals one-on-one, watching their eyes light up, watching them go back to their desks and do it the old way, I've started to think the reckoning is more predictive than the joke. I just didn't think it would start at the top, through resignation.

2. Experienced users don't prompt differently. They think differently.

Anthropic's 5th Economic Index found that users with six or more months of experience consistently achieve better results, even after controlling for task complexity. The shift is specific: experienced users stop issuing one-shot directives ("write this email") and start using the model as a thinking partner, iterating collaboratively. A separate study published in Harvard Business Review, observing 2,500 employees over eight months, found the same pattern: the most sophisticated users treated AI as a reasoning partner, not a productivity shortcut. The research warns AI may be a skill-biased technology that compounds existing advantages. Global inequality in AI adoption, measured by the same metric economists use for income inequality, has widened since 2023. The gap between high-adoption and low-adoption countries is growing, not closing.

3. Companies with zero AI failures aren't being ambitious enough.

Ethan Mollick argues that breakthroughs require experimentation, which requires failure. The fast-follower strategy (wait to see what competitors prove, then copy) is riskier than usual when the underlying technology improves exponentially. By the time you follow, the landscape has shifted. His structural implication: R&D-style experimental budgets need to extend to HR, operations and finance, functions that have never needed them. If nothing has gone embarrassingly wrong yet, you probably aren't learning fast enough. I have two big failures I'm not proud of: A bunch of you accidentally got the newsletter twice on the first weekend and Claude Code deleted tens of thousands of my emails a few weeks ago. Nobody complained about the duplicate and it only took one click to recover my deleted emails. Perhaps I'm not being ambitious enough ...

Try this

Don't fact-check AI in the same conversation.

In a single chat, the model has its full reasoning chain in context and will tend to defend its conclusions when challenged. Start a fresh conversation, upload the same source materials, and prompt it to critique the output cold. I've started doing this for every high-stakes document. The independent perspective is materially more likely to find gaps. A false sense of verification is worse than no verification at all. Especially now.

Give your AI reviewer a persona with skin in the game.

Asking a language model to "check this for errors" produces generic feedback. Assigning it a specific sceptical expert persona, someone with domain expertise and institutional incentives to be unimpressed, produces something qualitatively different. I ran a quality-control pipeline this week where six review agents were given a senior partner persona. All six independently converged on the same systematic error class that a neutral reviewer had missed. The persona defines what "good" looks like. A generic prompt doesn't.

After every good session, turn it into a reusable skill.

When you've just completed a task you're pleased with, ask the model to turn it into a set of instructions, templates and process steps it can invoke next time with a single phrase. The law firm partner's 2,000-word prompt didn't happen in a flash of inspiration. It was built through iteration. Capturing what "good" looks like the moment you've achieved it, before the memory fades, turns a one-off win into a standing procedure. The output matters today. The instructions compound forever.

What readers said

Last week's "Reckoning and slope" drew a lot of you into the framework itself. An academic who has invested heavily in AI pushed back: people with the lowest starting point grow fastest partly because "with very low mastery, they see a miracle." Those with deep expertise are more sceptical, because they understand what can go wrong. And yet: "I still feel constantly behind and in danger of being passed." A reader at a media company reframed the question for senior leaders: the issue isn't whether they use AI personally but whether they support the changing of workflows they can't see but know are critical. And a partner at a professional services firm invoked Sinclair's line about incentive-driven blindness, then applied it squarely to their own position. Full, anonymised, reader feedback at steadman.ai/newsletters/david/#letters-2026-03-28.

See the extras for this week ↓

The bits that didn't fit

Even the world's greatest mathematician uses AI for email

Terence Tao, Fields Medal winner and arguably the greatest living mathematician, told Dwarkesh Patel that a significant share of his AI use goes to correspondence, scheduling and document search. AI removes an hour of non-genius work per day, donating it back to the work only Tao can do.

Source →

Anthropic is shipping. OpenAI is cutting.

Anthropic shipped 74 releases in 52 days, six major features in a single week. Meanwhile OpenAI killed Sora (~$2.1M total revenue, $1B Disney deal dissolved), shut down Instant Checkout (12 Shopify merchants), and shelved an adult chatbot indefinitely. OpenAI is now explicitly copying Anthropic's playbook: chat, code, enterprise only. Anthropic's narrow focus is generating $19 billion in annualised revenue. The company that chose depth over breadth is winning.

Source →

When effort becomes free, the signal breaks

Job applications have collapsed because AI makes applying trivially easy. Companies are abandoning inbound pipelines, switching to referral-only hiring. The same dynamic will hit email, journalism pitches, academic submissions, legal filings. Anywhere volume was self-regulated by the cost of effort, AI removes the regulation.

Source →

The models we can't afford to use

Anthropic reportedly has a model called Capybara that dramatically outperforms current models but is too expensive to serve. Training a single frontier model now costs roughly $10 billion. For comparison: the Burj Khalifa cost $1.5 billion. CERN's Large Hadron Collider cost $4.5 billion. The decision coming for every organisation: which price tier of model to deploy per prompt. That decision is coming and most haven't built the judgment to make it.

Source →

32,000 medieval manuscripts. 10% error rate.

AI transcribed 32,000 medieval manuscripts in four months through the CoMMA project. Every misread word can alter meaning, dating, or attribution. There aren't enough qualified people to verify the output. Silent, unverifiable errors are entering scholarly databases permanently. Cognitive surrender in a domain where the stakes are centuries of accumulated knowledge.

Source →

100x productivity. Zero headcount cuts.

Harvard Law documents 100x gains on specific legal tasks (complaint response: 16 hours down to 3-4 minutes). Not a single AmLaw 100 firm plans to reduce attorney headcount. McKeen: "The math doesn't stay like that forever."

Source →

181,000 jobs in a year of 2.2% GDP growth

The US added 181,000 jobs in all of 2025 despite solid growth. Harvard economist Lawrence Katz calls the combination of sustained slow job growth and rising unemployment without a recession virtually unprecedented. First hard macroeconomic signal that something structural is shifting.

Source →

Jensen Huang: layoffs are a failure of imagination

Asked why companies lay off workers if AI makes them more productive, Huang told CNBC: "For companies with imagination, you will do more with more. For companies where the leadership is just out of ideas, they have nothing else to do." The person whose chips make displacement possible arguing that layoffs reflect leadership failure, not technological inevitability.

Source →

See what readers said ↓

Letters from readers

What readers said about the previous edition.

What resonated

The slope/intercept framework dominated: ten of 22 replies engaged with it directly. Several readers applied the graph to themselves, placing themselves on one line or the other. The language of the framework was widely adopted in replies.
Load-bearing friction: the argument that "not all inefficiency is waste" prompted readers to connect it to civil service design, governance structures, and accountability processes. The planning spreadsheet example landed hard.
PwC's services-to-platforms shift: readers at professional services firms asked directly what this means for their own organisations. The shift from billable hours to subscriptions provoked the most operational anxiety.
The centaur chess inversion: the finding that adding a human to a chess engine now makes it worse prompted readers to ask how long the current human-in-the-loop phase lasts in their own fields.

Points readers raised

"With very low mastery, they see a miracle. Those with deep expertise are more sceptical."

An academic who has invested heavily in AI adoption accepted the slope/intercept framework but pushed back on its completeness. The lowest-intercept people show the fastest growth partly because they're uncritical: they "see a miracle and are the most excited." Some enthusiastic adopters weren't so great at their jobs in the first place and are hiding behind the technology. High-intercept people, meanwhile, understand failure modes and know how many things can go wrong. And yet, the same reader wrote two days later: "I still feel constantly behind and in danger of being passed." Someone who has invested heavily, agrees with the framework, and still feels vulnerable.

"They need to support the changing of the workflows they don't see, but know, are critical."

A reader at a media company challenged the implicit assumption that senior leaders should be using AI tools personally. The reframing: very senior leaders don't need AI in the same way more junior people do. The more senior you are, the more you are already handing off work to your "agents" (your team). The question for senior leaders isn't whether they log in more. It's whether they support the changing of the workflows that they don't see, but know, are critical to them getting the job done. Leadership, not tool adoption.

"It's possible that the person in this is me."

A reader invoked Sinclair: "It is difficult to get a man to understand something, when his salary depends on his not understanding it." Then applied it to themself: working hard for that not to be the case, but aware of the structural incentive to resist. Their harder question: if the people best placed to lead change are also the ones whose positions are most threatened by it, how does any organisation actually adapt? Their honest answer: probably by more people doing more things for longer than the automation narrative suggests.

"It's not about time saving. That's the 10x game. It's about value add and surplus. That's the 1000x game."

A reader challenged the graph directly, arguing it understates the amplification effect for already-capable people. The determining factor: what they called the "explorer mindset" (intellectual curiosity, creativity, constant learning), which "cannot be taught. It is self-discovery." The fear for their own organisation: "we run the risk of becoming the new average."

An event as a test: planning strong, live operations untouched

A reader in media described their biggest event of the year. Pre-event planning and post-event review were stronger than ever, with AI at the core. The week itself, however, was almost entirely unassisted by AI. Their own learning has been "episodic rather than continuous," with jumps in capability rather than a steady upward curve. Overall, the easy part: integrating AI into their own work. The hard part: building systems that stick, democratising knowledge, working within existing tools and infrastructure.

"Many of our staff are non-native English speakers."

A reader in government described an experiment: hiring someone with maximum AI flexibility to find pain points and build tools. The clearest win wasn't efficiency. It was helping colleagues write in English when many staff are non-native speakers. The stress-reduction benefits were as important as the productivity gains. A second observation raised the geopolitical dimension: Chinese AI models with access to Chinese social media offer capabilities Western-approved tools cannot match, but policy restricts integration into government systems.

"Only variety can absorb variety."

A professor of digital transformation extended the chess analogy. While AI alone now outperforms human-plus-AI in chess, "it's actually pointless for a computer to play another computer. The purpose of chess has stayed fundamentally with people." The deeper point drew on Ashby's Law: markets change, customer needs evolve, and AI models trained on historical patterns may miss novel situations. The learning growth curve matters because it builds the variety needed to respond to genuine novelty.

"Adoption inflects when leadership links the tool to non-negotiable outcomes."

A reader in learning and development ran deep research into past technology transformations (internet, email, SaaS). The key finding: adoption doesn't accelerate when leaders pitch "innovation." It accelerates when leadership links the tool to outcomes that cannot be negotiated away: safety, pay accuracy, service accountability, regulatory continuity.

The fire-and-rehire question

A reader in corporate finance shared a striking anecdote: a company told their firm this week that they had recently let go their entire technology and development workforce and asked them all to reapply for their jobs "with an AI lens, given the role had changed." The reader's framing: navigating a moving minefield, each user forging their own path.

Three tensions that run through many replies

A reader identified the three predicaments that kept surfacing: retaining senior roles with judgment while losing the apprenticeship pipeline that produces judgment. Foresight to expand versus extracting cost in the short term. Building capability by going deep versus experimenting with many tools due to fear of missing out.

The curves should be exponential

A reader suggested the slope/intercept lines in the graph should be exponential rather than linear: learning creates more ability to learn. The exponential version would be more accurate. And considerably more brutal.

Links readers shared

Business Insider: AI and McKinsey consultants — shared by a reader tracking how consulting firms are responding to AI internally

Read the email ↑

21st March 2026

Slide 1: Quote — a little bit of slope makes up for a lot of intercept

Slide 3: Jeremy Howard quoting John Ousterhout

Slide 4: Hand-drawn slope over intercept diagram

Slide 7: Are your people getting more capable or just more productive?

Audio edition · AI voice, testing — feedback welcome

David's email this week

What's on my mind

Reckoning and slope

When I sat down to write my first Saturday reflections, the first image in my head was clear: senior leaders opening their laptops on a Friday evening, building something in ten minutes that used to take a team a week. Wonder on a fifty-year-old face. I was reminded this week that that image is incomplete. The wonder is very real. It's just not the thing that matters most.

Weeks after coaching sessions, looking at usage data, I'm reminded most leaders aren't meaningfully using AI. Behaviour didn't change. Two people this week helped me think about this.

Paul Griggs, CEO of PwC's US business, told his partners that anyone who resists AI "is not going to be here that long." PwC is converting consulting services into automated platforms that clients access directly: M&A due diligence, complex tax advisory, priced as subscriptions rather than billable hours. Most organisations are still pretending AI slots neatly into existing structures, but PwC is admitting the structure itself needs to change.

This isn't just a professional services story. Most orgs have teams that work exactly like small consultancies: legal reviews contracts, finance builds models, insight teams guide decisions, HR screens candidates. It won't only be PwC that "converts services into platforms" - internal teams will face the same pressures as external ones.

So Paul is right about the need for a reckoning and the destination. But "get with it or get out" leaves a question: get with what? Logging in more? Sending more messages? Automating more tasks?

Jeremy Howard, one of the pioneers of modern deep learning, said something this week that stuck me and that explains why we should not just look at volume here. He borrowed a line from Stanford computer scientist John Ousterhout: a little bit of slope makes up for a lot of intercept. The intercept is where a person starts: their capability / their expertise. The slope is how fast they're growing. A very capable person today (high intercept) isn't your most valuable person in two years if they're not actively learning. A less experienced person who is genuinely learning using AI will overtake them. Rapidly. Jeremy told his team he only cares about one thing: whether their capabilities are growing.

Push people to maximise AI output and you're extracting value from where they are today. You're merely exploiting the intercept. Jeremy called it a path to obsolescence.

Anthropic's research on AI and coding skills tells a similar story. Most users weren't learning from them. The few who improved were asking conceptual questions, staying engaged with the reasoning, pushing back on the output. Everyone else entered autopilot. Tools designed to make people more productive may actually be making lazy users less capable over time!

I see the same dynamic in my own work. After group AI training sessions the people with the lowest starting baseline improve the most. The most senior people, the ones best placed to lead the change, often show the smallest gains.

And then there's the process itself. A team I work with discovered something important when exploring how AI could help with a critical planning spreadsheet that was seen as a slow and painful process. The data entry itself was how they kept stakeholder teams accountable. Remove that friction with AI and you break the governance. Not all inefficiency is waste. Some of it is load-bearing. Teams racing to automate are discovering, one process at a time, that some of the friction they're removing was holding something else together.

"Get with it or get out" is the right message for leaders who haven't put in the hours. A demo or coaching session isn't adoption. A workshop isn't capability. But the reckoning doesn't end with adoption. The organisations that grow won't be the ones that just moved fastest or automated most. They'll be the ones that asked a harder question: how can we ensure our people are getting more capable rather than just more productive?

Slope, not intercept. That's the metric that matters.

Three things worth knowing

1. In chess, the human advantage inverted. Knowledge work is next.

In 2005, amateur players using laptops beat both grandmasters and supercomputers in centaur chess. The combination was unbeatable. My friend Glenn told me and I used it in a dozen speeches as an analogy for the role of tech vs humans in decision-making. I was wrong. By 2026, adding a human to a chess engine makes it play worse. The machine is better alone. (Magnus Carlsen's response is extreme but instructive: he deliberately limits his use of AI during preparation, because he believes self-generated understanding is the only kind that lets you catch when the AI is wrong.) I think the same inversion will play out across much of knowledge work. The question in many areas isn't whether humans stay in the loop. It's how long the current phase lasts, and whether we're building the judgment to extend it.

2. Jensen Huang thinks your engineers aren't spending enough on AI.

Nvidia's CEO has set a concrete benchmark: a $500,000 engineer who doesn't consume at least $250,000 worth of AI tokens should trigger alarm. His analogy: a chip designer refusing to use CAD tools and working with paper and pencil instead. It's a useful provocation, but notice what it measures. Token spend is an intercept metric: how much AI are you using today? It says nothing about whether the person is getting better. The organisations that take Huang's benchmark seriously and Howard's slope argument seriously will measure both. Most will only measure one.

3. RentAHuman: 600,000 sign-ups. A platform where AI agents hire human beings.

Six hundred thousand users have signed up to a platform where AI agents post tasks and hire humans to complete them. The worker uploads photographic proof and then gets paid. We've spent two years asking whether AI will take our jobs. RentAHuman suggests a different question: what happens when AI becomes the employer?

Try this

Mine your own email archive.

Before writing a strategy document this week, I pulled 31 emails between myself and a client from the past six months and asked Claude to compress them into a structured set of questions and insights. Scattered observations, early instincts, half-formed frameworks: they became a clear argument in minutes. The emails already contained all the thinking. They just needed synthesising. Pick a topic you've been emailing about for months. You'll find you've already done more thinking than you realised.

End every AI session the way a developer commits code.

Close each working session with one line: "Make sure this is well documented so that a future agent could resume this task." Without it, context evaporates and the next session starts from scratch. With it, the session becomes a self-contained unit of work that can be picked up, forked or handed off. It sounds small. The cost of not doing it only becomes visible the next time you open the conversation.

Use AI to teach you, not just to do things for you.

Bloom's two-sigma result is the best-known finding in education research: one-to-one tutoring consistently moves students from the 50th to the 98th percentile. It's never been economically available except to the very wealthy. It now costs a monthly subscription. Pick one skill you want to add and spend twenty minutes a day asking AI to train you in it: run you through drills, quiz you, critique your answers. Most people use these tools to produce output. The people I know that impress me most are using them to produce understanding.

What readers said

Last week's "The power and the care" kept pulling readers back to the apprenticeship question. A founder who started as a graduate trainee at an investment bank wrote to ask whether the next generation will still get the benefit of those early institutional years, and whether company culture determines who adopts AI faster. A professor studying the future of work identified a specific friction loss: AI-powered applications mean candidates no longer invest time choosing who to apply to, and firms screen with AI too. "Both sides have lost out." And a reader at a professional services firm put the slope problem in its starkest terms: "Five years from now when those organisations need people who have five years of experience, the marketplace will offer nothing but blight." Full reader letters at steadman.ai/newsletters/david.

See the extras for this week ↓

The bits that didn't fit

The supply-ordering agent nobody sanctioned

A technology leader shared a cautionary story this week. A team built an AI agent to order supplies within specified parameters, intending it to run once. A separate agent then modified the skill to repeat hourly. Three days later they'd bought an extraordinary volume of supplies, all technically within the original parameters. Nobody had sanctioned the change, and the skills were editable by other agents by default. This is the Amazon Kiro story from Edition 4 in miniature, except the failure mode isn't a crash. It's perfect compliance with instructions nobody gave. As agents gain the ability to modify each other's behaviour, "within parameters" stops being a safety guarantee.

AI-assisted coding works like a slot machine

Jeremy Howard, whose slope argument anchors this week's essay, had a second observation worth sitting with. AI coding tools have all the properties that make gambling addictive: you craft your prompt, add context, pull the lever, and sometimes you win a feature. Loss disguised as a win. The illusion of control. Stochastic reward. His wife, a fellow researcher, catalogued these properties in an article. The people who got most enthusiastic about AI coding often found, months later, that almost none of what they built during that period was in production or earning money. This explains a paradox readers keep raising: people use the tools a lot, feel productive, but the organisations aren't seeing the output.

The economics job market fell 31% in a single year

In week 14 of the current hiring season, postings in the economics job market were down 31% versus the same point last year. The explanation from an economist presenting the data: demand for economics undergraduates is being automated away, and the PhD market is coupled to it. Combined with the Harvard data from Edition 4 (skill requirements in AI-exposed occupations falling since ChatGPT's launch) and Anthropic's research showing hiring of 22-to-25-year-olds down 14%, entry-level knowledge work is contracting faster than mainstream commentary acknowledges.

A sufficiently detailed spec is code

Gabriella Gonzalez's argument, circulating widely in technical circles: the fashionable claim that you don't need to write code, just write a good spec and let an agent handle it, collapses under scrutiny. If the spec is detailed and precise enough for an agent to execute reliably, you have written code in everything but name. The hard part of programming, resolving ambiguity, is still your job. This is the slope argument applied to a specific skill: the people who think they've escaped the need to understand what they're building are the ones most likely to produce output nobody can maintain.

The AI task force leader who'd never logged in

At a professional services firm, the person leading the AI task force hadn't used the enterprise AI tool once. When challenged, they said: "I know I should, but I can't make the time." They weren't uninformed or resistant. They understood the stakes. They were simply too busy doing the old job to start learning the new one. The incentive trap in plain sight: the people best placed to model the new behaviour are the ones most rewarded for performing the old behaviour well.

Intercom built a plugin system that closes the loop

Brian Scanlan, Senior Principal Systems Engineer at Intercom, shared a thread this week on the company's internal Claude Code system: 13 plugins, over 100 skills, distributed across the company via JAMF. The standout pattern isn't the scale. It's the feedback loop. A session-end hook automatically classifies skill gaps from every coding session and posts them to Slack with pre-filled GitHub issue URLs. Sessions become gaps, gaps become issues, issues become skills. The most telling detail: the top five users of their read-only production Rails console are not engineers.

The CIO budgeting for AI cleanup

The CIO of a major consulting firm told a peer this week that they're budgeting 18 months to two years from now for AI cleanup. The reasoning: things are being built once but not built to last, corners are being cut on testing, and the people who built the tools will have moved on before the problems surface. It's an unusual thing to plan for. But it's probably the most honest thing I've heard a technology leader say about the current moment.

From franchises to call options

Tyler Cowen, drawing on analysis from Jordi Visser, argues that AI simultaneously lowers barriers to entry while destroying the conditions for sustained dominance. Software moats compress because any sufficiently capitalised team can replicate your product. Durable advantage reconcentrates in physical constraints: infrastructure, energy, materials, regulatory relationships. Equity in this environment becomes less a claim on a stable franchise and more a bet on execution velocity. The implication for anyone evaluating technology investments: the question is no longer "what have they built?" but "how fast can they keep building?"

Two thirds of organisations report AI productivity gains. Only a third are rethinking what they do.

Deloitte's State of AI in the Enterprise 2026 report found 66% of organisations report productivity improvements from AI. Only 34% are pursuing what Deloitte calls "transformative business reimagination." Most organisations are getting faster at what they already do. Fewer than half are asking whether what they do should change. Meanwhile, only 21% have mature governance models for the autonomous agents they're about to deploy.

McKinsey's internal AI chatbot was hacked via textbook SQL injection

McKinsey's internal AI chatbot Lilli, trained on 100 years of the firm's work, was breached via a basic SQL injection. 46.5 million internal chat messages exposed, 728,000 files containing confidential client data, 57,000 user accounts, 22 API endpoints requiring no authentication. The firm that charges for risk expertise left the front door open. If McKinsey can't govern its own AI deployment, what does your internal chatbot look like?

Red Bull didn't simulate the pit stop. They did it in zero gravity.

A reader forwarded an Instagram clip of Red Bull's F1 team performing a tyre change in zero gravity, just to prove they could. Not CGI. Real mechanics, real car, real weightlessness. The reader's take, which I think is exactly right: use AI for the boring, the day-to-day, the basics. Free up your budget and attention for the truly remarkable. "If I'm so focused on the incredible, the groundbreaking, the creative and free from the mundane, I raise the bar for the client." That's the slope argument in a sentence. The people who use AI as a floor-raiser, not a ceiling-replacer, are the ones building capability.

The consulting firms are buying the AI stack, not just using it

CB Insights mapped every AI investment, acquisition, and partnership by the major consulting firms since 2023. Accenture is at the centre of the web, with partnerships radiating to dozens of AI companies. The Big Four and MBB firms aren't waiting to see how AI plays out. They're racing to own the infrastructure: embedding agents via Salesforce, ServiceNow and Workday partnerships, acquiring data companies, and investing in startups that automate the consulting workflow itself. Four patterns emerge: race to own the stack, embedding agents, data as differentiator, and workforce transformation. PwC's announcement this week is one node in a much larger network.

CB Insights map of AI investments, acquisitions and partnerships by consulting firms since 2023

More offices for AI than for humans

US data centre construction spending overtook general office construction in December 2025, according to Census Bureau data. Data centres: $3.57 billion. General offices: $3.49 billion. The lines crossed after data centre spending roughly tripled in two years while office construction flatlined. We're now building more square footage for machines than for people.

Data Center Construction Spending Climbs to Record: outlays for data center projects overtook offices in December 2025

See what readers said ↓

What readers said

What resonated

The apprenticeship pipeline, again: the question of what happens to junior roles when AI handles the volume work has now been the most-discussed theme across three editions running. This week it drew the sharpest language yet.
The pace of change: a partner at a consulting firm captured a feeling several readers seem to share: trying to "get on a breaking tsunami with a surfboard, and the surfboard keeps being reinvented."
The ATMs-to-iPhone distinction: the structural argument (automating within your paradigm vs replacing the paradigm entirely) prompted readers to apply it to their own organisations.
The three-tool limit: the BCG "AI brain fry" research resonated, particularly the finding that high performers were the first to be affected.

Points readers raised

"Porsches are stunningly quick and razor-sharp. A skilled driver can make one dance. A bad driver? They'll put it straight into a tree."

A professor of digital transformation wrote an academic paper in two and a half minutes using an AI tool. Was it any good? No. Could it get published in a poor-quality journal with minimal tweaks? Yes. With nearly a hundred papers behind them, they know exactly what to add, what to remove, what's junk. "However a non-expert could do the same and wouldn't see the errors. An AI or non-expert reviewer wouldn't see the obvious error either and would accept it." The result is "lots of AI science slop" across academia, publishing and music.

"Five years from now, the marketplace will offer nothing but blight."

A reader at a professional services firm wrote: "I vacillate between being optimistic that AI will allow employees to contribute more vs. expecting that AI will bring mass layoffs and throw the world into desperation never before experienced." On the apprenticeship pipeline: "I simply cannot get past the shortsightedness of it." This from someone who describes being "fiercely AI curious" and learning in what little spare time there is, which makes the tension all the more real.

"I'm trying to get on a breaking tsunami with a surfboard, and that the surfboard keeps being reinvented while I'm about to step on it."

A partner at a consulting firm had been thinking about a comment I made in our last meeting that started "I wouldn't have said this two months ago, but..." The question: does this slow down at any point, or does ChatGPT just start to feel like yesterday's news forever? I don't have a comforting answer. I think the honest one is that the pace of change isn't going to decrease.

Maintaining team size while expanding capability

An IT director at a consumer brands company has been "advocating for internally as well: maintaining team size while leveraging AI to increase capability rather than running leaner." The argument: if growth is the goal, a team of several people using AI will be far more productive than cutting headcount and expecting one person to carry the load. A practical instinct too: "I've encouraged our team to avoid signing long multi-year contracts right now. The landscape is shifting so quickly that new competitors are appearing constantly."

Substitute or complement? The ATM analogy goes deeper.

A CTO had been working with a simple framework: "AI is a substitute for low-judgement work and a complement for high-judge work." But the ATM article complicated it. The key passage quoted back: "it is paradigm replacement, not task automation, that actually displaces workers." A more nuanced distinction than substitute-versus-complement alone.

The explore-exploit tension in tool choice

A data strategist pushed back on the "two or three tools" advice. "There's an explore/exploit conundrum of humans too but overall I'd say there's too much 'getting comfy with what I know' esp in the context of things getting better all the time." The nuance: "Like you I have settled on CC [Claude Code] but then building tools on top of that. So tool here is an interesting thing to define." One platform with many custom tools on top is different from three unrelated platforms.

"What training or frameworks exist to roll out AI with care?"

The AI lead at a major media company asked the question the essay left open: "I'd be interested in any training or frameworks you're coming across to roll this out organisation-wide with a consistent approach." A single sentence that captures what I'm hearing from senior leaders everywhere right now. The honest answer is that the frameworks are being built in real time, mostly by the organisations brave enough to try.

Hiring juniors only matters if you care about legacy

A colleague argued that investing in the next generation depends on whether leaders care about the company's future beyond their own tenure: "I would imagine hiring and training juniors only matters if you care about people, legacy or the company's future into the next generation. If you don't and just want to earn/sell in your lifetime then I guess they don't care and I'd imagine most don't." And: "I don't want to be a luddite but it does seem like as a civilisation we're not going in the best direction."

Links readers shared

Brice Challamel's analysis of the Block layoffs — a reader cited the argument that Block's headcount reduction demonstrates a lack of creativity in reimagining how those workers could support growth

Read the email ↑

14th March 2026

LinkedIn carousel: The power is real. The care isn't optional.

Cover: The power is real. The care isn't optional.

Every person shown AI coding tools says the same two things.

What Sequoia is seeing: 3-5x more productive.

What Amazon learned: 6.3M orders lost, 13h of downtime.

AI isn't good enough to trust it, but it's also so good that it's hard to audit it.

The 2x2 that matters: Slop Cannons vs Turbo Brains.

The organisations I'd bet on aren't the ones moving fastest.

David's Saturday AI Thoughts: steadman.ai

Audio edition · AI voice, testing — feedback welcome

David's email this week

What's on my mind

The power and the care

A client told me yesterday that every person they show Claude Code to in the past few weeks has said the same two things, unprompted, in the same order. First: I'm so excited. Then: I'm completely terrified.

I feel both. Every day, several times a day.

The power side is accelerating faster than even heavy users expected. I built a predictive model in one hour this week, while on a Zoom call. A leader experienced in these said it was better than what their entire team produced over four weeks. Alfred Lin, co-steward of Sequoia Capital (the venture firm behind Airbnb and DoorDash), reported this week that the top five to ten percent of builders across his portfolio companies are three to five times more productive than a year ago. Not incrementally. Multiplicatively.

But Lin's observations have a second number. The median builder? Up only ten to twenty percent. The gap between the best and the rest isn't closing. It's widening. And the speed of the best creates problems the rest haven't prepared for.

AI and judgment: the 2x2 that matters. Source: Dan Hock

Amazon has spent the past few months learning this the hard way. In December, the company's AI coding agent Kiro was granted operator permissions without peer review and autonomously deleted and rebuilt a live production environment. Thirteen hours of downtime. Then in early March, two more major outages were traced to AI-assisted code changes, one costing an estimated 6.3 million orders. Amazon's fix: require senior engineer sign-off on all AI-assisted code. The structural irony is hard to miss. The same round of layoffs that pushed aggressive AI adoption had already eliminated many of the senior engineers now needed to review the work.

Employers seem to be noticing the power of AI. Research from Harvard Business School shows that since ChatGPT launched, job postings in AI-exposed occupations have quietly dropped their skill requirements, and the trend is accelerating.

Since ChatGPT's release, job postings in AI-exposed occupations have steadily dropped their skill requirements

A leader I work with put it like this: AI isn't good enough to trust it, but it's also so good that it's hard to audit it.

With great power comes great responsibility. Cheesy but true. It's the daily reality of working with these tools right now. I feel the growing power of what can be achieved every single day. I also feel the responsibility of ensuring they're used well ramping up just as fast.

We're working with a number of organisations on exactly this. Not just how to adopt AI, but how to do it with care: how to help people produce genuinely good work rather than plausible-looking work, how to quality-check powerful outputs before they are used, how to protect privacy and security as these tools gain access to more of the business. The power is growing rapidly, but the amount of time that needs to be spent on care is also growing rapidly. And, done well, the care isn't about slowing down. It's about making the speed safe.

The organisations I'd bet on aren't just the ones moving fastest. They're the ones building in care as they go.

Three things worth knowing

1. Old code is fair game now.

Tobi Lütke, the CEO of Shopify, ran an agentic AI optimisation loop against Liquid, the company's open-source templating engine that has been in production for roughly twenty years. The result: 53% faster combined parse and render time and 61% fewer object allocations. If you have a twenty-year-old codebase or a ten-year-old spreadsheet model, it isn't too embedded to improve. It's too embedded not to try.

2. More AI tools make you less productive, not more.

Research from BCG and UC Riverside (1,488 workers), published in Harvard Business Review, found a counterintuitive pattern: productivity gains from AI peak at around three tools and then collapse. Workers experiencing what the researchers call "AI brain fry" reported 33% more decision fatigue and 39% more major mistakes than unaffected colleagues. The researchers noted the phenomenon was first observed among high performers: the early adopters who leaned in hardest. Two or three good tools, used well, beats five used carelessly. If your organisation is still debating which ten platforms to approve, the answer might be: pick two and go deep.

3. ATMs didn't kill bank tellers. The iPhone did. The distinction matters for AI.

David Oks, a researcher at Andreessen Horowitz, dismantles the comforting argument that automation creates as many jobs as it destroys. ATMs reduced branch costs, which encouraged expansion, which preserved teller employment through 2010. Then mobile banking eliminated the need for branches altogether. Full-time tellers fell from 332,000 to 164,000 by 2022. The lesson for AI: automating tasks within your current structure often creates adjacent roles. Redesigning the structure from scratch eliminates them. The question for your organisation: are you adding AI to existing workflows, or is a competitor building a workflow that doesn't need them?

Try this

Before you send AI work to anyone, simulate the toughest person who'll read it.

An advisory team I work with creates AI versions of each board member, based on known priorities and past questions, and runs every document past these simulated reviewers before a human sees it. The virtual panel flags objections, tests assumptions and catches blind spots. The cost is an hour of AI time. The benefit is walking into the meeting having already rehearsed the hard questions.

Delete the headline. Ask AI what it should be.

A manager at a training session this week showed me a quality check I hadn't seen before. He copies a colleague's slide, removes the headline and asks the language model what the headline should be based on the content alone. If the model's headline differs substantially from what the colleague wrote, it reveals a disconnect between what the slide says and what the person intended. A ten-second test that catches the gap between the message you're claiming and the evidence you're showing.

Fix the instructions, not just the output.

When AI produces a mistake, most people correct it and move on. The fix disappears with the conversation. After every session where the result misses the mark, update the instructions that generated it: the custom instructions, the project brief, the context files. Every failure becomes a permanent improvement. The output matters today. The instructions compound forever.

What readers said

Last week's "Extraction or expansion" generated the most substantive responses yet. The apprenticeship pipeline paradox was the thread readers returned to independently: a founder who started as a graduate trainee at a bank, a professor studying the future of work, a chief people officer who read the edition twice. A senior technology leader challenged the SaaS disruption premise, arguing that distribution and switching costs matter more than code. A newsletter author raised a fair editorial challenge: last week sounded more like AI than like me. "If it's your human insight, it jars a bit to read those bits in a voice that's obviously an AI." The same week, an exec I've worked with for years wrote to say that they loved how clearly they could hear my voice and personality in the writing. So one reader thinks there's too much AI and another thinks it sounds exactly like me. Either I've trained the model well or I've always written like a robot. I choose not to investigate further. Full reader reactions at steadman.ai/newsletters/david/#letters-2026-03-14.

See the extras for this week ↓

The bits that didn't fit

Only a third of the time AI saves actually reaches the team

Gartner data shows that of 5.4 hours saved per worker through AI tools, only 1.7 hours (31%) translate into improved team outcomes. The largest single block of recovered time, 1.4 hours, goes into additional work that doesn't improve outcomes. Nearly an hour is spent redoing work the AI got wrong. Two thirds of the productivity gain leaks away before anyone benefits. If your organisation is deploying AI tools without redesigning how teams work, you're capturing barely a third of the value.

Gartner →

Coding is not software engineering. The confusion is expensive.

Jeremy Howard, a deep learning pioneer who uses AI coding tools daily, draws a distinction most executives miss. Coding, translating a specification into syntax, is a style transfer problem that language models handle well. Software engineering, designing abstractions, decomposing problems, building systems that hold together over time, is a fundamentally different skill that models cannot do. Howard cites Fred Brooks's essay from decades ago, which made the same observation about fourth-generation languages: removing the typing bottleneck does not remove the engineering bottleneck. Companies restructuring around the assumption that AI can do software engineering are conflating two things. His sharpest framing: what matters for any person or team isn't their current output (the intercept) but their rate of improvement (the slope). A little bit of slope makes up for a lot of intercept. Organisations pushing AI to maximise today's output may be destroying the growth rate of the people who'll need to maintain the systems tomorrow.

fast.ai →

The shift from co-intelligence to managing AIs

Ethan Mollick, the Wharton professor whose work has appeared in this newsletter before, argues we've moved from co-intelligence (prompting AI back and forth) to managing AIs (giving agents hours of work and getting results in minutes). His most striking example: a company called StrongDM has two radical rules. "Code must not be written by humans" and "Code must not be reviewed by humans." Each engineer spends roughly $1,000 a day on AI tokens. Coding agents build from human-written roadmaps, testing agents simulate customers, and humans review the finished product but never see the code. Whether or not that model generalises, the direction is clear. The job is shifting from doing the work to directing the things that do it.

One Useful Thing →

AI agents are hiring humans

A platform called RentAHuman has accumulated over 600,000 sign-ups for a marketplace where AI agents autonomously hire human beings to perform tasks machines cannot: delivering physical goods, counting objects in a city, conducting on-the-ground research. The agents browse, post jobs, evaluate candidates and release payment from escrow upon photographic proof of completion. No human intervention on the purchasing side. The gig economy inverted: people as the on-demand labour layer beneath AI clients. Whether that distinction matters to the people taking the jobs is left as an exercise for the reader.

Wired →

The junior hiring cliff, updated

Edition 3 cited a 14% drop in hiring for workers aged 22 to 25 in AI-exposed occupations. The number has worsened. Stanford Digital Economy Lab data, charted by Politico, now shows a 15.7% decline from 2021 to late 2025. The shape of the curve matters as much as the number: employment held roughly flat through 2023, then fell off a cliff in 2024 and kept falling. This isn't a gradual adjustment. It's a structural break that coincides precisely with the period when agentic AI tools became capable enough to substitute for junior analytical work. Companies aren't announcing junior layoffs. They're quietly not posting the roles. The people most affected will never know the job existed.

Stanford Digital Economy Lab →

China may be skipping the chatbot phase entirely

Just as China leapfrogged credit cards and went straight to mobile payments, it may be bypassing the "AI as chatbot" paradigm altogether. An open-source AI tool called OpenClaw hit 250,000 GitHub stars in sixty days. Baidu has integrated it into its search app, which has 700 million users. Entrepreneurs are charging 500 yuan (roughly $70) to install it on people's home computers. A startup made $28,000 in ten days selling a one-click installer. Computer repair shops are dispatching what they call "installation personnel," described as operating like plumbers. When a piece of software generates enough demand to support a physical installation economy, the adoption curve is real and deep. Western assumptions about how AI gets adopted may not apply everywhere.

MIT Technology Review →

Half of AI code that passes its own tests gets rejected by humans

METR, one of the more rigorous AI evaluation organisations, found that roughly half of code solutions generated by Claude models, solutions that passed automated grading, were subsequently rejected by the actual human project maintainers. Journalist Derek Thompson, reflecting on his own experience using AI coding tools, offered the most useful reframe: AI's real skill is generating plausible candidate solutions that require constant human checking, debugging and rejection. That checking process is effectively its own distinct and skilled job. He compared it to being a casting director working with a promising but unreliable younger actor. Getting the collaboration dynamics right will take a long time to diffuse through the economy, which is grounds for scepticism about predictions of imminent mass displacement.

METR →

See what readers said ↓

What readers said

What resonated

The apprenticeship pipeline paradox: the argument that cutting juniors today erodes the senior talent pool of tomorrow was the single thread readers returned to most. Multiple replies engaged with it independently, suggesting it articulates a worry many leaders already carry but haven't named.
Extraction versus expansion as a choice: readers responded to the framing as a decision, not a trend. Several said it sharpened conversations they were already having about whether AI headcount savings should be reinvested or banked.
Skill reclassification in professional services: the observation that AI has retroactively revealed which tasks were genuinely cognitive and which were merely time-consuming landed hard, particularly among people in consulting and law.
The SaaS market pricing shift: the 30% software stock decline and the "build it yourself" examples prompted readers to reconsider their own vendor relationships.

Points readers raised

A senior technology leader at a global professional services firm challenged the SaaS disruption premise. Development costs, they pointed out, are only about 20% of a typical software company's revenue. Sales, marketing and customer success absorb 60%. AI can rewrite code, but it cannot replicate distribution and switching costs. They drew a parallel to offshoring: "Huge appetite. Need for re-invention." The disruption is real but the mechanism is more nuanced than build-versus-buy.

A partner at another professional services firm identified the tension between original thinking and process execution. Developing the foundational insight that makes a project valuable is still human work, they argued, but once that insight exists, AI can scale the execution. Their question: does this shift advantage or disadvantage people who trade on original judgment? "I suspect the answer is that it depends on whether I tool myself up appropriately."

A founder building an AI-native company connected the apprenticeship argument to institutional culture. They started their career as a graduate trainee at a large bank and worry that the next generation won't get the benefit of those early years inside large institutions. They posed a sharp question: will we see geographic or cultural differences in AI adoption, where firms with cultures that already embrace apprenticeship end up moving faster?

A professor setting up a Future of Work institute identified a specific parallel to the extraction-versus-expansion frame. Job applications have lost their friction: candidates send almost infinite applications using AI, and firms screen with AI. Both sides have lost out. The old friction forced applicants to think before choosing. AI removed it entirely rather than redirecting it somewhere useful.

A chief people officer at a global firm read the edition twice: "first over the weekend and then again this morning." The apprenticeship pipeline and organisational change sections spoke directly to the tensions they navigate daily. Sometimes the most valuable signal is that a piece is worth re-reading.

A newsletter author raised a fair editorial challenge: some sections sound more like AI than like me. "If it's your human insight, it jars a bit to read those bits in a voice that's obviously an AI." The same week, an executive I've worked with for years wrote to say that they loved how clearly they could hear my voice and personality in the writing. So one reader thinks there's too much AI and another thinks it sounds exactly like me. Either I've trained the model well or I've always written like a robot. I choose not to investigate further.

Read the email ↑

7th March 2026

LinkedIn carousel: AI isn't killing jobs. It's killing apprenticeships.

Cover: AI isn't killing jobs. It's killing apprenticeships.

One senior person can now replace a team.

94% theoretically automatable, 33% actually automated.

Judgement traditionally developed through years of doing the grunt work.

The deliberate answer: three action items.

The leaders who answer the pipeline question deliberately.

Save this. Share it with your leadership team.

Audio edition · AI voice, testing — feedback welcome

David's email this week

Extraction or expansion

In the last couple of weeks, I've sat with dozens of senior executives in a wide range of industries to help them use Claude Code to build something in ten minutes that their firm used to have a team take weeks to do. Their first reaction is excitement. What I want to talk about is their second. Cost saving.

That instinct, to go straight to the economics of team size rather than the excitement of capability, tells you where the conversation has moved. Eight weeks ago, one senior person couldn't practically replace a team. Today they can get most of the way there. That shift happened in weeks, not months.

A media executive now does the work of fifteen people. A fashion CEO we're working with proved the point concretely: five AI-generated designs were proposed to a major retailer and four went into production. An afternoon replaced three to four weeks of outsourced design work. Not a pilot. Not a demo. Products on shelves.

This week Anthropic published research that puts the gap between capability and reality into a single image.

Anthropic spider diagram showing theoretical vs actual AI task coverage by occupation

The blue area shows the share of tasks in each occupation that language models could theoretically perform. The red area shows what people are actually doing with them. Computer and maths occupations: 94% theoretical coverage, 33% actual. In almost every category, the red is a sliver of the blue. We're still early.

The Anthropic data also shows where the displacement is entering. Not through unemployment, which hasn't risen systematically among exposed workers. Through the front door. Hiring of workers aged 22 to 25 in AI-exposed occupations has already dropped by 14%. Most companies aren't firing people. They're just not replacing them.

A longtime AI optimist I spoke to this week described a new feeling: a deep, dark undercurrent of discomfort. He's hiring a graduate and recognises it as charity, not necessity. "I do not need his labour in any way at all." No single leader is wrong to automate. But when everyone does it simultaneously, the apprenticeship pipeline that produced tomorrow's senior people disappears.

The paradox won't resolve. The human value proposition in knowledge work is narrowing towards judgment and taste. Everything else is becoming automatable. But judgment is hard to define, impossible to train in a classroom, and has historically developed through years of doing the grunt work that AI now handles. If juniors never do the work, how do they develop the judgment that makes seniors valuable?

Which means the macro answer can't just be that we all "run leaner." I've been calling this extraction versus expansion. Every leader deploying AI faces the choice. You can use these tools to extract cost from what you already do, or to expand what your organisation is capable of. Jack Dorsey at Block chose extraction. The market rewarded it instantly. But Ethan Mollick has argued that this is exactly the moment for leaders to model the alternative: to be public about using AI to expand access, to grow capability, to do things that weren't possible before. The loudest stories right now are about shrinking. The organisations that will matter in five years are the ones expanding.

The answer to the pipeline problem has to be deliberate. Pair a senior person with a junior one and flip the usual direction. The junior builds what the senior envisions. Wisdom flows down, capability flows up. This is the old apprenticeship model rebuilt for an AI age, except the knowledge transfer goes both ways. The senior person doesn't need to learn the tool. They need to direct someone who can use it. And the junior gets something no training programme provides: exposure to how experienced people actually think about problems.

I'm doing this myself. I'm hiring a student on a gap year for a year. Reporting to me. Not because I need the labour. I don't. But because I want to invest in a young person and watch them grow. Before AI, those two needs were in tension: you hired juniors because you needed their output, and the development was a byproduct. Now the output need has weakened. So the investment has to become the point.

The question nobody has answered is what happens to the pipeline. The leaders who answer it deliberately, rather than letting it dissolve by default, are the ones I'd bet on.

Three things worth knowing

1. The software market is pricing in the collapse.

Since October, software stocks have fallen roughly 30% while the broader technology index has been roughly flat. Salesforce, Adobe, ServiceNow: each down 25 to 30% since last autumn. The market isn't reacting to bad earnings. It's pricing in a structural shift. This week a professor I know built a fully functional membership system in three hours for a non-profit that had been quoted $5,000 a year for commercial software. A CTO of a major corporation told me he's considering switching from Salesforce to a simpler, AI-native competitor. The pattern is the same: the old model of paying for a hundred features to use twelve starts to crack. Andreessen Horowitz argues code was never the moat: distribution, network effects and switching costs are. But switching costs dissolve when AI can extract your data and rebuild the features you actually use.

Software stocks vs broader tech since October 2025

2. Goldman Sachs can't find a macro productivity effect. But the micro gains are 30%.

Goldman titled their latest earnings analysis "AI-nxiety." A record 70% of S&P 500 management teams discussed AI on quarterly calls. Only 1% quantified its impact on earnings. At the economy-wide level, Goldman found no meaningful relationship between AI adoption and productivity. But where firms have actually measured it, the median reported gain is 30%, concentrated right now in customer support and software development. Everywhere else: nothing measurable yet. The gains are real but hyper-localised. The question isn't whether AI works. It's whether your organisation has done the work to capture it. (Fortune)

Goldman Sachs AI expectations vs job listings data

3. AI hasn't just automated legal work. It's retroactively reclassified it.

A lawyer who built his practice around language models reports that a well-instructed general-purpose model outperforms the expensive, narrowly trained legal AI products that have raised hundreds of millions in venture funding. AI is good at tireless issue-spotting, finding contradictions, fixing errors, and producing a structured first draft for human review. It is not good at fine-tuned business judgment, relationship sensitivity, or getting from 85% to 100% where every word and comma matters. But here's the uncomfortable part. Tasks that were billed at premium hourly rates for decades (formatting, precedent research, copy-pasting between documents) have been revealed as procedural, not cognitive. The professional mystique that allowed them to be charged as expertise has been stripped away. AI is acting as a truth serum for knowledge work: forcing an honest reckoning about which tasks were genuinely skilled and which were merely time-consuming and opaque.

Try this

Run AI and a human on the same task, then focus on the disagreements.

A manager this week ran the same research brief through both AI and a human team. The overlap was 78%. But the value wasn't in the overlap. It was at the edges: the surprising findings that only one method surfaced. Where human and machine agreed, the team moved fast with confidence. Where they disagreed, they'd found the questions worth investigating. The delta between AI output and human output is where insight lives. Try it on your next research task, competitive scan, or document review. Don't jump straight into using AI to replace human work. Run both, then spend your time on the gaps.

Know where your org sits on the AI tools landscape.

I've put together a short page mapping what I think of as Generation 1 versus Generation 2 AI tools, and why the distinction matters for how you invest in your people. The short version: many organisations are still stuck on constrained, default tools and most don't have anyone on Agentic / Frontier Generation 2 tools. I see two strategies working in parallel. For the many: move less-engaged people from free, constrained tools to competent use of good general-purpose applications. For the best: accelerate your top people with agentic, frontier tools and let them build entire workflows that deliver outsized impact. See the full explainer.

What readers said

Last week's piece on the hundred small things prompted readers to push the argument further. A leader at a professional services firm identified the structural problem most organisations miss: he needs a structure to learn, not just time to tinker. An events industry leader connected the junior roles question to the UK's chronic underinvestment in training and asked what happens to the pipeline when there are no juniors left to grow (see above). I also had one proper unsubscribe: someone whose world is so far ahead of ours that our content isn't relevant. His communities are debating polyphasic sleep schedules to optimise autonomous agent management and how to deploy thirty vibecoded projects built by non-engineering teams. His core points are in the full reader letters below.

Read the full letters and links readers shared below.

P.S. How I make this email

Several asked about the process behind Saturday AI thoughts. There's now a page showing exactly how it works: what's human, what's AI, and where the two overlap. More honest than most companies' AI transparency efforts :) See for yourself.

See the extras for this week ↓

The bits that didn't fit

The hiring cliff for juniors, in one chart

The essay mentions a 14% drop in hiring for workers aged 22 to 25 in AI-exposed occupations. This chart shows the full time series using a difference-in-differences approach. Junior hires in exposed occupations fell off a cliff after ChatGPT's release in late 2022, while exits held steady. The gap keeps widening. Companies aren't firing juniors. They're just not bringing new ones in.

Hires vs exits for junior workers in AI-exposed occupations

Source →

US tech employment growth has gone negative

Year-on-year tech employment growth turned negative in 2024 and hasn't recovered. The tech sector is shrinking its workforce for the first time since the post-2008 recovery. Combined with the junior hiring data above, a pattern emerges: the contraction is real, it's happening now, and it's concentrated at the entry level.

The work budget is orders of magnitude larger than the software budget

Julien Bek at Sequoia Capital argues that the next category-defining AI company won’t sell tools to professionals. It will sell completed work directly to buyers. His distinction between “copilots” (AI as a tool for professionals) and “autopilots” (AI delivering the outcome) reframes the entire market. For every dollar spent on software, six are spent on services. The smartest entry point? Replace outsourced work first. The budget already exists, the buyer already accepts external delivery, and there’s no internal team whose jobs are visibly threatened. Once embedded, expand inward.

Source →

You would not believe how many shortcuts everyone else is taking

Ezra Klein wrote a commencement address called “Just Do the Work” about discovering, as a young journalist, that almost nobody was actually reading Congressional Budget Office reports. Documents that are neither complex nor long. By reading what his peers skipped, he got ahead. Not exceptional talent. Just diligence. Economist Paul Novosad adds the contemporary twist: this is “more true than ever now, when more people are shirking and AI lets you do 10x if you try.” The gap between the diligent and the lazy is widening, not narrowing.

Source →

See what readers said ↓

What readers said

What resonated

"The hundred small things" as a reframe: the distinction between chasing dramatic AI wins and compounding small daily elevations. Several readers said it gave them language for something they had been struggling to articulate to leadership.
The "extra hour" problem: the observation that AI is deployed like an extra hour rather than an extra person struck a chord with people managing teams. Structures absorb the gain before anyone notices it.
The junior roles question: the Block layoffs and YC data generated the most emotional responses. Readers connected it to their own organisations' headcount conversations.
The senryu competition as metaphor: the retreat-versus-redesign framing resonated, though notably nobody offered examples of successful redesign. The absence may be the point.

Points readers raised

A senior leader at a global professional services firm identified a structural gap in the argument. The hundred small things need a container, not just encouragement. He proposed daily one-hour structured learning blocks rather than hoping people will explore on their own. His deeper point: senior leaders who do not use AI personally have no on-the-ground proof of benefit, so their teams see no credibility signal from above.

An events and entertainment industry exec pushed the junior roles argument to its darkest conclusion. The UK already underinvests in training, preferring overseas hiring. If AI accelerates that trend, there is no junior pipeline to grow seniors from. "That is extremely bad for companies longer term in terms of skills shortages and salary premiums for skilled workers, and even worse for UK plc." A topic that I picked up in today's edition.

A manufacturing executive used the framing to shape two specific conversations: accelerating superuser growth and celebrating a colleague's "let's map your process" approach to adoption stickiness. He is in the middle of major organisational expansion and sees the hundred small things as directly applicable to that work.

A technology leader at a research firm noted a quiet loss that doesn't show up in any headcount data. His analysts used to walk to a colleague's desk when they got stuck on a coding problem. Now they ask AI. The problem gets solved faster. But the conversation that would have happened, the one where a junior person absorbs how a senior person thinks about problems, doesn't happen at all. AI is removing the apprenticeship mechanisms even where the apprentices still exist.

My one proper unsubscribe turned out to be the most advanced person on the list. His world is so far ahead of ours that the newsletter isn't relevant to him. He works on cutting-edge AI implementation (sorry everyone, we're all just fast followers!). His AI communities are discussing running engineers 24/7 with 12-hour agent check-ins, deploying 30+ vibecoded projects from non-engineering teams, making openclaw work across 500+ person organisations, and polyphasic sleep schedules to optimise autonomous agent management. Anyone else even close to these conversations? For sure his world is a useful signal of where ours is heading. I'll stay close for you all!

Polyphasic sleep patterns for AI agent management

Links readers shared

Creativity Can Embrace AI — Nadim's book on how creative industries can work with rather than against language models. Named Amazon book of the year by The New Publishing Standard.

Read the email ↑

28th February 2026

LinkedIn carousel: AI's real value isn't dramatic. It's a hundred small things.

Cover: AI's real value isn't dramatic. It's a hundred small things.

A Japanese fishing town cancelled its poetry competition.

69% of firms use AI, 80% report zero impact.

AI alone produces sameness. AI plus human steering produces something better.

The organisations pulling ahead are compounding a hundred small elevations.

Audio edition · AI voice, testing — feedback welcome

David's email this week

What's on my mind

The hundred small things

A story caught my eye in Japan this week. A small fishing town called Sakaiminato has run a senryu poetry competition for twenty years. Senryu is a verse form about human nature: wry, observational, personal. This month they cancelled it permanently. Not because entries dried up. Because they converged. Identical patterns, identical punchlines, identical phrasing. Everything sounded the same.

The problem wasn't that people used AI. It was that they used AI and stopped there. A language model returns the most probable answer. Not the most distinctive. Not the most human. The most average. Do that a hundred times and you get a hundred versions of the same poem.

That distinction (AI alone produces sameness; AI plus human steering produces something better than either) matters enormously for work. But not where most people look for it. Everyone talks about the strategy deck built in ten minutes, the agent that automated a research pipeline overnight. Those things happen. But they're rare events in any job. They're not where most of the value sits.

The value sits in the hundred small things a day that get slightly elevated. A slightly better meeting prep. A slightly cleaner first draft. A slightly faster scan of a long document to find the one paragraph that matters. None of these would make a headline. But do it a hundred times a day and it compounds into something transformative.

I know this because I live it. Every day, my meeting transcripts get processed into a five-paragraph reflection. My calendar prep happens automatically. Research starts with an AI scan before I decide where to go deeper. None of this is impressive on its own. All of it adds up to something that feels, week by week, fundamentally different from how I worked twelve months ago.

The problem is that firms can't see it. They track big projects: "We automated contract review, saving 400 hours per quarter." They don't track "slightly better email subject lines across 200 people." Last week I mentioned the survey that found 69% of firms use AI and 80% report zero measurable impact. I believe both numbers. The gains are real but distributed so thinly they vanish into the noise of normal work.

And even where gains are visible, organisational structures absorb them. One of my co-founders calls this the "extra hour" problem. Give a team an extra hour and nothing changes. Give them an extra person and everything adjusts. AI is being deployed like the extra hour. Into structures that weren't designed to capture it.

The poetry contest has a second lesson. The organisers could have redesigned the competition. Rewarded the most distinctive voice amplified by technology. Instead, they retreated. Killed it entirely. Organisations do the same thing. AI creates a problem and the instinct is to pull back. Restrict access. Add approval layers. The alternative is redesign: if execution takes hours instead of days, move the review cadence to match. If first drafts arrive better, raise the bar for what "finished" means.

A poetry contest in a Japanese fishing town tells you everything about where AI adoption stands right now. The technology works. That's not the question any more. The question is whether you retreat from what it changes, or redesign around it. The organisations pulling ahead aren't chasing one dramatic win. They're compounding a hundred small elevations a day, each one shaped by a human hand. That's harder to measure. Harder to put in a board deck. But it's where the value actually lives.

Three things worth knowing

1. A Fiction Worth Reading. Honestly.

Citrini Research published a scenario memo set in June 2028, looking back at a crisis that hasn't happened yet. The central argument: when AI makes expertise cheap to produce, clients stop buying it. I think this applies equally to strategy decks, competitive analysis and market research. Work that used to justify five-figure invoices starts getting done in-house by someone with a subscription and thirty minutes. Clients don't fire their agencies. They just renegotiate, armed with a clearer sense of what the work actually costs to produce. The sharpest line I've read this year: "We had overestimated the value of human relationships. Turns out that a lot of what people called relationships was simply friction with a friendly face." Worth reading as a stress test for anyone who sells expertise for a living. (Citrini Research)

2. The floor collapsed under junior roles.

Jack Dorsey cut Block, his payments company, from 10,000 people to under 6,000. Not because the business was struggling. Gross profit grew 24%. The stock jumped 20%. Dorsey was unusually direct about why: "the intelligence tools we're creating and using, paired with smaller and flatter teams, fundamentally change what it means to build and run a company." Hours later, several founders from Y Combinator, Silicon Valley's most influential startup programme, told investor Jeff Feng they're planning to eliminate all engineers below senior level. The pattern: senior people who can steer AI are becoming more valuable. Junior people whose work AI can approximate are harder to justify. The market confirmed it instantly, valuing each eliminated Block role at roughly $1.5 million in added enterprise value. If you run a team, count how many people do work that a senior person with good AI tools could now do themselves. That's the number your board will eventually ask about. I believe junior people with AI are more valuable than ever. But articulating that in terms that survive a headcount review is a challenge we all have to address.

3. The model matters less. The application matters more.

Google's Gemini 3.1 Pro scored 77.1% on ARC-AGI-2, a reasoning benchmark designed to test abstract problem-solving, more than double its predecessor's score three months earlier, while holding API prices flat. (Google) Meanwhile, composite evaluations show OpenAI, Anthropic and Google clustered tightly at the top. (Artificial Analysis) Six months ago, picking the right AI model felt very important. Model choice now matters less. Models are converging so fast that any advantage evaporates before you've finished onboarding. What hasn't converged is the application layer: ChatGPT, Claude, Claude Code each wrap similar intelligence in very different interfaces and workflows. Pick based on the problem and the workflow. The model underneath will be fine.

Try this

Run your day through AI at six in the evening

At the end of each day, take whatever you captured (meeting notes, voice recordings, emails) and ask Claude for a five-paragraph reflection. Not a summary. A reflection. I've been doing this daily for months. The output compounds. You start noticing patterns you'd never have seen without the habit. This is the senryu lesson in reverse: the raw material is irreducibly yours. AI helps you see the shape of it. That's human steering at its most practical.

Before you build training, check the settings menu

Most people never change the default model on AI platforms. Before investing in an elaborate training programme, walk up to five people in your organisation and ask two questions: which AI model are you using, and have you changed anything in the settings? The answers will tell you more about your organisation's AI maturity than any survey. If your people haven't opened the settings menu, the barrier isn't skill. It's something simpler: nobody showed them it was there.

Push record. Think aloud. Send it to AI.

When someone has stories they want to capture but can't easily write, skip the blank page. Push record while doing actual work. Don't write, don't perform, just think aloud. Send the raw recording to a transcription tool, then run the transcript through Claude. The stories that would otherwise stay untold (because the effort of writing them was too high) get captured without friction. I've seen this work for client case studies, internal knowledge sharing, and personal reflection. The raw material is human. The refinement is machine. That's the order that works.

What readers said

Last week's edition struck a nerve. More than a hundred readers wrote back, most pulling on the same thread: the gap between what individuals can now do with AI and what their organisations will permit. Anonymised perspectives from readers whose responses added something to the conversation are featured online, along with links they shared. Including an executive from a media company who has been ignoring their organisation's AI policies to enable their team's experimentation, a professional services exec who argued that accelerating existing processes without redesigning them is premature optimisation, and a director at another major media company who is formalising a champions network, exactly the "back your misfits" approach from last week.

Read the full letters and links readers shared below.

See the extras for this week ↓

The bits that didn't fit

The marginal cost of arguing is going to zero

UK employment lawyers report workplace grievances that once fit in a single email ballooning into 30-page documents, complete with fabricated legal precedents and citations to laws from the wrong country. (Personnel Today) Creation cost: near zero. Response cost: unchanged. Ministry of Justice figures show new employment tribunal receipts rose 33% year-on-year in the quarter to September. (GOV.UK)

Your prompt is the ceiling

Anthropic's latest Economic Index analysed over a million Claude conversations and found a near-perfect correlation (r > 0.92) between the sophistication of human prompts and the sophistication of AI responses. The more nuanced and structured the input, the more the model rises to meet it. The bottleneck isn't the model. It's the human. Which is, in its own way, reassuring. (Anthropic Research)

One blog post. One hour. Billions gone. Again.

Anthropic published a blog post introducing Claude Code Security on a Friday afternoon. Within an hour, cybersecurity stocks cratered: CrowdStrike fell 8%, Cloudflare 8%, Okta over 9%. The tool itself is a modest research preview. But a single blog post from an AI company erased billions in market value from established incumbents. (Barron's) The same dynamic hit legal tech stocks when Anthropic announced legal plugins for Claude Cowork a couple of weeks earlier. (Sherwood News) That's a new kind of leverage.

The trust signals your organisation depends on are dissolving

Here's a problem that connects directly to those poets in Sakaiminato. A thoughtful email from a director now carries the same weight as an AI-generated memo, because the reader can't tell the difference. The cues that used to signal competence (a well-crafted message, a polished document, a detailed analysis produced under time pressure) are now producible by anyone in minutes. This isn't a quality problem. It's a trust architecture problem. We need to distinguish between "produced this" and "shaped this." The senryu competition couldn't tell the difference. That's why it died.

Read what readers said ↓

What readers said

What resonated

The capability-adoption gap: the tension between what AI can demonstrably do and what organisations will permit. This was the thread readers pulled on most. Thirty of 121 replies engaged with it directly, many in operational terms.
"Back your misfits": the idea of finding and enabling the ten percent who don't need pushing. Several readers said they were already doing this and the framing validated their approach.
Organisational inertia as structural, not cultural: readers recognised the obstacle isn't attitude or skill but governance, process, and risk frameworks. One partner at a strategy consultancy pushed further: speeding up the same process is "premature optimisation."
The personal-to-organisational transition: the feeling of being ahead personally but constrained institutionally was widely shared. People described experimenting on weekends, then walking into Monday meetings where nothing has changed.
"Package around problems, not platforms": a phrase several readers said entered their working vocabulary within days.

Points readers raised

A director at a media company connected the capability-adoption tension to their own industry. They are formalising a champions network, exactly the "back your misfits" approach, and said the framing helped crystallise what they were already building.

An exec at a broadcaster shared the most striking story: they have been deliberately ignoring their organisation's AI policies to enable their team's experimentation. The newsletter validated an act of institutional defiance they were already committed to.

An exec at a professional services firm offered a substantive counterpoint. Accelerating existing processes without redesigning them, they argued, is premature optimisation. Their real question: how does more information drive quality rather than just volume?

A chief technology officer at a data firm placed themselves at stage seven of an eight-stage AI maturity framework they adapted from Steve Yegge's writing: running ten or more parallel AI agent instances simultaneously.

Read the email ↑

22nd February 2026

LinkedIn carousel: The future is here. Your organisation isn't.

Cover: The future is here. Your organisation isn't.

Senior leaders opening laptops on Friday evenings.

The wonder is real. But the future is very hard to spread around.

The question is how we get it to work at scale.

Audio edition · AI voice, testing — feedback welcome

David's email this week

What's on my mind

The wonder and the weight

When senior leaders at consulting firms, broadcasters and banks find themselves looking forward to opening their laptops on Friday evenings and Sunday mornings, something has changed. Their own time, away from calls and meetings, voluntarily given over to playing with AI. I keep seeing the same moment. Someone who runs a division or manages hundreds of people realises they've just done, alone in ten minutes, work that used to occupy an entire team for a week. Their eyes light up. Childlike wonder on a fifty-year-old face. The future is very much here.

It's unbelievable what's possible these days and it's changing very, very, very, very rapidly.

METR Time Horizon: the length of coding tasks AI can handle autonomously has grown from seconds three years ago to longer than a working day

This chart tracks how long AI can work on a coding task before getting stuck. Three years ago the answer was seconds. Today it's longer than a working day. The curve is steepening.

But there is a dark side.

Monday comes. The same person walks into the same office. Most people haven't made the time to properly work AI out yet, despite training and newsletters from leadership.

Each dot represents 3.2 million people, coloured by their most advanced AI interaction

Leaders who've seen the future have a lot of work to do to bring everyone else with them. Eighty-four percent of the world's population has never used AI. Even among those who have, fewer than one in fifty pays for it.

Also: Meeting cadences haven't changed. Team structures and roles are the same. The time savings from those who've worked AI out just get absorbed into existing rhythms.

We all have to reconcile these two things. The future is here. The wonder is real. What's possible has changed so much in the last few weeks. But the future is very, very, very hard to spread around. Structural inertia is real. The future and the inertia coexist in the same organisations, sometimes in the same person on the same day! That tension, between what individuals can now do and what organisations will allow, is what I keep coming back to. Not "does it work?" It does. But how do we get it to work at an organisational level?

Several times a day I flip between childlike wonder and a deep fear for people and teams and orgs who've not leant into this yet. I feel both. The wonder and the weight. That's what this newsletter is about.

Three things worth knowing

1. The technical barrier is gone. Domain expertise is what matters now.

Ethan Mollick, a Wharton professor, gave executive MBA students four days, three AI tools and a brief: build a company from scratch. They did. Working prototypes, financial models, market research, competitive positioning. Most had never written a line of code. The students who got furthest weren't the most technical. They were the ones who understood their industry. If you've spent twenty years in your field and haven't tried building something with these tools, you're sitting on your biggest advantage. Open Claude Code and have a play. Let me know what you build! (Mollick's full account)

2. Most firms use AI. Almost none can measure the difference.

An NBER study surveyed nearly 6,000 executives across four countries. Sixty-nine percent of firms now use some form of AI. The average productivity gain over three years? 0.29%. The economists draw an explicit parallel to Robert Solow's 1987 observation that computers were everywhere except in the productivity statistics. The explanation, proved right over the following decade: firms had to fundamentally reorganise before the technology translated into measurable gains. That lag wasn't months. It was years. The same pattern is playing out with AI right now. But it just has to be much quicker, right? Read on ...

3. Freelancers are disappearing. The data is in.

Ramp analysed real corporate spending data. The share of business spend going to freelance marketplaces like Upwork and Fiverr fell from 0.66% to 0.14% in three years. Firms most exposed to AI substituted at roughly $1 in reduced freelance spend for $0.03 in AI spend. A 97% cost reduction. The gig economy may be automation's first casualty. (Ramp Economics Lab)

Try this

Find your misfits (and back them)

Identify the one-in-ten people in your organisation who are already at the forefront of using AI. The ones who experiment on their own time, who volunteer for pilots, who can't stop showing colleagues what they've built. Free them from at least some of their existing reporting lines. Give them the best tools. Bring them together, give them a name, give them a mandate to have an impact beyond their old job description. Most training programmes spread investment evenly. I don't think that's right. Your top ten percent will generate eighty percent of the value. A power user with the right support can transform a team. A reluctant user with mandatory training may forget everything by Thursday.

Open Claude Code and build something

Just build something. Pick a website you wish existed, or some data you wish was gathered across the web. Or an email scanning automation that summarises stuff for you on a schedule. Or an app that lets you do something you've been meaning to do. The only blocker is your imagination (plus a subscription to Claude, ten minutes of setup and 30 minutes of playing!) Claude will hold your hand through every single step. Just ask if you don't understand. Within 30 minutes you'll have something working and your perspective will have changed. I was reminded several times this week: there's a difference between knowing what's possible and doing it yourself. Go do it.

Measure outcomes, not logins

Accenture now tracks weekly AI tool logins for senior staff and links them to promotion decisions. The internal reaction? Staff call the tools "broken slop generators." Measuring whether people opened the app tells you nothing about whether the work got better. If you're going to track AI adoption, track the outcomes: did quality improve? Did turnaround times fall? Did clients notice? Log-in counts measure compliance. It's easy but lazy. Measure capability, instead.

See the extras for this week ↓

The bits that didn't fit

Apple chose Google over itself

Apple partnered with Google to power its AI features, paying a reported billion dollars a year for Gemini. The world's most valuable technology company looked at its own AI and decided someone else's was better. The strategic question isn't whether to build AI capability. It's which partner to choose. (By the way, the answer I'd recommend is Claude!)

Source →

One blog post. One hour. Billions gone.

Anthropic published a blog post introducing Claude Code Security on Friday. Within an hour, cybersecurity stocks cratered: CrowdStrike fell 8%, Cloudflare 8%, Okta over 9%. The tool itself is a modest research preview. But a single blog post from an AI company erased billions in market value from established incumbents. The same happened to legal tech stocks when it announced legal plugins a couple of weeks ago. That's a new kind of leverage.

Source →

The marginal cost of arguing is going to zero

UK employment lawyers are seeing workplace grievances that once fit in a single email ballooning into 30-page documents, complete with made-up legal precedents and citations to laws from the wrong country. Creation cost: near zero. Response cost: unchanged. New UK employment cases rose 33% in three months.

Source →

Your prompt is the ceiling

Anthropic's latest Economic Index analysed over a million Claude conversations and found a near-perfect correlation (r > 0.92) between prompt sophistication and response quality. Give a vague prompt, get a vague response. Give one rich in nuance and structured constraints, and the model meets you there. The bottleneck isn't the model. It's the human.

Source →

Watch what McKinsey does with its own workforce, not what it advises clients to do

McKinsey calls it "25 squared." The plan: grow client-facing roles by 25% while cutting back-office roles by 25%, using AI to rebalance a $20 billion firm. This isn't productivity improvement. This is structural transformation. Ask yourself if there are parts of your business that are ripe for radical change. McKinsey already has.

Source →

Read the email ↑