Is there a Half-Life for the Success Rates of AI Agents?

Toby Ord’s analysis suggests that an AI agent’s chance of success drops off exponentially the longer a task takes. Some agents perform better than others, but the overall pattern holds—and may be predictable for any individual agent:

This empirical regularity allows us to estimate the success rate for an agent at different task lengths. And the fact that this model is a good fit for the data is suggestive of the underlying causes of failure on longer tasks — that they involve increasingly large sets of subtasks where failing any one fails the task.

Is there a Half-Life for the Success Rates of AI Agents? | Toby Ord

Sentient Design Is Here

Sentient Design is the practice of creating intelligent interfaces. The new book answers what’s next for design (and designers).

Look up from your tools: promo illustration of a person holding a product aloft, standing on a pile of discarded tools

strategy

Look Up from Your Tools: AI for Product over Production

Today’s focus on AI production efficiencies is only the first stage of a far more exciting arc.

Detail of an image of a community of robots interacting and performing various jobs

agents

What Happens When Agents Meet?

Designing agent experiences means thinking beyond individual users—to swarms of agents colliding in shared systems. What could go wrong?

Promo illustration of a self-drawing interface

sentient design

When Interfaces Design Themselves

Radically adaptive experiences change content, structure, style, or behavior—sometimes all at once—to provide the right experience for the moment.

A promo illustration of a brain with an electrical cord, symbolizing the wiring of intent

sentient design

Wiring Interface to Intent

Create radically adaptive interfaces by telling large language models how to choose the right design pattern for the moment. A primer.

Photo of smiling Josh Clark in a workshop group

strategy

We Can Help

“What should we do?” Our new product strategy intensives help you find clarity and action amid AI and other seismic shifts.

What We’re Reading

The Giant Test Kitchen Where Cooks Battle A.I. Slop

The New York Times explores the test kitchens of People Inc. to explain the how and why of creating tons of original recipes. (What AI is and isn’t good for.)

Sinceerly: AI to Mess Up Your Writing

Sinceerly is a (tongue-in-cheek?) Gmail plugin to undo AI writing and make emails messy and human.

Jakob Nielsen: ‘Buy the Book Already’

Usability legend Jakob Nielsen offers a thorough review of Sentient Design (and a detailed comic-strip summary, too!)

A Great Design Gathering

Leonardo De La Rocha describes his takeaways from a gathering of the tech industry’s top design leaders—and the special impact of Sentient Design on the day.

Kirkus on Sentient Design: “Get It”

Kirkus Reviews calls Sentient Design “an essential guide for building responsible and revolutionary AI-mediated user experiences.”

Reimagining the Mouse Pointer for the AI Era

Google DeepMind’s new magic cursor feature provides the awareness and agency to respond to users in context—a tidy distillation of Sentient Design’s continuous copilot pattern.