Before we start, here’s a little puzzle to mark the UK release of the paperback edition of Proof this Thursday:
I generated the following two numbers with AI-written code. Can you work out what number comes next in the sequence?
770487, 216739
I’ll send a signed copy of Proof to whoever comments with the correct answer first*.
*Europe/North America shipping only, I’m afraid!
“Why Do You Still Have a Job?” by
I asked John Mongan, a UCSF radiologist and AI expert, why he still had a job a decade after Hinton’s famous prediction. “The people who were making those predictions understood computer vision but didn’t really understand radiology,” he said. “They were writing algorithms that could tell you that an image was a dog or a sailboat. And they thought that radiology was just doing that for medical stuff. But radiology is a lot more than that.”
The end of the noisy London pedicab by &
A “stage carriage” is better known today as a “stagecoach”. As a result, it’s a law that has more in common with the era of Dick Turpin than modern London that has left us in this mess.
The AI Death Zone: a cautionary tale by
Programmers are not starved of oxygen, they are starved of understanding. They are looking at code they haven’t written, assumptions they never would have made. We only seem to hear these days from those who have summited, but I suspect the AI death zone is piled high with bodies, all with the words “just one more prompt” frozen on their lips.
Don’t Shoot the Messenger RNA by
Flu vaccines have traditionally relied on egg-based production that takes about six months from strain selection to administering a vaccine (assuming we have enough chicken eggs). This time could mean everything in the event of a new pandemic threat. While the threat of pandemic flu is always with us, the recent jump of highly pathogenic avian flu (H5N1) from birds to mammals like cattle and ongoing widespread transmission makes this at least slightly more likely at the moment.
Towards a science of AI agent reliability by &
Safety-critical engineering fields (aviation, nuclear, automotive) figured out decades ago that reliability is not the same as average performance. These fields independently converged on the above four dimensions: consistency, robustness, predictability, and safety (the frequency and severity of failures).
For example, nuclear reactor protection systems must respond identically every time conditions warrant shutdown. Automotive safety testing evaluates responses to sensor failures and adverse weather.
My go-to career advice for PhD students and postdocs is to never ever let fear discourage you, and to never set any limits to what you can accomplish. It’s not that you won’t encounter practical limits in the future—you will—but you currently have no idea when and where and no way of knowing except for trying.
my week with the AI populists by
The data centers are just so close to where people live, looming over suburban backyards and sports fields and sidewalks and schools. Two years of loud construction, two decades of noise. You stand there and hear them hissing, whirring, rattling, beeping. Some have cheap American flags draped over the side, and others are painted a bland ecru, a flimsy attempt at fading into the background.
What seemed less obvious was that we would, within a few years, be confronted with a closure of both of these terminals. Yet that is the extraordinary set of events that has occurred in the past week. Ras Tanura and Ras Laffan are both closed. I simply couldn’t imagine writing a sentence like that up until this week.
The Real Cost of Running AI by
The raw compute floor for a well-optimized 14B-class deployment is ~$0.004/M tokens at full utilization. APIs charge $0.30–$1.25/M. That gap isn’t margin — it’s the cost of actually running a production service.





