AI Reading List: April 24, 2026
Time for a link roundup!
New Models
First, some articles on newly released (or not) models:
-
Sign of the future: GPT-5.5 - Looks like GPT-5.5 caught up with (or maybe passed) the Opus models for coding. The examples gallery is fun and shows how different models handle the following prompt: “build me a procedurally generated 3D simulation showing the evolution of a harbor town from 3000 BCE to 3000 AD, it should look beautiful and allow me to have some control over it”
-
Anthropic’s Project Glasswing — restricting Claude Mythos to security researchers - Lots of news about Mythos, Project Glasswing and just AI-related security stuff in general. This video from Hank Green is a nice semi-technical overview: You Actually Do Need to Understand Mythos
-
You’re Paying $200/Month for AI. The Same Models Are Now Free. - Semi-related to the new models, I read a few interesting articles comparing frontier models to open source alternatives. Title here is a little bit click-baity, but article is interesting. This note from Dwarkesh discusses similar topics. His interview of NVIDIA CEO Jensen Huang is also worth a listen.
Best Practices
-
Your CEO is suffering from AI psychosis - Probably my favorite article of the week. I think the “AI Psychosis” label is clickbait, and this applies to everyone, not just CEOs. Headline aside, the article is great, especially “Your sprints still matter more than your agents”:
I spend a lot of time thinking about how to make agents productive. Maybe it’s the product manager in me, but the thing I keep coming back to is boring and unsexy: requirements documents, sprint planning, acceptance criteria, and measurement.
If I’m going to use Claude Code to build a feature, I won’t fire off a vague prompt and see what comes back. I’ll write a specification. I’ll define the acceptance criteria. I’ll set up the test cases. Only then will I let the agent execute against those constraints. When it’s done, I review the output against the spec, not my token counts.
My takeaway is that AI is an amplifier for both good and bad habits, so focus on reinforcing the good ones!
More quick hits around best practices:
- Agent Skills - Skill Library #1
- Agent Skills Specification - Skills framework/best practices
- Headless everything for personal AI - Importance of strong APIs for agents
- Agent Interview Best Practice - This whole Hacker News thread (and the original post from Willison) was all interesting, but the linked comment around having your agent interview you stood out for me.
One last thing …
- Sam Altman May Control Our Future - Can He Be Trusted? - A long New Yorker profile on Altman. Not technical, but an interesting read.
Comments
Sign in with a GitHub account to join the discussion. Threads are powered by giscus and live as GitHub Discussions.