weekend ai reads for 2025-09-19

šŸ“° ABOVE THE FOLD: ABDICATION TO A.I.

governance: World’s first AI minister will eliminate corruption, says Albania’s PM / British Broadcasting Corporation (5 minute read)

analysis: AI can’t write good analyst research yet, says analyst — Finbots make too many mistakes, lack predictive power and tend to miss the big picture, according to Bernstein Research / The Financial Times (7 minute read)

dispute resolution: I Vibecoded a Dispute Resolution App / Rough Diamonds, Substack, archive (15 minute read)

As with all technology, there are pros and cons to AI for kids, but parental involvement in navigating it is key. Kucirkova notes: ā€œAI introduces what has been called the ā€˜third digital divide’: families with resources can guide their children’s use of technology, while others cannot. Parents who come home exhausted from long hours or multiple jobs may see AI-powered chatbots as a way for their child to have someone responsive to talk to.ā€

 

šŸ“» QUOTES OF THE WEEK

If you’ve never had to shovel [manure], not only do you lack empathy for those who do, but you also tend to create more [manure] for others to shovel.

Hunter Walk (source)

 

now that coding’s been solved i spend most of my time thinking and thinking is honestly so much harder than writing code

my brain hurts

yacineMTB (source)

 

šŸ‘„ FOR EVERYONE

Review: If Anyone Builds It, Everyone Dies / Steven Adler (Clear-eyed AI), Substack, archive (25 minute read)

  • this is a reasonable review of ā€˜If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All’ by Eliezer Yudkowsky and Nate Soares (disclosure: affiliate link), where ā€œreasonableā€ means it aligns with our perspective

  • we also liked most of it but aren’t entirely convinced the only way everyone doesn’t die is a global treaty banning the development of frontier AI; or we hope not, anyway

Just a week ago Anthropic raised $13 billion in a deal that valued the company at $183 billion; the settlement sum represents less than one percent of that valuation. In order to achieve that valuation, Anthropic stole the work of many thousands of people, and then paid less than 1/100th of their valuation to simply make the whole thing go away—the settlement does not even require Anthropic to admit any wrongdoing. As a country, we should not allow multibillion-dollar companies to commit theft at vast scale and then settle out of court for a negligible fraction of their ill-gotten gains.

ChatGPT as the Original AI Error / Paul Kedrosky (6 minute read)

No, but it does mean that conversation-fixated humans have latched onto the most conventional and conversational aspect of language models, and thrown that at every application in sight. While it sometimes works, it often does not. To this way of thinking, ChatGPT was the original error: a seductive service that appealed to our biases, to a fault.

How people actually use ChatGPT vs Claude - and what the differences tell us — Most ChatGPT chats are for asking about non-work stuff Most Claude chats are automation directives (for coding) / Zdnet (8 minute read)

Education is a major use case for ChatGPT. 10.2% of all user messages and 36% of Practical Guidance messages are requests for Tutoring or Teaching. Another large share - 8.5% in total and 30% of Practical Guidance - is general how-to advice on a variety of topics. Technical Help includes Computer Programming (4.2% of messages), Mathematical Calculations (3%), and Data Analysis (0.4%). Looking at the topic of Self-Expression, only 2.4% of all ChatGPT messages are about Relationships and Personal Reflection (1.9%) or Games and Role Play (0.4%).

Screen readers do not need to be saved by AI / Craig Abbott (8 minute read)

However, once you introduce AI, the machine has to parse an input, break it into tokens, reason about meaning, generate an output, and then render it. Unless you have enough processing power, you can’t do that in real-time at 800 words per minute.

 

šŸ“š FOUNDATIONS

You should be rewriting your prompts — We talk about overfitting models but never overfitting prompts to models / Max Leiter (5 minute read)

The Secret Power of JSON Prompts — A custom GPT to turn messy prompts into JSON prompts (included) / The Digital Creator, Substack, archive (9 minute read)

Post-training 101 / Tokens for Thoughts, Notion (46 minute read)

 

šŸš€ FOR LEADERS

OpenAI’s Product Lead on Winning AI Through Distribution — From bundling and embedding to viral artifacts and trust loops — here’s how to survive GPT-5 and scale profitably. / The VC Corner, Substack, archive (39 minute read)

  • Miqdad Jaffer, Product lead @ OpenAI and instructor at AI Product Strategy Cohort

  • whether you’re a buyer or a seller, this (long) read feels necessary to understand

How tech companies measure the impact of AI on software development — How do GitHub, Google, Dropbox, Monzo, Atlassian, and 13 other companies know how well AI tools work for devs? A deepdive sharing exclusive details, with CTO Laura Tacho / Pragmatic Engineer, Substack, archive (36 minute read)

  • this image is a good overview of the companies’ approaches, with more context in the article

The price point reflects the ā€œintuitionā€ and technical skills needed to keep pace with a rapidly-changing technology, Tanmai Gopal, PromptQL’s cofounder and CEO, told Fortune.

Gopal said the company hourly wage for AI engineers as consultants is ā€œaligned with the going rate that you would see for AI engineers,ā€ but that ā€œit feels like we should be increasing that price even more,ā€ as customers aren’t pushing back on the price PromptQL sets.

 

šŸŽ“ FOR EDUCATORS

Learn Your Way is grounded in learning science and powered by LearnLM, our best-in-class pedagogy-infused family of models, now integrated directly into Gemini 2.5 Pro. It adapts content to a learner's selected grade level and personal interests, and generates multiple representations based on the source material, from mind maps and audio lessons to interactive quizzes which enable real-time feedback and further content personalization. It gives students agency over their learning process.

  • join the waitlist: Learn Your Way

  • if this works as well as the demos we’ve seen, courseware development has very few remaining technical moats

That dwarfs the budgets of any computer science department’s GPU budgets, where grad students are often forced to scrounge for the graphics processing units required to train and run AI models. Plus, many universities are tightening their belts, making it harder for computer science departments to justify spending money on expensive facilities.

As the scope of AI research grows, some professors say GPUs could determine recruiting outcomes. ā€œI think some universities are realizing that, more and more, this type of research requires those kind of resources and they’re allocating resources there,ā€ said Alvarez-Melis, who is also part of the Kempner Institute faculty.

Elise Porter, the executive director of Harvard’s Kempner Institute, said finding funds to grow their cluster is important for the organization. But it’s not as easy as pursuing gift-giving with naming rights. ā€œIt’s a critical tool for us to be able to do this, and it is extraordinarily expensive,ā€ she said. ā€œI think those donors are out there, but they’re few and far between.ā€

AI grading issue affects hundreds of MCAS essays in Mass. — The state’s testing contractor found roughly 1,400 essays did not receive the correct scores, according to a spokesperson with the Department of Elementary and Secondary Education. / NBC Boston (6 minute read)

Homework Checker — An AI-Powered Tool for Detecting and Correcting Errors in Homework Problems

  • a ChatGPT GPT (i.e., requires OpenAI login)

 

šŸ“Š FOR TECHNOLOGISTS

The Software Engineers Paid to Fix Vibe Coded Messes — Linkedin has been joking about ā€œvibe coding cleanup specialists,ā€ but it’s actually a growing profession. / 404 Media (6 minute read)

How We Built a New Standard in Financial Advice: The First AI Financial Advisor with Full-Context Reasoning regulated by the SEC — Passes CFP Exams and Outperforms Humans (by 17 points) and Leading Frontier Models / Origin blog (14 minute read)

At Origin, our breakthrough was architecting a system that pairs the contextual reasoning of frontier LLMs with deterministic computational engines. The LLM layer interprets complex financial scenarios and orchestrates tasks, while deterministic modules handle the underlying math with absolute precision.

Malekzadeh estimates he spends around 50% of his time writing requirements, 10% to 20% of his time on vibe coding, and 30% to 40% of his time on vibe fixing — remedying the bugs and ā€œunnecessary scriptā€ created by AI-written code.

He also doesn’t think vibe coding is the best at systems thinking — the process of seeing how a complex problem could impact an overall result. AI-generated code, he said, tries to solve more surface-level problems.

What the heck is Palantir’s ā€œOntologyā€? / Sherwood News (20 minute read)

Things are now getting a bit clearer. Essentially, Ontology is Palantir’s way of refining, structuring, and connecting the myriad kinds of data and information pipelines companies constantly use, creating a new stable foundation on which the company can run Palantir software.

You can see why this could be a pretty big advantage.

 

šŸŽ‰ FOR FUN

Can You Tell the Difference Between a Human Voice and AI? Take Our Quiz — A security firm made deepfake versions of us reporting bogus news. Listen to the results. / Wall Street Journal

AI-Powered Animal Crossing Villagers Begin Organizing Against Tom Nook — An LLM breathed new life into ā€˜Animal Crossing’ and made the villagers rise up against their landlord. / 404 Media (8 minute read)

Awesome Nano Banana images / PicoTrex, GitHub (35 minute read)

  • examples of many neat Nano Banana capabilities

LLMs Will be Like Ozempic for Golf / House of Strauss, Substack, archive (7 minute read)

A day later, my swing was different and self recorded video sent to Google’s Gemini confirmed the change. Swing errors that were decades in the making were corrected in the span of minutes. I’m not saying that I’ve suddenly made a leap from ā€œStruggles to break 100ā€ to ā€œscratch golfer.ā€ I’m just saying that a process that could have been expensive and arduous was instead efficient and relatively cheap. I apply the LLM’s fix, and it tells me whether I’ve actually applied it. The feedback is instant and objective.

History Before Sleep / YouTube channel

  • AI-generated history lessons intended to help you sleep; unsure how fact-checked their content is

  • not as good as Sleep Baseball

AI music I’ve made that I’ve liked — Mostly goofy / Andy Masley, Substack (9 minute read)

  • everything from Hamilton-like musical numbers to shoegaze to dungeon synth

 

🧿 AI-ADJACENT

Why Every Company Needs a Futurist-in-Residence — For companies looking to the near-term, it’s no longer a-nice-to-have / Ideo blog (6 minute read)

 

ā‹„