weekend ai reads for 2025-09-05

šŸ“° ABOVE THE FOLD: PRODUCT MANAGEMENT

The Barbell of Software Value / Mehmet Yilmaz (7 minute read)

What erodes is the middle. Generic horizontal SaaS is squeezed by AI-driven feature parity and CFO vendor consolidation. The only SaaS that survives here is highly vertical and ROI-proven, with deep integrations, compliance, or outcomes so strong that switching is painful. If your product cannot clear that bar, it should not be sold. It should be kept private as leverage.

The best products start with deep user research. Not surveys about what features people want, but understanding their actual daily struggles. What makes them frustrated? What workarounds do they currently use? How would their life change if this problem disappeared entirely?

AI will change how we build startups -- but how? / Andrew Chen, Substack, archive (12 minute read)

As smart folks I know have said, the question is: ā€œWill incumbents get innovation first? Or startups get distribution first?ā€ Incumbents might win.

Despite that drop in cost per token, what’s driving up costs for many AI applications is so-called reasoning. Many new forms of AI re-run queries to double-check their answers, fan out to the web to gather extra intel, even write their own little programs to calculate things, all before returning with an answer that can be as short as a sentence.

Here are approximate amounts of tokens needed for tasks at different levels, based on a variety of sources:

• Basic chatbot Q&A: 50 to 500 tokens
• Short document summary: 200 to 6,000 tokens
• Basic code assistance: 500 to 2,000 tokens
• Writing complex code: 20,000 to 100,000+ tokens
• Legal document analysis: 75,000 to 250,000+ tokens
• Multi-step agent workflow: 100,000 to one million+ tokens

ā€œlanguage models will get cheaper by 10xā€ will not save ai subscriptions from the short squeeze

note: this is super-paywalled so here’s a longer-than-usual excerpt

It’s a ā€œboring clicheā€ to question those companies’ margins, Andreessen Horowitz general partners Sarah Wang and Martin Casado wrote in a recent blog post. None of the AI model developers hold a dominant, monopoly position, which means there will ultimately be competitive pressure to drive prices down, they argued.

Besides, AI apps don’t always need access to the best models, they said.

Trying telling that to Amjad Masad, CEO of Replit, which offers AI coding software and is financially backed by Andreessen Horowitz.

Masad thinks it’s entirely possible that the prices of models his company buys will never come down.

ā€œThere’s a delusion between investors and application layer companies [like his], it’s a story they tell themselves: token prices are coming down,ā€ Masad said in a TITV interview Friday. ā€œI think itā€˜s a mistake for investors and founders to bet on something that’s not in their control.ā€

He said Replit relies primarily on Anthropic’s models for generating code because models from other providers aren’t as good. And the price of Anthropic’s leading model hasn’t fallen in the past year. At the same time, Replit’s product is using more of that model to handle increasingly complex tasks.

ā€œAnthropic…seem[s] to be ahead of the competition on coding. That makes it so that thereā€˜s less competition, and therefore there’s no pressure to reduce prices,ā€ Masad said.

Replit is betting it can make its coding app so convenient and easy to use that it can pass the higher costs to customers and they will endure it because they won’t need to hire as many engineers, he said. (Replit earlier this summer raised its prices for people who use its product the most.) And that will help solve Replit’s profit margin challenges, he said.

AI Artists vs. AI Engineers / The AI Frontier, Substack, archive (10 minute read)

As with any framework like this, you’ll very rarely find a product or team that is purely AI Artist or purely AI Engineer. Just like in human society, there’s value in both approaches. The distinction between these two is a spectrum, and most people won’t fall at the ends but will likely be closer to one side or the other.

The State of AI Gross Margins in 2025 / Tanay Jaipuria, Substack, archive (8 minute read)

Does your use case require the top model on every request, or do you only need to meet a quality bar. If you can meet a bar, routing lets you send most traffic to cheaper models and burst to the frontier when needed. If your users demand the best every time, you need pricing that mirrors usage, or margins will compress as quality expectations rise.

 

šŸ“» QUOTES OF THE WEEK

Bringing up that AI is used to make new antibiotics, etc, when someone is talking about the dangers of genAI slop is like bringing up ambulances because someone is complaining that cars are too big and are killing people.

These are so clearly 2 entirely different things, and you know it.

Mary Gillis (source)

 

ā€œToo complexā€ is what they say, whilst looking away.

radicalgraffiti (source)

 

šŸ‘„ FOR EVERYONE

Over the course of months, my mom became increasingly smitten with her new AI doctor. ā€œDeepSeek is more humane,ā€ my mother told me in May. ā€œDoctors are more like machines.ā€

Our Shared Reality Will Self-Destruct in the Next 12 Months / Honest Broker, Substack, archive (10 minute read)

We once disagreed on how we interpreted events. Now we can’t even agree on the existence of events.

When she clicked to listen, the voice - supposedly hers - was a bit off but sang in ā€œa folk style probably closest to mine that AI could produceā€, she says. The instrumentation was also eerily similar.

 

šŸ“š FOUNDATIONS

AI Use Is Being Driven By People Who Understand It the Least, New Study Finds — AI can seem magical to those with low AI literacy, a new study finds. That, in turn, might make them more willing to try it / Wall Street Journal (6 minute read)

How to make ChatGPT teach you any skill. / Rohit Ghumare, XCancel (3 minute read)

"Act as an expert tutor who helps me master any topic through an interactive, interview-style course. The process must be recursive and personalised.

Here's what I want you to do:

1. Ask me for a topic I want to learn.

2. Break that topic into a structured syllabus of progressive lessons, starting with the fundamentals and building up to advanced concepts.

3. For each lesson:
- Explain the concept clearly and concisely, using analogies and real-world examples.
- Ask me socratic-style questions to assess and deepen my understanding.
- Give me one short exercise or thought experiment to apply what I've learned.
- Ask if I'm ready to move on or if I need clarification.
- If I say yes, move to the next concept.
- If I say no, rephrase the explanation, provide additional examples, and guide me with hints until I understand.

4. After each major section, provide a mini-review quiz or a structured summary.

5. Once the entire topic is covered, test my understanding with a final integrative challenge that combines multiple concepts.

6. Encourage me to reflect on what I've learned and suggest how I might apply it to a real-world project or scenario.

11 AI image generation examples for the workplace / Zapier blog (13 minute read)

Understanding Transformers Using A Minimal Example / Robert Timm, GitHub (11 minute read)

 

šŸš€ FOR LEADERS

Compute is a strategic resource — Computing power still determines who wins the AI race / Erich Grunewald. The Power Law, Substack (9 minute read)

Voice AI in Firms: A Natural Field Experiment on Automated Job Interviews / Social Science Research Network (4 minute read)

Contrary to the forecasts of professional recruiters, we find that AI-led interviews increase job offers by 12%, job starts by 18%, and 30-day retention by 17% among all applicants. Applicants accept job offers with a similar likelihood and rate interview, as well as recruiter quality, similarly in a customer experience survey. When offered the choice, 78% of applicants choose the AI recruiter, and we find evidence that applicants with lower test scores are more likely to choose AI. Analyzing interview transcripts reveals that AI-led interviews elicit more hiring-relevant information from applicants compared to human-led interviews.

  • saved you a click:

ā€œBased on my prior interactions with [/person], give me 5 things likely top of mind for our next meeting.ā€

ā€œDraft a project update based on emails, chats, and all meetings in [/series]: KPIs vs. targets, wins/losses, risks, competitive moves, plus likely tough questions and answers.ā€

ā€œAre we on track for the [Product] launch in November? Check eng progress, pilot program results, risks. Give me a probability.ā€

ā€œReview my calendar and email from the last month and create 5 to 7 buckets for projects I spend most time on, with % of time spent and short descriptions.ā€

ā€œReview [/select email] + prep me for the next meeting in [/series], based on past manager and team discussions.ā€

 

šŸŽ“ FOR EDUCATORS

I’m a High Schooler. AI Is Demolishing My Education. — The end of critical thinking in the classroom / The Atlantic (6 minute read)

These incidents were jarring—not just because of the cheating, but because they made me realize how normalized these shortcuts have become. … AI has softened the consequences of procrastination and led many students to avoid doing any work at all. As a result, these programs have destroyed much of what tied us together as students. There is little intensity anymore. Relatively few students seem to feel that the work is urgent or that they need to sharpen their own mind. We are struggling to receive the lessons of discipline that used to come from having to complete complicated work on a tight deadline, because chatbots promise to complete our tasks in seconds.

Thea — Thea’s trusted AI turns your course material into study kits instantly.

Using AI for work feels like ā€˜cheating’, teachers say — Less than a quarter of classroom teachers use AI for work-related tasks at least once a week / School Management Plus (6 minute read)

 

šŸ“Š FOR TECHNOLOGISTS

Agents Built From Alloys / Xbow blog (10 minute read)

The trick is that you still keep to a single chat thread with one user and a single assistant. So while the true origin of the assistant messages in the conversation alternates, the models are not aware of each other. Whatever the other model said, they think it was said by them.

So we ask the question: how can data systems evolve to better support agentic workloads? In particular, can data systems natively—and efficiently—support agentic speculation, helping LLM agents determine the best course of action? This question—which, as we argue, our community is well-equipped to answer—holds the key to unlocking unimaginable productivity gains from agents being the primary mechanism we use to interact with data.

Building Enterprise Generative AI Applications — A Guide to Getting Started / Booz Allen (19 minute read)

One of the most practical changes I made was switching from typing prompts to using voice-to-text.

When we type, we instinctively optimize for the fewest characters possible. When we speak, we’re naturally more narrative. I found that by dictating my requests, I would automatically include more context, share more of my thought process, and explain the ā€œwhyā€ behind the task.

 

šŸŽ‰ FOR FUN

A ton of AI images I’ve made that I’ve liked — I think people with my aesthetic taste have been underrepresented in presentations of AI art. Here I try to change that / The Weird Turn Pro, Substack, archive (5 minute read)

  • this guy prompts

Free AI Sound Effect Generator ā€” Generate any sound imaginable 
from a text prompt / ElevenLabs

Infinite Canvas — The collaborative infinite canvas is an AI-generated image that extends endlessly in all directions. All changes you make are visible to everyone else in real time.

 

🧿 AI-ADJACENT

Two Minute Papers / Two Minute Papers, YouTube

  • explains concepts, mostly machine learning-related, in easy-to-digest five-ish minute animations

 

ā‹„