weekend ai reads for 2025-08-29

📰 ABOVE THE FOLD: DON’T USE AN A.I. BROWSER

Piloting Claude for Chrome / Anthropic (8 minute read)

When we added safety mitigations to autonomous mode, we reduced the attack success rate of 23.6% to 11.2%, which represents a meaningful improvement over our existing Computer Use capability (where Claude could see the user’s screen but without the browser interface that we’re introducing today).

  • 11.2% attack success rates on a browser — where users probably conduct their most personal and sensitive activities — is frankly terrible; imagine if 11.2% of the time someone with a fake debit card could withdraw money from your account

  • we’re surprised they even released it in this state

when users ask it to “Summarize this webpage,” Comet feeds a part of the webpage directly to its LLM without distinguishing between the user’s instructions and untrusted content from the webpage. This allows attackers to embed indirect prompt injection payloads that the AI will execute as commands. For instance, an attacker could gain access to a user’s emails from a prepared piece of text in a page in another tab.

  • this attack is successful 100% of the time and they want users to pay for this?

AI browsers may be the best thing that ever happened to scammers — A new report shows Perplexity’s agentic AI falling for obvious phishing sites and executing malicious prompts. / Engadget (4 minute read)

We Are Still Unable to Secure LLMs from Malicious Inputs / Schneier on Security (7 minute read)

This kind of thing should make everybody stop and really think before deploying any AI agents. We simply don’t know to defend against these attacks. We have zero agentic AI systems that are secure against these attacks. Any AI that is working in an adversarial environment—and by this I mean that it may encounter untrusted training data or input—is vulnerable to prompt injection. It’s an existential problem that, near as I can tell, most people developing these technologies are just pretending isn’t there.

Piloting Claude for Chrome / Simon Willison (3 minute read)

Anthropic’s approach here is the most open-eyed I’ve seen yet but it still feels doomed to failure to me.

A Better Way to Think About AI — Artificial intelligence is ready to collaborate. Why fixate on automation? / The Atlantic (19 minute read)

 

đŸ“» QUOTES OF THE WEEK

A significant portion of LLM-generated research ideas appear novel on the surface but are actually skillfully plagiarized in ways that make their originality difficult to verify.

Tarun Gupta and Danish Pruthi (source)

 

đŸ‘„ FOR EVERYONE

ChatGPT: A.I. is making your writing worse—but not in the way you think. — In the age of A.I. paranoia, people are cutting em dashes, skipping metaphors, and leaving in typos to prove their human. / Slate (8 minute read)

  • A-plus subtitle

AI is already cutting Gen-Z jobs, Stanford study shows / San Francisco Chronicle, archive (4 minute read)

It found that employment for workers ages 22 to 25 in the most AI-exposed jobs, including software development and customer service, has dropped by 13% relative to less-exposed roles.

It also creates an unsustainable creative ecosystem. Today’s generative AI models are built on the world’s creative output—mostly uncompensated. If we don’t value and sustain human creativity now, future creativity will be nothing but AI-remixed leftovers.

 

📚 FOUNDATIONS

MythBusting Large Language Models — Chatbots can be deceptive. How do LLMs really work under the hood? / Joseph Lochlann Smith, Medium (23 minute read)

 

🚀 FOR LEADERS

AWS CEO says AI replacing junior staff is ‘dumbest idea’ — They’re cheap and grew up with AI 
 so you’re firing them why? / The Register (5 minute read)

In the Costco Era of Software, Design Is the Differentiator / Digital Native, Substack, archive (9 minute read)

Good design doesn’t mean shoving every bell and whistle into a product. (This is one way I see many startups struggle, particularly in vertical AI: skeptical lawyers and construction workers and restauranteurs don’t want a product overloaded with state-of-the-art features from the latest model; they want a simple, intuitive product that gets the job done).

How to Build AI Product Strategy — Build AI products that scale profitably, retain users, and defend against commoditization / Miqdad Jaffer, Product Lead at OpenAI, The VC Corner, Substack, archive (40 minute read)

 

🎓 FOR EDUCATORS

Some educators are automating grading; others are deeply opposed

In our Claude.ai data, faculty used AI for grading and evaluation less frequently than other uses, but when they did, 48.9% of the time they used it in an automation-heavy way (where the AI directly performs the task). That’s despite educator concerns about automating assessment tasks, as well as our surveyed faculty rating it as the area where they felt AI was least effective.

Selfplanr — AI-Powered College Planning

Get a detailed college audit with personalized profile recommendations, a day-to-day planner, and Cori, the AI counselor to help you manage your tasks & get into your dream schools.

Why learn French when your phone can do it for you? / Financial Times, archive (10 minute read)

 

📊 FOR TECHNOLOGISTS

Why AI Isn’t Ready to Be a Real Coder — AI’s coding evolution hinges on collaboration and trust / IEEE Spectrum (14 minute read)

How we vibe code at a FAANG. / TreeTopologyTroubado, Reddit (4 minute read)

Overall, we’re seeing a ~30% increase in speed from the feature proposal to when it hits prod. This is huge for us.

TL;DR: Always start with a solid design doc and architecture. Build from there in chunks. Always write tests first.

SuperClaude_Framework — A configuration framework that enhances Claude Code with specialized commands, cognitive personas, and development methodologies. / SuperClaude-Org, GitHub

 

🎉 FOR FUN

Gemini’s new AI image model is designed to make more precise edits to images — based on natural language requests from users — while preserving the consistency of faces, animals, and other details, something that most rival tools struggle with.

21 Ways People Are Using A.I. at Work / New York Times (26 minute read)

  • diverse list from “As a ‘muse’” to “Detect leaks in a water system”

ChatGPT has been legitimately unusable for me since watching this (from ‘South Park’) / SMB_Attorney, XCancel (2 minute video)

 

🧿 AI-ADJACENT

I Am An AI Hater / Moser’s Frame Shop (5 minute read)

And even as it consumes those who use it, even as the scammers become their own marks, even as it is sustained by exploited workers slotted in as human filters for algorithmic abuse – some people want to have a little, as a treat. As a joke. Just to make fun of it, just for the busywork, because it’s good enough, right? You understand.

I'm Worried It Might Get Bad / Daniel Miessler (19 minute read)

What I’m really worried about is those companies and services and products disappearing altogether—because they’re replaced by more efficient companies and products that do the same thing but better and with fewer people.

 

⋄