weekend ai reads for 2025-04-25

📰 ABOVE THE FOLD: AI AGENTS IN THE WILD

At the same time, the risk is immediate and present with agents. When models are not just contained boxes but can take actions in the world, when they have end-effectors that let them manipulate the world, I think it really becomes much more of a problem.

Law enforcement agencies have taken steps to prevent specifics about what Massive Blue is and how it works from becoming public. At public appropriations hearings in Pinal County about the Massive Blue contract, the sheriff’s office refused to tell county council members about what the product even is.

“Hey! We have no such policy,” wrote a Cursor representative in a Reddit reply three hours later. “You’re of course free to use Cursor on multiple machines. Unfortunately, this is an incorrect response from a front-line AI support bot.”

 

📻 QUOTES OF THE WEEK

A reminder to myself that taking up space, even imperfectly, is still worth doing.

Ashley Willis (source)

 

 

👄 FOR EVERYONE

While AI cannot exercise emotions, its perceived human-like nature in these interactions instinctively makes us blurt out courtesies like “Thank You” and “Please.”

Sam Altman acknowledges this and reports that ChatGPT costs the company tens of millions of dollars just generating responses to these prompts.

South Korean actor Simon Lee’s AI likeness was similarly used to spook naïve Internet users but in a potentially more harmful way. He told the AFP that he was “stunned” to find his AI avatar promoting “questionable health cures on TikTok and Instagram,” feeling ashamed to have his face linked to obvious scams.

One of the maps shows the flow of data and knowledge out of 39 African nations to subcontractors, mostly located in the United Arab Emirates, North America, and Europe, with four outsourcing firms in Africa. From there, it goes on to clients such as Meta, OpenAI, and Samsung.

AI is Like Cars / Andrey Ozornin (12 minute read)

But we totally should fight for the right to live, think, and create without any new trending technology, including AI. We should fight for AI safety, for control over our data, for improved energy effectiveness. We should fight against AI being imposed on us as the default mode of human-computer interaction. We should fight to ban AI-powered weapons worldwide. We should demand climate action from AI profiteers.

 

📚 FOUNDATIONS

  • we agree with Zvi Mowshowitz: “Demis Hassabis continues to be by far the lab leader publicly talking the most responsibly and well about AI, in sharp contrast to Sam Altman, Dario Amodei, Elon Musk or Mark Zuckerberg.”

What Is an LLM and How Does It Work? / Extremetech (10 minute read)

Artificial intelligence guru Andrej Karpathy has produced one of the best tech videos I’ve ever watched. It's not for the faint of heart, but in 3.5 hours, he leads anyone with a basic understanding of neural networks to a similar knowledge of how modern LLMs, “chat-based” LLMs, and “reasoning” LLMs are constructed.

But what if you don’t stick to zero? Let’s say you crank it just a bit - up to 0.2.

Suddenly, the conversation changes. It’s as if you're interrupting this expert coding wizard just as he’s about to confidently hand you his top solution, saying:

“Hang on a sec - before you give me your absolute #1 solution, could you instead jot down your top two or three best ideas, toss them into a hat, shake ‘em around, and then randomly draw one? Yeah, let’s just roll with whatever comes out.”

  • smart analogy of temperature in LLMs from Lawncareguy85

  • surprisingly few posts from them about lawn care though
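For readers who want to see the hat trick in numbers, here is a minimal sketch of temperature sampling. The three candidate "solutions" and their scores are made up for illustration, and the function names are our own; real LLMs do this over a vocabulary of tokens, not whole solutions, so treat it as a toy model of the analogy rather than how any particular system is implemented.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities. Temperature 0 means 'always pick the top score'."""
    if temperature == 0:
        best = max(range(len(logits)), key=lambda i: logits[i])
        return [1.0 if i == best else 0.0 for i in range(len(logits))]
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max before exponentiating, for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample(options, logits, temperature, rng):
    """Draw one option 'from the hat', weighted by its temperature-adjusted probability."""
    probs = softmax_with_temperature(logits, temperature)
    return rng.choices(options, weights=probs, k=1)[0]


# Hypothetical candidates and scores (assumptions, not from any real model)
options = ["solution A", "solution B", "solution C"]
logits = [2.0, 1.8, 1.5]

rng = random.Random(0)  # fixed seed so repeated runs draw the same way
for t in (0, 0.2, 1.0):
    probs = softmax_with_temperature(logits, t)
    print(t, [round(p, 3) for p in probs], sample(options, logits, t, rng))
```

At temperature 0 the top-scored option gets all of the probability; at 0.2 the runners-up get a real but smaller share of the hat; at 1.0 the three options are much closer to evenly matched.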

 

🚀 FOR LEADERS

Winning Gen AI Race with Your Custom Data Strategy / Saurabh Kaushik, Becoming Human (13 minute read)

Timely data access ensures relevance and adaptability, while data quality directly impacts output excellence. Robust data governance practices address ethical concerns, like privacy and bias. Prioritizing these aspects in product development lays the foundation for a Gen AI solution that excels in performance while upholding ethical standards and user trust.

However, the trust in the data underpinning these decisions is falling. Business leaders do not feel equipped to find, analyze, and interpret the data they need in an increasingly competitive business environment.

2025: The Year the Frontier Firm Is Born / Microsoft (31 minute read)

As agents increasingly join the workforce, we’ll see the rise of the agent boss: someone who builds, delegates to, and manages agents to amplify their impact, working smarter, scaling faster, and taking control of their career in the age of AI. From the boardroom to the front line, every worker will need to think like the CEO of an agent-powered startup, directing teams of agents with specialized skills like research and data analysis.

Why taste matters now more than ever / Fast Company (7 minute read)

Not all design disciplines are equally affected by AI. Those who work with material, scale, and space (book designers, muralists, sign painters, mosaicists) continue to operate through tacit knowledge and touch. Their work still resists automation because it’s rooted in place and presence; it has “aura.”

 

🎓 FOR EDUCATORS

But, from my conversations, I also learned that for every student misusing AI to avoid hard thinking, there’s another embracing it to learn more deeply. Some use it to check their math homework. Others ask AI to explain calculus to them, because their parents certainly can’t. Some create practice tests to prepare for their exams. And, perhaps most powerfully of all, others turn to AI to ask honest, "dumb" questions they genuinely want answers to, igniting curiosity in topics their classrooms might overlook.

Using AI to predict student success in higher education / The Brookings Institution (12 minute read)

When we compared the model predictions to actual outcomes, distinct patterns emerged across racial groups. First, Black and Hispanic students were more likely to be incorrectly predicted to fail when they actually graduated (known as false negatives).

Meanwhile, white and Asian students were more likely to be incorrectly predicted to succeed (known as false positives). Among “failing” students who did not attain a bachelor’s degree within eight years, the predicted probability of success varied significantly by racial group (see right panel). Estimates of success were overly optimistic for white and Asian students (61% and 68% estimated probability of success, respectively). Conversely, the model estimates only a 32% probability of success for Hispanic students and 39% for Black students in similar circumstances.
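For anyone who wants to run this kind of audit on their own predictions, here is a rough sketch of how per-group false negative and false positive rates can be computed. The records, group labels, and function name below are invented for illustration; this is not the Brookings data or their method, just the basic bookkeeping behind the two error types described above.

```python
from collections import defaultdict


def error_rates_by_group(records):
    """records: iterable of (group, predicted_success, actually_graduated) tuples."""
    counts = defaultdict(lambda: {"fn": 0, "pos": 0, "fp": 0, "neg": 0})
    for group, predicted, actual in records:
        c = counts[group]
        if actual:
            c["pos"] += 1
            if not predicted:
                c["fn"] += 1  # predicted to fail, actually graduated
        else:
            c["neg"] += 1
            if predicted:
                c["fp"] += 1  # predicted to succeed, did not graduate
    rates = {}
    for group, c in counts.items():
        rates[group] = {
            # None when a group has no students of the relevant outcome
            "false_negative_rate": c["fn"] / c["pos"] if c["pos"] else None,
            "false_positive_rate": c["fp"] / c["neg"] if c["neg"] else None,
        }
    return rates


# Toy records with hypothetical groups (made up, not real student data)
toy_records = [
    ("group_x", True, True), ("group_x", True, False),
    ("group_x", True, True), ("group_x", False, False),
    ("group_y", False, True), ("group_y", True, True),
    ("group_y", False, False), ("group_y", False, False),
]

rates = error_rates_by_group(toy_records)
for group, r in rates.items():
    print(group, r)
```

A model can have similar overall accuracy for two groups and still make very different kinds of mistakes for each, which is why the rates are split by both group and error type.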

Bulls__t universities: the future of automated education / Robert Sparrow & Gene Flenady, Springer Nature (56 minute read)

Despite the impressive advances in Generative AI over recent years, the outputs of these systems are neither oriented towards the truth nor connected to action in such a way as to have conceptual content. Employing AIs as teachers would undermine the authority of all teachers and of the body of established knowledge that they aim to impart to students. Moreover, students must learn how in order to learn that, be exposed to the personal examples of their teachers in order to understand what it means to be committed to a discipline, and interact with other students and staff in order to realize themselves as moral agents and citizens.

 

📊 FOR TECHNOLOGISTS

Claude Code Best Practices / Anthropic (22 minute read)

This post outlines general patterns that have proven effective, both for Anthropic's internal teams and for external engineers using Claude Code across various codebases, languages, and environments. Nothing in this list is set in stone nor universally applicable; consider these suggestions as starting points. We encourage you to experiment and find what works best for you!

Most of the time, you just want to turn your idea into a working app - fast. Here’s how to do it without driving yourself insane. These aren’t fancy tricks, just things that work.

Get the system architecture right first. Document it, streamline it and test it. Then divide the components into small pieces and let AI finish the work.

Everything Wrong with MCP / Shrivu’s Substack, Substack archive (19 minute read)

Similar to exposing sensitive data but much more nuanced, companies who are hooking up a lot of internal data to AI-powered agents, search, and MCPs (i.e. Glean customers) are going to soon discover that “AI + all the data an employee already had access to” can occasionally lead to unintended consequences. It’s counterintuitive but I’ll claim that even if the data access of an employee’s agent+tools is a strict subset of that user’s own privileges, there’s a potential for this to still provide the employee with data they should not have access to.

 

🎉 FOR FUN

Keep tongue on roof of mouth as you open your mouth wide. Once you open wide enough that your tongue don't touch, keep opening but keep it center as you do it.

Badge AI — AI Usage Transparency System

This tool will help you determine to what extent your creative process uses artificial intelligence. Inspired by the Creative Commons model, our badge offers transparency on the genesis of your work.

  • this newsletter has always been “T-AI-0: Exclusively human-written text without AI”

  • the header image has almost always been “V-AI-3: AI-generated visual content with human prompts and editing”

Why are AI companies so bad at naming their models? - GPT-4o, Llama-4, Claude 3.7 Sonnet. Why can’t AI companies come up with compelling model names? / Fast Company (4 minute read)

Snow White starring Samuel Jackson (2026) | Teaser Trailer / Wicked AI, YouTube (2 minute video)

 

🧿 AI-ADJACENT

Go Delete Yourself From the Internet. Seriously, Here’s How. / Wall Street Journal archive (7 minute read)

Google updated its “Results About You” tool, and using it has been an eye-opening experience.

It uncovered my home address, phone number and email on so-called people-search websites, along with my birth date and grandma’s name, even though I requested removal from some databases years ago. It regularly sends emails alerting me to more exposed data.

 

⋄