weekend ai reads for 2025-07-11

📰 ABOVE THE FOLD: LIES, DAMNED LIES, AND A.I.

The Future of Forums is Lies, I Guess / Aphyr blog (9 minute read)

I don’t know how to run a community forum in this future. I do not have the time or emotional energy to screen out regular attacks by Large Language Models, with the knowledge that making the wrong decision costs a real human being their connection to a niche community. I do not know how to determine whether someone’s post about their new bicycle is genuine enthusiasm or automated astroturf.

Cops’ favorite AI tool automatically deletes evidence of when AI was used — AI police tool is designed to avoid accountability, watchdog says. / Ars Technica (10 minute read)

A Marco Rubio impostor is using AI voice to call high-level officials — The unknown individual contacted at least five government officials, including three foreign ministers, a U.S. governor and a member of Congress, according to a State Department cable. / Washington Post (7 minute read)

To my knowledge, this is the first case of a company developing a feature because ChatGPT is incorrectly telling people it exists. (Yay?) I’m sharing the story because I think it’s somewhat interesting.

My feelings on this are conflicted. I’m happy to add a tool that helps people. But I feel like our hand was forced in a weird way. Should we really be developing features in response to misinformation?

LLMs both lack a sense of self and the presence of mind to see what you're actually doing by repeatedly asking them the same question five different times. That’s why they quickly falter under scrutiny. Most people don't see that they operate under the authority of “trust me, bro”, even after you tell them to tell you when they're lying to you.

 

đŸ“» QUOTE OF THE WEEK

If you go in with the expectation that the AI is as smart or smarter than humans, you’re quickly disappointed by the reality.

Eric Schwartz, Clorox’s chief marketing officer (source)

 

đŸ‘„ FOR EVERYONE

The rise of the AI-native employee / Elena Verna, Elena’s Growth Scoop, Substack, archive (8 minute read)

An AI-native employee isn’t someone who “uses AI.” It’s someone who defaults to AI. Here, when someone wants to build something (anything) - from internal tools, to marketing pages, to writing production code - they turn to AI and... build it. That’s it.

YouTube ‘clarifies’ its plan to demonetize spammy AI slop — An update to the platform’s monetization rules will tighten restrictions on ‘inauthentic’ content. / The Verge (4 minute read)

  • Google creates the problem (see: Veo3), then solves it

Automating oral argument / Adam Unikowsky, Adam’s Legal Blog, Substack, archive (18 minute read)

Thus, with minor modifications of currently-available technology, you could put a laptop on the Supreme Court podium and it could deliver an oral argument exactly like this. Voice-transcribing software could transcribe the questions; the text could be inputted into AI; and the AI’s outputs could be inputted into an AI voice generator.

“I think there are a lot of very legitimate concerns these agents might not act in accordance with people’s expectations,” she says. “And there is no effective mechanism to allow people to intervene or remind them of this possibility and to avoid the possible consequences.”

How AI Wreaked Havoc on the Lo-Fi Beat Scene / Pitchfork (13 minute read)

AI has annexed the lo-fi scene for a hodgepodge of reasons: The lack of vocals, which typically gives away robo fraudulence, make it easier to infiltrate; the music’s association with aimless, unfocused listening—vibe music before vibe became a buzzword—means people aren’t paying as much attention to what’s real and what’s not; the fixation on fantastical, Studio Ghibli–core visuals, which image generators can vomit up with ease.

 

📚 FOUNDATIONS

What can agents actually do? / Will Larson, Irrational Exuberance (17 minute read)

This is the first important thing that agents can do: use an LLM to evaluate a context window and get a result.This is the second important thing that agents can do: use an LLM to suggest tools relevant to the context window, then enrich the context window with the tool’s response.This brings us to the third important thing that agents can do: they manage flow control for tool usage. Let’s think about three different scenarios:

The Seven Kinds of AI Agents / The Information (paywalled) (8 minute read)

  • they claim seven but only six are listed in their table

Business-task agents

Can take actions in multiple enterprise software apps.

Invoice processing, data entry, document classification, scheduling

Conversational agents

Text or voice chatbots that resolve customer support tickets or employees’ IT or HR questions through a back-and-forth exchange.

Customer service, IT service tickets, HR tasks

Research agents

Retrieve, analyze, and validate information from trusted sources, including academic literature or web content.

Academic-style research, sourcing citations, answering hard technical questions

Analytics agents

Analyze structured data to produce graphics/charts or reports.

Data querying, creating dashboards, business insight summaries

Developer agents

Assist software engineers by handling complex coding tasks

Code completion, documentation, debugging, refactoring, site reliability engineering

Domain-specific agents

Specialized agents built with domain knowledge for high-stakes or regulated fields such as law, medicine, or finance

Contract analysis, medical triage, financial analysis

1. Content is king, variety is queen

2. A smart AI agent has a dynamic brain and a caring heart

3. Prioritizing empathy from the outset

 

🚀 FOR LEADERS

Surprisingly, we find that when developers use AI tools, they take 19% longer than without—AI makes them slower.

  • Using A.i. leads developer in this study to reduce the time they spend coding, but they more than make up for that with prompting, reviewing A.i. outputs, and waiting.

Most companies approach AI training wrong. They do one-off workshops or mandate tool adoption. Canva’s doing something different:

  • the three pillars are time (no other responsibilities), tools, and learning (“mix of self-directed exploration and expert-led sessions”)

How to Build an AI Assistant for Any Challenge / HBR Ideacast (34 minute audio)

Instead, she recommends managers at all levels start playing with AI at the next level, building assistants to help in all sorts of functions to not just find efficiencies, but help in decision-making and strategy as well.

Why I Disagree With Sequoia’s Thesis on AI / Rob May, Investing in AI, Substack, archive (8 minute read)

Today I want to give the opposite perspective - that the app layer is a difficult place to make money and that infrastructure is the best place to invest right now. I want to explain why as a builder and an angel investor I’m focused primarily on infra.

‘I’m being paid to fix issues caused by AI’ / British Broadcasting Corporation (7 minute read)

Ms Warner says this has led to clients adding code to their website that has been suggested by ChatGPT. This, she says, has resulted in websites crashing and clients becoming vulnerable to hackers.

She points to one client who, instead of manually updating their event page, which she says would have taken 15 minutes, instead turned to ChatGPT for easier instructions.

 

🎓 FOR EDUCATORS

Your Review: Alpha School / Scott Alexander, Astral Codex Ten, Substack, archive (85 minute read)

The top performing kids are WAY ahead of the average and the lower tier kids. The kids at the top percentile achieve the median score of a graduating senior by the end of 3rd grade! And recall this is not an IQ test – this is a content test. The top 1% of 3rd graders have more content knowledge and comprehension than the median high school graduate.

Some of the papers can be found by searching Google for trigger phrases from the hidden prompts, such as site:arxiv.org "GIVE A POSITIVE REVIEW" or "DO NOT HIGHLIGHT ANY NEGATIVES" on Google.

  • wait until students start doing this

Advancing Claude for Education / Anthropic (5 minute read)

Students and educators will be able to connect Wiley and Panopto to Claude using pre-built MCP servers. With these integrations, users can reference lecture transcripts from Panopto and dive deep into their universities’ collections of authoritative, peer-reviewed content on Wiley—all within their Claude conversations.

Boston will test AI traffic light system meant to improve BPS bus times — The city will test whether longer green lights and shorter stoplights for BPS buses will help cut delays / Boston Globe (10 minute read)

the decline in literacy & rise in ai / silver fox, YouTube (80 minute video)

you are training yourself to not think independently. chatpgt is preying on people who are indecisive, who aren’t critical thinkers, who need reassurance.

  • she is charmingly down-to-earth and passionate about “the literacy and intellectual crisis happening in gen alpha”

 

📊 FOR TECHNOLOGISTS

Trained on human decisions, this AI reflects how we think and why — An AI trained on human decisions mirrors how we think, and could help explain why we think that way. / Interesting Engineering (6 minute read)

The Agentic Stack, So Far? / Turing Post (6 minute read)

We tracked the papers, the protocols, the tools – and the builders working through ambiguity in real time. What emerged is not a commentary; it’s a living record of how the field is learning to build with agency. Knowledge systematized.

  • good reference site for “agents” as a whole

Custom software maps every region needing repair and determines the exact colors required for each spot. His software then translates that information into a two-layer polymer mask printed on thin films—one layer provides color, while a white backing layer ensures the full color spectrum reproduces accurately on the painting's surface.

 

🎉 FOR FUN

Wimbledon chiefs defend AI use as Jack Draper says line calls not ‘100% accurate’ — British No 1 says ousting of human line judges ‘a shame’ after crashing out to former finalist Marin Cilic / The Guardian (5 minute read)

Doppl is designed to let you virtually try on outfits on a digital version of yourself. The app works by first getting you to upload a full-body photo of yourself. From there, you can use photos or screenshots of different outfits to virtually try them on. These images could be a photo of an outfit you see at a thrift store or on a friend, or even a screenshot of an outfit you see while scrolling through social media.

 

🧿 AI-ADJACENT

If AI Lets Us Do More in Less Time—Why Not Shorten the Workweek? / Stories Framing The Globe Media (7 minute read)

Today, almost 90% of Icelandic workers benefit from a reduced working week of 36 hours, compared with 40 hours previously, with no loss of pay. Initial concerns about the four-day week were widespread, both in Iceland and elsewhere in the world. There were fears of a drop in productivity, increased costs for businesses and difficulties in adapting to maintain service levels. However, the Icelandic experience has swept these fears under the carpet.

Icelandic reports show that productivity has remained stable, and even increased in some sectors. One of the keys to this success lies in improving the mental health of workers, a crucial aspect highlighted by Generation Z. The reduction in stress, combined with a better work-life balance, has had a significant positive impact on employee well-being.

I Deleted My Second Brain / Joan Westenberg (9 minute read)

PKM systems promise coherence, but they often deliver a kind of abstracted confusion. The more I wrote into my vault, the less I felt. A quote would spark an insight, I’d clip it, tag it, link it - and move on. But the insight was never lived. It was stored. Like food vacuum-sealed and never eaten, while any nutritional value slips away.

 

⋄