That AI Thing
Posts
weekend ai reads for 2025-12-12

weekend ai reads for 2025-12-12

December 12, 2025

direct links are available on the web at https://thataithing.beehiiv.com/p/weekend-ai-reads-for-2025-12-12

📰 ABOVE THE FOLD: EDGE USE CASES (FOR FUN)

I failed to recreate the 1996 Space Jam Website with Claude / j0nah (14 minute read)

For now, this task stands undefeated. A monument to 1996 web design and a humbling reminder that sometimes the simplest tasks are the hardest. That orbital pattern of planets, thrown together by some Warner Brothers webmaster 28 years ago, has become an inadvertent benchmark for Claude.

Until then, the Space Jam website remains proof that not everything old is obsolete. Some things are just irreproducibly perfect.

Police department replaces sketches of suspects with AI images — The images distributed by Goodyear, Arizona, police look like photos, but they’re created by artificial intelligence and input from witnesses. / Washington Post (7 minute read)

AI chatbots can sway voters better than political advertisements — A conversation with a chatbot can shift people’s political views—but the most persuasive models also spread the most misinformation. / MIT Technology Review (9 minute read)

In this experiment, Gemini 3 displays the kind of intelligence we should be aiming for. I don't know exactly what Google did differently, but it gives me hope. The other models demonstrate powerful but narrow intelligence that can immediately trip down a rabbit hole and waste its compute in a fruitless direction. / Josh Whiton, XCancel (3 minute read)

Only Gemini 3 solved this riddle. Even Opus 4.5 couldn't. And Grok? Good lord... someone check on Grok.

It shows how brittle these intelligences are. To see their respective failure modes (and Gemini 3’s impressive performance) read on.

📻 QUOTES OF THE WEEK

Move slowly and methodically to go fast

Andy Bell (source)

It seems fairly safe to assume you aren’t in the first group against the wall, if you’re excited about the thing putting them there.

Josh Collinsworth (source)

👥 FOR EVERYONE

The state of enterprise AI / OpenAI blog (6 minute read)

Crucially, AI is not just helping people do the same work faster—it is enabling people to do new kinds of work.

Coding-related messages increased 36% for workers outside of technical functions.

75% of users report being able to complete new tasks they previously could not perform.

AI reduces the gap between intent and execution. Individuals can translate ideas into concrete outputs, regardless of their specialization or technical expertise.

related, What People Do with Copilot [PDF] / Microsoft (17 minute read)

On mobile, health is the dominant topic, which is consistent across every hour and every month we observed—with users seeking not just information but also advice.

“Surfing the edge”: Tim O’Reilly on how humans can thrive with AI — Media trailblazer Tim O’Reilly tells Big Think why AI requires “get yourself dirty” work — and warns us not to buy the hype. / Big Think (11 minute read)

The Reverse-Centaur’s Guide to Criticizing AI / Cory Doctorow, Pluralistic (47 minute read)

The promise of AI – the promise AI companies make to investors – is that there will be AIs that can do your job, and when your boss fires you and replaces you with AI, he will keep half of your salary for himself, and give the other half to the AI company.

related (1), CEOs Are All-In on AI — A survey shows that chief executives expect AI to boost productivity and economic growth, even as it weighs on the labor market / Wall Street Journal (4 minute read)
related (2), AI Art Is Weird, Sad, and Ugly. Let’s Not Pretend Otherwise. / Nicholas Liu, Jacobin (10 minute read)

Because capitalism orients people toward profit rather than allowing us to pursue our interests freely, it inevitably separates humans from the creative act. AI art is just the slop frothing up from that gap.

LLMs Make Legal Advice Lossy / Kyle E. Mitchell, /dev/lawyer (9 minute read)

Clients can use LLMs to compress my advice into summaries without telling me. Pressure to keep writing short and plain is good, but they’re losing my careful choice of how much to generalize rules and realities. They’re also interrupting how I choose to teach and reinforce useful vocabulary.

📚 FOUNDATIONS

NotebookLM: The Most Useful Free AI Tool of 2025 / Jeremy Caplan, Wondertools, Substack, archive (18 minute read)

unfortunately, the video is only on the site
related, Google Illuminate is quietly becoming the best research tool I didn't know I needed / Android Police (9 minute read)

After you feed it a research paper, Illuminate produces a conversation between two synthetic voices that explains the paper’s ideas in plain language.

For those who learn better by listening or lack the time to read a PDF, Illuminate provides access to information that often seems restricted by academic jargon.

The Complete Guide to Nano Banana Pro: 10 Tips for Professional Asset Production / Google AI Studio, Twitter, archive (13 minute read)

Nano-Banana Pro is a “Thinking” model. It doesn’t just match keywords; it understands intent, physics, and composition. To get the best results, stop using “tag soups” (e.g., dog, park, 4k, realistic) and start acting like a Creative Director.

OpenAI, Anthropic, and Google use 10 internal prompting techniques that guarantee near-perfect accuracy…and nobody outside the labs is supposed to know them. Here are 10 of them: / Louis Gleeson, XCancel (12 minute read)

How HTR Models work — An educational look at the mechanics behind Handwritten Text Recognition models.

interactive playgrounds to help understand

🚀 FOR LEADERS

UX Is Your Moat (And You’re Ignoring It) / Eleganthack (11 minute read)

Netflix feels like a comfortable sofa. You sink into it. The other streaming services feel like you’re shopping.

This is the moat that AI companies keep ignoring. They’re obsessed with model capabilities—who has the best reasoning, who generates the most accurate images, whose context window is bigger. Meanwhile, the real competition is happening at the interface level, and most of them are losing.

AI use by directors and boards: Early insights [PDF] / Australian Institute of Company Directors (31 minute read)

related, Elevating board governance through AI posture and archetypes / McKinsey Technology (17 minute read)

The Impact of Visual Generative AI on Advertising Effectiveness / Emory University & New York University, Social Science Research Network (51 minute read)

Across studies, we find that genAI-created ads consistently outperform both human-and genAI-modified ads, increasing click-through rates by up to 19% in field settings. In contrast, genAI-modified ads show no significant improvement over human-created benchmarks. These results reveal an asymmetry: visual genAI delivers greater value when used for holistic ad creation rather than for modification, where creative constraints may limit its effectiveness. … Finally, we find that disclosing AI involvement in ad generation significantly reduces advertising effectiveness by up to 31.5%, underscoring trade-offs relevant to evolving AI disclosure policies.

🎓 FOR EDUCATORS

OpenAI, ChatGPT: AI Should Be Asking Your Kids More Questions / Bloomberg (8 minute read)

So Estonia is pioneering a better way of using ChatGPT to learn — by making it harder and retraining the model to pose more questions than answers. The effort challenges ChatGPT’s core appeal and could get the cold shoulder from OpenAI, but it could also unlock AI’s true potential in education in a way no one else has managed.

How AI is rewiring childhood / The Economist (8 minute read)

Being reared by robots has advantages. Tech firms are already showing how AI can enhance learning, especially where teachers and materials are scarce. Literacy and language-learning have been boosted in early trials. The dream is that, with an AI tutor, children can be saved from classes pitched to the median, in which bright pupils are bored and dim ones are lost.

related (?), Elite Colleges Have an Extra-Time-on-Tests Problem / The Atlantic (15 minute read)

At Brown and Harvard, more than 20 percent of undergraduates are registered as disabled. At Amherst, that figure is 34 percent. Not all of those students receive accommodations, but researchers told me that most do. The schools that enroll the most academically successful students, in other words, also have the largest share of students with a disability that could prevent them from succeeding academically.

“You hear ‘students with disabilities’ and it’s not kids in wheelchairs,” one professor at a selective university, who requested anonymity because he doesn’t have tenure, told me. “It’s just not. It’s rich kids getting extra time on tests.” Even as poor students with disabilities still struggle to get necessary provisions, elite universities have entered an age of accommodation. Instead of leveling the playing field, the system has put the entire idea of fairness at risk.

Grading Machines: Can AI Exam-Grading Replace Law Professors? / University of Minnesota, University of Virginia, University of Chicago, et al, Social Science Research Network (38 minute read)

Rather, we seek to determine whether existing models—which can be straightforwardly applied by most professors and students—are already suitable for the task of law exam evaluation. We find that, when provided with a detailed rubric, the LLM grades correlate with the human grader at Pearson correlation coefficients of up to 0.93. Our findings suggest that, even if they do not fully replace humans in the near future, LLMs could soon be put to valuable tasks by law school professors, such as reviewing and validating professor grading, providing substantive feedback on ungraded midterms, and providing students feedback on self-administered practice exams.

📊 FOR TECHNOLOGISTS

Agentic AI Foundation (AAIF)

The AAIF provides a neutral, open foundation to ensure this critical capability evolves transparently, collaboratively, and in ways that advance the adoption of leading open source AI projects. Its inaugural projects, AGENTS.md, goose and MCP, lay the groundwork for a shared ecosystem of tools, standards, and community-driven innovation.

Measuring Agents in Production / UC Berkeley, Intesa Sanpaolo, UIUC, Stanford University, IBM Research, arXiv (93 minute read)

We investigate why organizations build agents, how they build them, how they evaluate them, and what the top development challenges are. We find that production agents are typically built using simple, controllable approaches: 68% execute at most 10 steps before requiring human intervention, 70% rely on prompting off-the-shelf models instead of weight tuning, and 74% depend primarily on human evaluation. Reliability remains the top development challenge, driven by difficulties in ensuring and evaluating agent correctness.

useful examples of effective implementations, as well

What Is Agentic Browsing and Why Are AI Browsers Not Replacing Chrome Yet? — A crop of under-baked new products is splitting the internet into two lanes, forcing website developers to rethink who their pages are for. / Bloomberg (10 minute read)

get ready to have a shadow website for A.i.

🎉 FOR FUN

Perplexity x CR7

Cristiano Ronaldo is an investor and brand ambassador for Perplexity, so they came up with this sometimes interesting, often confounding landing page

Duolingo couldn’t teach me Spanish in 2 years. ChatGPT did it in just 2 weeks. No overpriced apps. No boring repetition. Just results. Here are the only 6 prompts I used: / Rishabh, XCancel (6 minute read)

Creed sings “Rudolph the Red-Nosed Reindeer” / There I Ruined It, YouTube (2 minute video)

related (?), Radiohead sings “Santa Claus is Coming to Town” / There I Ruined It, YouTube (2 minute video)

Who’s Right — Don’t just say you’re right. Prove it with an AI-generated certificate. Perfect for group chats, couples, and petty feuds.

AI Trade Arena

Grok ended up performing the best while DeepSeek came close second. Almost all the models had a tech-heavy portfolio which led them to do well. Gemini ended up in last place since it was the only one that had a large portfolio of non-tech stocks.

I’m obsessed with Redfin’s AI search — AI is a surprisingly useful tool when you’re sifting through real estate listings. / The Verge (8 minute read)

McDonald’s Pulls Down AI-Generated Holiday Ad After Deluge of Mockery / Futurism (6 minute read)

Similar to Coke’s 2025 Holiday ad, the McDonald’s spot is like a visual seizure, full of grotesque characters, horrible color grading, and hackneyed AI approximations of basic physics.

Though the abomination of an ad only has 20,000 views on YouTube, backlash in the comments was so intense that McDonald’s shut down comments over the weekend, before delisting the video entirely. (Some marketing research databases managed to scrape the clip, if you’re curious.)

🧿 AI-ADJACENT

The Minnesota Model: What the Digital Fair Repair Act Means for Your Home Network Security / Richard Callaby (10 minute read)

Your Data Might Determine How Much You Pay for Eggs — A newly enacted New York law requires retailers to say whether your data influences the price of basic goods like a dozen eggs or toilet paper, but not how. / Wired (9 minute read)

get a VPN

⋄