weekend ai reads for 2025-02-14

📰 ABOVE THE FOLD: ART & CREATIVITY

GenAI Art Is the Least Imaginative Use of AI Imaginable / Ge Wang, Human-Centered Artificial Intelligence, Stanford University (13 minute read)

This mindset tends to be good for #Capitalism, but betrays not only a lack of understanding of why people make music, but also a profound lack of imagination regarding how we could, or would want to live with our technologies in our lives. I, for one, would go as far as to say using generative AI for creative expression in this manner (“describe what you have in mind and AI will create it for you”) amounts to the least imaginative of use of AI that I can imagine.

But a voice is not just a sound. And I’d like to think that no matter how much an A.I. version of Moe or Snake or Chief Wiggum will sound like my voice, something will still be missing — the humanness. There’s so much of who I am that goes into creating a voice. How can the computer conjure all that?

“A.I. provided a way also to achieve this without intruding on real lives or placing real Costa Rican faces that people of the community might recognize,” he said. “Since the pegamachos culture remains hidden, these A.I. images serve as a mimicry of photography, a fiction, and a medium through which I can imagine and construct an imagined parallel history.”

Sauter Morera said the use of A.I. allows him to pose hypotheses that he can then answer through the use of the technology, such as whether pegamachos would have expressed themselves more freely if Costa Rican society had been different at the time.

“Would cowboy culture have embraced latex?” he wondered. “These speculative questions are at the core of my work.”

Quick reminder: no one is saying AI will make great art; it can make mediocre content, which the attention economy runs on now. The entirety of this post has (hopefully) shown how social media mediocrity has loomed over greatness, which has cowered in the corner, patiently looking for rocks to bang together.

 

đŸ“» QUOTES OF THE WEEK

You cannot be great without the greatness of others.

Nick Sirianni (source)

 

đŸ‘„ FOR EVERYONE

Why Chatbots Are Not the Future / Amelia Wattenberger (9 minute read)

Good tools let the user choose when to switch between implementation and evaluation. When I work with a chatbot, I’m forced to frequently switch between the two modes. I ask a question (implement) and then I read a response (evaluate).

Current AI tools pretend writing software is like having a conversation. It’s not. It’s like writing laws. You’re using English, but you’re defining terms, establishing rules, and managing complex interactions between everything you’ve said.

You don’t program by chatting. You program by writing documents.

Claude.app / claude.ai did not feel well-designed for this problem. I spent a lot of time copy-pasting from the UI into on-disk files, and copy-pasting errors or other output back to Claude.

I found it challenging figuring out how to maintain appropriate context for my conversations.

Overall, these workers self-reported that the more confidence they had in AI doing the task, the more they observed “their perceived enaction of critical thinking.” When users had less confidence in the AI’s output, they used more critical thinking and had more confidence in their ability to evaluate and improve the quality of the AI’s output and mitigate the consequences of AI responses.

The good news: ChatGPT appropriately selected and accurately summarized a set of recent court rulings, all of which exist. The so-so news: it missed some broader points that a competent human expert might acknowledge. The bad news: it ignored a full year’s worth of legal decisions, which, unfortunately, happened to upend the status of the law.

How AI will divide the best from the rest / The Economist (10 minute read)

Aidan Toner-Rodgers of MIT, for instance, found that using an AI tool to assist with materials discovery nearly doubled the productivity of top researchers, while having no measurable impact on the bottom third. The software allowed researchers to specify desired features, then generate candidate materials predicted to possess these properties. Elite scientists, armed with plenty of subject expertise, could identify promising suggestions and discard poor ones. Less effective researchers, by contrast, struggled to filter useful outputs from irrelevant ones.

 

📚 FOUNDATIONS

Deep Dive into LLMs like ChatGPT / Andrej Karpathy, YouTube (211 minute video)

This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It is covers the full training stack of how the models are developed, along with mental models of how to think about their “psychology”, and how to get the best use them in practical applications. I have one “Intro to LLMs” video already from ~year ago, but that is just a re-recording of a random talk, so I wanted to loop around and do a lot more comprehensive version.

  • we finally got around to watching this and it’s as good as everyone said

  • if you’re not into videos, here’s a 200-page book that’s also good, Foundations of Large Language Models / arXiv (323 minute read)

The Price of Intelligence — Three risks inherent in LLMs / ACM Queue (28 minute read)

Robust Open Online Safety Tools — Open and accessible tools that put safety back in the hands of the people

ROOST develops, maintains, and distributes open source building blocks to safeguard global users and communities. Backed by dedicated technical teams and leading experts, ROOST meets organizations where they are and provides hands-on support at every stage of their safety journey.

  • Meta is a glaring omission

Now more than ever, AI needs a governance framework — Now more than ever, AI needs a governance framework / Fei-Fei Li, Financial Times (6 minute read)

It is possible to develop a model with the best intentions, and for that model to be misused later on

 

🚀 FOR LEADERS

The future belongs to idea guys who can just do things / Geoffrey Huntley (8 minute read)

Companies, look at the roadmap you have carefully developed and consider throwing out parts that no longer make sense. Start motions towards up-skilling how your employees think.

Winning with AI / Bain & Company (3 minute read)

Five questions for every CEO

1. Am I moving fast enough?

2. How might AI change the future of my industry?

3. How can AI strengthen our competitive advantage?

4. How is this different, and how do I enable the tech foundation?

5. How should I lead the organization on this journey?

The Anthropic Economic Index / Anthropic (11 minute read)

AI use is more prevalent for tasks associated with mid-to-high wage occupations like computer programmers and data scientists, but is lower for both the lowest- and highest-paid roles. This likely reflects both the limits of current AI capabilities, as well as practical barriers to using the technology.

 

🎓 FOR EDUCATORS

Key policy measures include the development of guidelines for ethical GAI use, the design of authentic assessments to mitigate misuse, and the provision of training programs for faculty and students to foster GAI literacy. Despite these efforts, gaps remain in comprehensive policy frameworks, particularly in addressing data privacy concerns and ensuring equitable access to GAI tools. The study underscores the importance of clear communication channels, stakeholder collaboration, and ongoing evaluation to support effective GAI adoption. These insights provide actionable insights for policymakers to craft inclusive, transparent, and adaptive strategies for integrating GAI into higher education.

  • notable for its frameworks; just skim to those

Workspace for Nonprofits users accessing the Gemini app now have enterprise-grade data and privacy protections, which means chats and uploaded files will not be reviewed by human reviewers or otherwise used to improve generative AI models. These same protections are available on NotebookLM for Workspace for Nonprofits.

  • on quick scan, it looks like you have to opt-out to get these protections but good on them anyway

Separated out like this, content and activities can be mixed and matched in both directions. For example, a learner can use different activities (like retrieval practice, reciprocal teaching, or worked examples) to study a single topic (like supply). Similarly, they could reuse a single activity across multiple topics (sometimes even across multiple courses).

 

📊 FOR TECHNOLOGISTS

Challenges in AI data integrity / Deloitte Insights (23 minute read)

Inconsistent information retrieval, chunking, and integration across multimodal solutions: Retrieval solutions and multimodal approaches can introduce data integration and engineering challenges, which can be resolved by setting up processes for retrieval augmented generation (RAG) integration with human oversight and improving chunking and advanced retrieval methods at all integration points.

They helped us identify recurring friction points and prioritize solutions that would resonate across diverse user scenarios. The challenge wasn’t just to reorganize – it was to rethink what organization itself meant for a growing product.

Request for Proposals: Technical AI Safety Research / Open Philanthropy (21 minute read)

In particular, we should prepare for the risk that AIs could be misaligned — that they might pursue goals that no one gave them and harm people in the process. We think that ML research today can help to clarify and mitigate the likelihood of this failure mode.

  • seems like they’re using a low-friction approach to awarding grants

Customers don't care about your AI feature / Growth Unhinged (12 minute read)

Canva says its “Magic Design” feature helps users effortlessly make beautiful designs. Their audience doesn’t need “AI-powered productivity.” They need the result of that AI-powered productivity. They need “beautiful marketing templates ready in 10 seconds.”

 

🎉 FOR FUN

Can AI read pain and other emotions in your dog’s face? / American Association for the Advancement of Science (17 minute read)

Scientists have spent thousands of hours sitting in front of stalls and cages observing the faces of animals in these painful or stressful situations, then comparing them against animals who were most likely pain- or stress-free. As a result, they’ve developed “grimace scales” for a variety of species, which provide a measure of how much pain or stress an animal is experiencing based on the movements of its facial muscles.

  • more descriptions of ways in which animals feel pain than we needed this early in the morning

  • but at least now you can pretend your cat is telling you that it loves you

Reor — Private & local AI personal knowledge management app for high entropy thinkers.

  • no, we don’t know what that means either

  • seems like an AI-enabled Obsidian at first test

  • related, AnythingLLM — The all-in-one AI application for everyone

    • still does not include image and text generation side-by-side

Tough Tongue AI — Practice Difficult Conversations with AI

  • mostly interview practice in the scenario library

 

🧿 AI-ADJACENT

Are We Entering the Era of Artificial Friendship? / American Enterprise (10 minute read)

Writing in the MIT Technology Review, researchers Robert Mahari and Pat Pataranutaporn warned that sophisticated chatbots and other non-human agents posed new risks to human beings, a kind of artificial “addictive intelligence” that takes advantage of what we know about human behavior. “The allure of AI lies in its ability to identify our desires and serve them up to us whenever and however we wish,” they note. “AI has no preferences or personality of its own, instead reflecting whatever users believe it to be,” what researchers call “sycophancy.”

The Duolingo Handbook [PDF] / Duolingo (27 minute read)

At the center of this book are five principles. These aren’t aspirational—they’re lessons we've learned through experience. But they’re also living ideas: they have tensions within them, and there are places where they don’t always fit.

  • thye’re getting better than 37signals at this sort of writing

 

⋄