weekend ai reads for 2024-11-22

programming note: we are off next week and will be back on 01 December

📰 ABOVE THE FOLD: LONGREADS

War and Peace in the Age of Artificial Intelligence — What It Will Mean for the World When Machines Shape Strategy and Statecraft / Foreign Affairs (21 minute read)

  • by Henry A. Kissinger, Eric Schmidt, and Craig Mundie

The generative agents replicate participants’ responses on the General Social Survey 85% as accurately as participants replicate their own answers two weeks later, and perform comparably in predicting personality traits and outcomes in experimental replications. Our architecture reduces accuracy biases across racial and ideological groups compared to agents given demographic descriptions.

  • alphXiv link

  • related (?), TinyTroupe — LLM-powered multiagent persona simulation for imagination enhancement and business insights. / Microsoft, GitHub

Which Foundation Model is best for Agent Orchestration / Artificial Intelligence Made Simple, Substack (sorry) (17 minute read)

GPT o1 (by far the worst)- As an orchestrator, I have nothing positive to say about o1. It’s terrible at following instructions, adds 30 lines of commentary where it shouldn’t (messes up the inputs being sent to the tools), and is just generally awful. Working with o1 made me want to call my parents and apologize for every time I disobeyed them

Inside the Booming ‘AI Pimping’ Industry — AI-generated influencers based on stolen images of real-life adult content creators are flooding social media. / Wired (29 minute read)

  • nothing explicit but depending where you work, some images might not be appropriate

How Did You Do On The AI Art Turing Test? / Astral Codex Ten, Substack (sorry) (17 minute read)

Your instincts were worst for Impressionism; you identified every single Impressionist painting as human except the sole actually-human Impressionist work in the dataset (Paul Gauguin’s Entrance To The Village Of Osny).

  • the average was 60%, or slightly better than a coin flip

 

đŸ“» QUOTE OF THE WEEK

It’s actually pretty fun. I already talk to myself all the time, but now I get to do it with an AI.

Rex Woodbury (source)

 

đŸ‘„ FOR EVERYONE

AI and The Last Mile / Hollis Robbins, Anecdotal, Substack (sorry) (9 minute read)

The issue isn’t that AI assistants get things wrong — it’s that they get things almost right in ways that can be more dangerous than obvious errors. They’re missing local knowledge: that messy, contextual, contingent element that often makes all the difference.

  • le chat is a good alternative to the others and it’s free (?)

Craftsman is knowing how to work. Art is knowing when to stop. And I think knowing when to stop is going to be a very difficult thing for AI to learn because [of] it’s taste, and also lack of consistency, lack of controls, lack of quality.

 

📚 FOUNDATIONAL

What are AI guardrails? — AI guardrails help ensure that an organization’s AI tools, and their application in the business, reflect the organization’s standards, policies, and values. / McKinsey & Company (11 minute read)

LangChain State of AI Agents Report / Lang Chain (10 minute read)

We surveyed over 1,300 professionals — from engineers and product managers to business leaders and executives — to uncover the state of AI agents. Dive into the data as we break down how AI agents are being used (or not) today.

How to make a Claude RAG/Document chatbot - (No Code) / Pickaxe A.I., YouTube (12 minute video)

 

🚀 FOR LEADERS

AI Makes Tech Debt More Expensive / Gauge blog (4 minute read)

Instead of trying to force genAI tools to tackle thorny issues in legacy codebases, human experts should do the work of refactoring legacy code until genAI can operate on it smoothly. When direct refactoring is still too risky, teams can adjust their development strategy with approaches like strangler fig to build greenfield modules which can benefit immediately from genAI tooling.

Last year, U.S. venture firms invested $60 billion more than they collected, the highest such deficit in PitchBook’s 26 years of data. As a result, the investors that back VC firms, such as university endowments and pension funds, aren’t seeing the type of profits the industry has long delivered.

  • fine article but they throw a confusing, seemingly unrelated sandwich graphic at the top

 

🎓 FOR EDUCATORS

via brandon, Dimensions of AI Literacies / Opened Culture (9 minute read)

The Dimensions of AI Literacies (presented below) offer a flexible and transformative lens for understanding teaching and learning practices in an AI-enabled world. We invite you to explore this taxonomy to reflect on the challenges and opportunities presented by AI in education, leveraging a common vocabulary to identify and share effective practices for meaningfully implementing AI-supported approaches into the classroom.

PaperGen — AI-Driven Writing Assistant for Business & Academia

PaperGen helps you generate well-structured long-form papers with fully referenced citations. It ensures originality, clarity, and precision with AI detection bypassing for a more human-like writing experience.

PaperGen crafts content that feels authentic and engaging, blending automation bypassing AI detection for any use cases.

OpenAI launches free AI training for educators - how to access it — This one-hour course, developed with Common Sense Media, helps teachers learn gen AI basics. Here's how to access it. / Zdnet (3 minute read)

 

📊 FOR TECHNOLOGISTS

As he shared on the social network Twitter recently, the UK-based Cheema connected four Mac Mini M4 devices (retail value of $599.00) plus a single Macbook Pro M4 Max (retail value of $1,599.00) with Exo’s open source software to run Alibaba’s software developer-optimized LLM Qwen 2.5 Coder-32B.

After all, with the total cost of Cheema’s cluster around $5,000 retail, it is still significantly cheaper than even a single coveted NVidia H100 GPU (retail of $25,000-$30,000).

  • everyone with an under-utilized data center should be using Exo; it’s as good as advertised

Droidspeak: AI models work together faster when they speak their own language — Letting AI models communicate with each other in their internal mathematical language, rather than translating back and forth to English, could accelerate their task-solving abilities / New Scientist (3 minute read)

  • on Epoch AI’s math benchmark, of which the best LLMs have not been able to solve more than 2% of the problems

Mathematical reasoning of this caliber demands more than just brute-force computation or simple algorithms. It requires what Fields Medalist Terence Tao calls “deep domain expertise” and creative insight. After reviewing the benchmark, Tao remarked, “These are extremely challenging. I think that in the near term, basically the only way to solve them is by a combination of a semi-expert like a graduate student in a related field, maybe paired with some combination of a modern AI and lots of other algebra packages.”

 

🎉 FOR FUN

  • they knew what they were doing with the name

  • uses AI to determine how terrible the terms of service you agree to are

  • lots of poop emojis; Charlotte Airport looks like the winner at 5/7 (still not great)

  • don’t look at the services aimed at kids

  • stop blindly giving agency over your data and privacy to everyone!

ESPN is testing a generative AI avatar called ‘FACTS’ — The AI bot’s analysis and commentary are supposed to ‘complement’ other hosts on ESPN’s SEC Nation show. / The Verge (3 minute read)

PokĂ©mon Go Players Have Unwittingly Trained AI to Navigate the World — Niantic says it is using data generated by PokĂ©mon Go players to create a “Large Geospatial Model” that can navigate the real world and power robots. / 404 Media (3 minute read)

Cat dance / David Szauder, Instagram (1 minute video)

  • David Bowie, cats, 70s fashion; you’re welcome

 

🧿 AI-ADJACENT

 

⋄