- That AI Thing
- Posts
- weekend ai reads for 2024-11-22
weekend ai reads for 2024-11-22
programming note: we are off next week and will be back on 01 December
đ° ABOVE THE FOLD: LONGREADS
War and Peace in the Age of Artificial Intelligence â What It Will Mean for the World When Machines Shape Strategy and Statecraft / Foreign Affairs (21 minute read)
by Henry A. Kissinger, Eric Schmidt, and Craig Mundie
Generative Agent Simulations of 1,000 People / arXiv (18 minute read)
The generative agents replicate participantsâ responses on the General Social Survey 85% as accurately as participants replicate their own answers two weeks later, and perform comparably in predicting personality traits and outcomes in experimental replications. Our architecture reduces accuracy biases across racial and ideological groups compared to agents given demographic descriptions.
related (?), TinyTroupe â LLM-powered multiagent persona simulation for imagination enhancement and business insights. / Microsoft, GitHub
Which Foundation Model is best for Agent Orchestration / Artificial Intelligence Made Simple, Substack (sorry) (17 minute read)
GPT o1 (by far the worst)- As an orchestrator, I have nothing positive to say about o1. Itâs terrible at following instructions, adds 30 lines of commentary where it shouldnât (messes up the inputs being sent to the tools), and is just generally awful. Working with o1 made me want to call my parents and apologize for every time I disobeyed them
Inside the Booming âAI Pimpingâ Industry â AI-generated influencers based on stolen images of real-life adult content creators are flooding social media. / Wired (29 minute read)
nothing explicit but depending where you work, some images might not be appropriate
How Did You Do On The AI Art Turing Test? / Astral Codex Ten, Substack (sorry) (17 minute read)
Your instincts were worst for Impressionism; you identified every single Impressionist painting as human except the sole actually-human Impressionist work in the dataset (Paul Gauguinâs Entrance To The Village Of Osny).
the average was 60%, or slightly better than a coin flip
đ» QUOTE OF THE WEEK
Itâs actually pretty fun. I already talk to myself all the time, but now I get to do it with an AI.
Rex Woodbury (source)
đ„ FOR EVERYONE
AI and The Last Mile / Hollis Robbins, Anecdotal, Substack (sorry) (9 minute read)
The issue isnât that AI assistants get things wrong â itâs that they get things almost right in ways that can be more dangerous than obvious errors. Theyâre missing local knowledge: that messy, contextual, contingent element that often makes all the difference.
Mistral unleashes Pixtral Large, upgrades Le Chat with image gen / Venture Beat (7 minute read)
le chat is a good alternative to the others and itâs free (?)
Ben Affleck Shares Candid Take on the Positive Use of AI in Hollywood, but Doesn't See It Threatening Creativity / Movie Web (6 minute read)
Craftsman is knowing how to work. Art is knowing when to stop. And I think knowing when to stop is going to be a very difficult thing for AI to learn because [of] itâs taste, and also lack of consistency, lack of controls, lack of quality.
related (literally), Casey Affleck on Metaâs New AI Tool and How It Will Help Filmmakers â âI think that we just have to be really conscious of how we start to incorporate it,â Affleck told IndieWire. / Indiewire (6 minute read)
đ FOUNDATIONAL
What are AI guardrails? â AI guardrails help ensure that an organizationâs AI tools, and their application in the business, reflect the organizationâs standards, policies, and values. / McKinsey & Company (11 minute read)
LangChain State of AI Agents Report / Lang Chain (10 minute read)
We surveyed over 1,300 professionals â from engineers and product managers to business leaders and executives â to uncover the state of AI agents. Dive into the data as we break down how AI agents are being used (or not) today.
How to make a Claude RAG/Document chatbot - (No Code) / Pickaxe A.I., YouTube (12 minute video)
đ FOR LEADERS
AI Makes Tech Debt More Expensive / Gauge blog (4 minute read)
Instead of trying to force genAI tools to tackle thorny issues in legacy codebases, human experts should do the work of refactoring legacy code until genAI can operate on it smoothly. When direct refactoring is still too risky, teams can adjust their development strategy with approaches like strangler fig to build greenfield modules which can benefit immediately from genAI tooling.
AI Investments Are Booming, but Venture-Firm Profits Are at a Historic Low / Wall Street Journal (7 minute read)
Last year, U.S. venture firms invested $60 billion more than they collected, the highest such deficit in PitchBookâs 26 years of data. As a result, the investors that back VC firms, such as university endowments and pension funds, arenât seeing the type of profits the industry has long delivered.
Scaling AI: Find the Right Strategy for Your Organization / Gartner (6 minute read)
fine article but they throw a confusing, seemingly unrelated sandwich graphic at the top
đ FOR EDUCATORS
via brandon, Dimensions of AI Literacies / Opened Culture (9 minute read)
The Dimensions of AI Literacies (presented below) offer a flexible and transformative lens for understanding teaching and learning practices in an AI-enabled world. We invite you to explore this taxonomy to reflect on the challenges and opportunities presented by AI in education, leveraging a common vocabulary to identify and share effective practices for meaningfully implementing AI-supported approaches into the classroom.
PaperGen â AI-Driven Writing Assistant for Business & Academia
PaperGen helps you generate well-structured long-form papers with fully referenced citations. It ensures originality, clarity, and precision with AI detection bypassing for a more human-like writing experience.
PaperGen crafts content that feels authentic and engaging, blending automation bypassing AI detection for any use cases.
OpenAI launches free AI training for educators - how to access it â This one-hour course, developed with Common Sense Media, helps teachers learn gen AI basics. Here's how to access it. / Zdnet (3 minute read)
related, A Studentâs Guide to Writing with ChatGPT / OpenAI blog (6 minute read)
đ FOR TECHNOLOGISTS
You can run powerful open source AI locally with Exo Labs / Venture Beat (8 minute read)
As he shared on the social network Twitter recently, the UK-based Cheema connected four Mac Mini M4 devices (retail value of $599.00) plus a single Macbook Pro M4 Max (retail value of $1,599.00) with Exoâs open source software to run Alibabaâs software developer-optimized LLM Qwen 2.5 Coder-32B.
After all, with the total cost of Cheemaâs cluster around $5,000 retail, it is still significantly cheaper than even a single coveted NVidia H100 GPU (retail of $25,000-$30,000).
everyone with an under-utilized data center should be using Exo; itâs as good as advertised
Droidspeak: AI models work together faster when they speak their own language â Letting AI models communicate with each other in their internal mathematical language, rather than translating back and forth to English, could accelerate their task-solving abilities / New Scientist (3 minute read)
reminded us of this story from seven years ago, Fact check: Facebook chatbots werenât shut down for creating language / USA Today (5 minute read)
AIâs math problem: FrontierMath benchmark shows how far technology still has to go / Venture Beat (8 minute read)
on Epoch AIâs math benchmark, of which the best LLMs have not been able to solve more than 2% of the problems
Mathematical reasoning of this caliber demands more than just brute-force computation or simple algorithms. It requires what Fields Medalist Terence Tao calls âdeep domain expertiseâ and creative insight. After reviewing the benchmark, Tao remarked, âThese are extremely challenging. I think that in the near term, basically the only way to solve them is by a combination of a semi-expert like a graduate student in a related field, maybe paired with some combination of a modern AI and lots of other algebra packages.â
đ FOR FUN
they knew what they were doing with the name
uses AI to determine how terrible the terms of service you agree to are
lots of poop emojis; Charlotte Airport looks like the winner at 5/7 (still not great)
donât look at the services aimed at kids
stop blindly giving agency over your data and privacy to everyone!
How Coca-Cola Reinvented a Holiday Classic With AI / Adweek (7 minute read)
itâs objectively flawed; why?
ESPN is testing a generative AI avatar called âFACTSâ â The AI botâs analysis and commentary are supposed to âcomplementâ other hosts on ESPNâs SEC Nation show. / The Verge (3 minute read)
PokĂ©mon Go Players Have Unwittingly Trained AI to Navigate the World â Niantic says it is using data generated by PokĂ©mon Go players to create a âLarge Geospatial Modelâ that can navigate the real world and power robots. / 404 Media (3 minute read)
Meet your own personal AI Jesus in this Swiss churchâs confessional / Tech Radar (5 minute read)
Cat dance / David Szauder, Instagram (1 minute video)
David Bowie, cats, 70s fashion; youâre welcome
đ§ż AI-ADJACENT
Question: Are you asking the ârightâ questions? / Big Think (11 minute read)
â