- That AI Thing
- Posts
- weekend ai reads for 2025-01-10
weekend ai reads for 2025-01-10
š° ABOVE THE FOLD: SPORTS
Football coaches could soon be calling on AI to scout the next superstar / The Guardian (6 minute read)
global football, that is
Using what it claims is the largest video database of global youth football ā with players logged from 28 countries ā the company says it can now determine which young players most fit the description of current or recent top stars as defined by one of eight archetypes. These include the ideal ābox-to-box midfielderā, āmodern No 9ā, āplaymaking No 10ā and āinverted wing-backā.
Is AI The Answer To Standardizing Targeting Calls In College Football? / Forbes (9 minute read)
American football, that is
Wimbledon ditches human line judges for electronic line calling ā The grass-court Grand Slam announced it will switch to electronic-line calling next year. / The Verge (6 minute read)
š» QUOTES OF THE WEEK
Because one agent is just software, two agents are an undebuggable mess.
Andriy Burkov (source)
Weāre not adding any more software engineers next year because we have increased the productivity this year with Agentforce and with other AI technology that weāre using for engineering teams by more than 30%, to the point where our engineering velocity is incredible. I canāt believe what weāre achieving in engineering.
Marc Benioff (source)
š„ FOR EVERYONE
By default, capital will matter more than ever after AGI / Less Wrong (85 minute read)
This means that those with significant capital when labour-replacing AI started have a permanent advantage. They will wield more power than the rich of todayānot necessarily over people, to the extent that liberal institutions remain strong, but at least over physical and intellectual achievements. Upstarts will not defeat them, since capital now trivially converts into superhuman labour in any field.
the message, as always: being poor is worse than not being poor
Once It Has Been Trained, Who Will Own My Digital Twin? / The Scholarly Kitchen (12 minute read)
spoiler: probably not you
o3 āARC AGIā postmortem megathread: why things got heated, what went wrong, and what it all means / Gary Marcus, Substack (sorry) (5 minute read)
a slightly less-than-usual curmudgeonly contextualizing of the o3 news
The AI Reporter That Took My Old Job Just Got Fired ā A local newspaper in Hawaii experimented with AI-generated presenters to engage and boost its readership. After two months, the bots have been shelved. / Wired (8 minute read)
In one particularly stilted exchange about the pumpkin giveaway, Rose asked James, āAnd how have these free pumpkins impacted the community?ā to which James responded, āThe free pumpkins have brought joy to many.ā
Mechanized minds: AIās hidden impact on human thought ā While weāre busy wondering whether machines will ever become conscious, we rarely stop to ask: What happens to us? / Big Think (18 minute read)
Instead of asking whether machines will ever become conscious, we might ask whether humans can become conscious enough to outgrow the āartificial intelligenceā both inside them and in the machines around them.
š FOUNDATIONS
Somethingās Coming ā This post is meant to be an explainer for friends and readers who havenāt been paying close attention to whatās been happening in AI. / John August (12 minute read)
How to Write AI Art Prompts? (Examples + Templates) / Hypotenuse AI (16 minute read)
the new models, especially the Fluxes, are even more funner when you prompt them well
āContemplative reasoningā response style for LLMs like Claude and GPT-4o / Maharshi Pandya, Github
an easy prompt to copy-paste into your LLM
even works with Gemma and Mistral
š FOR LEADERS
Integrating AI Agents into Companies / Austin Vernonās Blog (7 minute read)
valuable throughout
1. Massively increase the use of wikis and other written content.
ā¦
2. Move from reviews to standardized pre-approvals and surveillance.
related (1), Your AI Agent Probably Should Be a Workflow / Tobias Zwingmann (8 minute read)
related (2), Five contrarian ideas about genAI in the workplace ā Everyone knows of AI, but do they know AI? / Exponential View (7 minute read)
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks / Carnegie Mellon University, Duke University, and independent researchers, arXiv (58 minute read)
We test baseline agents powered by both closed API-based and open-weights language models (LMs), and find that with the most competitive agent, 24% of the tasks can be completed autonomously. This paints a nuanced picture on task automation with LM agents -- in a setting simulating a real workplace, a good portion of simpler tasks could be solved autonomously, but more difficult long-horizon tasks are still beyond the reach of current systems.
Gen AI Present and Future: A Conversation with Rashmi Kumar, SVP and CIO at Medtronic / Greylock (9 minute read)
While we are committed to making these types of investments, we recognize that we cannot do everything ourselves. We need to tap into the platforms and expertise that technology companies can provide, as demonstrated by our recently announced collaboration between Tempus and Medtronic Structural Heart.
š FOR EDUCATORS
āI received a first but it felt tainted and undeservedā: inside the university AI cheating crisis / The Guardian (15 minute read)
What counts as cheating is determined, ultimately, by institutions and examiners. Many universities are already adapting their approach to assessment, penning āAI-positiveā policies.
via george, Interactionalism: Re-Designing Higher Learning for the Large Language Agent Era / Mihnea C. Moldoveanu, George Siemens, arXiv (32 minute read)
We introduce Interactionalism as a new set of guiding principles and heuristics for the design and architecture of learning now available due to Generative AI (GenAI) platforms. Specifically, we articulate interactional intelligence as a net new skill set that is increasingly important when core cognitive tasks are automatable and augmentable by GenAI functions.
Educating Our Kids on AI | Free Live Event ā January 23, 2025, 3-4 p.m. ET / Section School
Featuring Garrett Smiley, CEO & Co-founder of Sora Schools, John Danner, Co-founder of Project Read AI and Spark Space, and Ted Dintersmith, Founder of WhatSchoolCouldBe.org
not sure if this is āeducating our kids about A.i.ā or āeducating our kids using A.i.ā ā let us know what you find out
š FOR TECHNOLOGISTS
How to Build a Truly Useful AI Product ā Generative AI breaks the old startup playbook / Thesis, Every (12 minute read)
How I program with LLMs / David Crawshaw (23 minute read)
Agents / Chip Huyen (41 minute read)
This section will start with an overview of agents and then continue with two aspects that determine the capabilities of an agent: tools and planning. Agents, with their new modes of operations, have new modes of failure. This section will end with a discussion on how to evaluate agents to catch these failures.
š FOR FUN
Evaluating Large Language Modelsā Capability to Launch Fully Automated Spear Phishing Campaigns: Validated on Human Subjects / Harvard Kennedy School, Avant Research Group, and independent researchers, arXiv (71 minute read)
A control group of arbitrary phishing emails, which received a click-through rate (recipient pressed a link in the email) of 12%, emails generated by human experts (54% click-through), fully AI-automated emails 54% (click-through), and AI emails utilizing a human-in-the-loop (56% click-through). Thus, the AI-automated attacks performed on par with human experts and 350% better than the control group.
not sure whether this is a stronger signal about AIās strengths or the test groupās weaknesses
the lag between this and when spam filters catch up is going to be frustrating
LG and Samsung are adding Microsoftās Copilot AI assistant to their TVs / The Verge (4 minute read)
A new, uncensored AI video model may spark a new AI hobbyist movement / Ars Technica (7 minute read)
it wonāt
lots of interesting examples at the link
the last one, āA young woman doing a complex floor gymnastics routine at the Olympics, featuring running and flips.ā is our new go-to GIF response for every situation
My Stupid Friend / Chrome Web Store
Reclaims the internet by replacing all instances of āChatGPTā with āmy stupid friend.ā
š§æ AI-ADJACENT
The 7 Coolest Mathematical Discoveries of 2024 / Scientific American (6 minute read)
They used information theory to find patterns in his music that help explain how Bach conveyed messagesāincluding musical, mathematical and emotional informationāthrough his works.
there is such a thing as model-overfitting
ā