That AI Thing
weekend ai reads for 2024-06-28

June 28, 2024

📰 ABOVE THE FOLD: IMPACT ON CREATIVE FIELDS

The first film written by AI has arrived – and Hollywood is terrified / Telegraph (10 minute read)

When problems arose – from broad ones, like characters randomly switching sex, to more nuanced issues such as opaque motivation – he asked the programme to go back in and rework what it had come up with.

OpenAI CTO: AI Could Kill Some Creative Jobs That Maybe Shouldn't Exist Anyway / PC Magazine (3 minute read)

“Some creative jobs maybe will go away, but maybe they shouldn’t have been there in the first place,” the CTO said of AI’s role in the workplace. “I really believe that using it as a tool for education, [and] creativity, will expand our intelligence.”

the context (YouTube)

Labels body RIAA sues AI music firms Suno and Udio for copyright infringement / Music Ally (12 minute read)

the complaint [PDF]
the examples at the 404 Media link seem damning: Listen to the AI-Generated Ripoff Songs That Got Udio and Suno Sued / 404 Media (6 minute read)
AI is also the infinite monkey theorem in action with the addition of nudging toward things it has seen before, so these outcomes seem inevitable?
not legal advice or even useful insight

Peacock Unveils Personalized Olympic Recaps Featuring the Voice of Legendary Sports Announcer Al Michaels Generated with A.I. / NBC Sports (10 minute read)

NBC begins to ruin the Olympics earlier and earlier, doesn’t it?

📻 QUOTE OF THE WEEK

You think it’s cool to hate things. And it’s not. It’s boring.

Talk about what you love and keep quiet about what you don’t.

Liberal Arts (2012) (source)

🏗️ FOUNDATIONS & CULTURE

On Claude 3.5 Sonnet / Zvi Mowshowitz, Substack (sorry) (19 minute read)

The review by UK’s AISI is very good news, especially after Jack Clark’s statements that making that happen was difficult. Now that both DeepMind and Anthropic have followed through, hopefully that will put pressure on OpenAI and others to do it.

related (1), Why Anthropic’s Artifacts may be this year’s most important AI feature: Unveiling the interface battle / VentureBeat (6 minute read)

By creating a space where AI-generated content can be easily edited, refined, and incorporated into ongoing projects, Anthropic is bridging the gap between AI as a tool and AI as a teammate. This shift has the potential to revolutionize knowledge work across industries.

related (2), I think the Claude system prompt might already be out there, but here's what I got from claude-3.5-sonnet, for good measure / Pliny the Prompter, Twitter (sorry)

EU launches AI-powered ‘digital twin’ of the Earth / The Next Web (3 minute read)

AI Survey: Four Themes Emerging / Bain & Company (7 minute read)

Concerns about security and conversations around implementation are more deliberate and informed as companies have a better understanding of the challenges based on learnings from their pilot programs. Concerns about organizational readiness grew while those around quality and risk have declined.

related (?), How a CPG company can outcompete in digital and AI / McKinsey & Co. (18 minute read)

The CPG sector faces some unique challenges. The proliferation of data, for example, and its complexity—sources are scattered across retailers, suppliers, manufacturers, and consumers—have created massive issues in terms of harnessing the data to find, track, and capture value. At its core, the reason for this low success rate is that companies fail to perform the deep organizational surgery required to affect the broad-based change that’s needed. It’s never “just tech” when it comes to successful digital and AI transformations. Companies need to rewire how they work.

I Will Piledrive You If You Mention AI Again / Ludic Mataroa, Ludicity (23 minute read; star of the week so totally worth it)

Unless you are one of a tiny handful of businesses who know exactly what they're going to use AI for, you do not need AI for anything - or rather, you do not need to do anything to reap the benefits. Artificial intelligence, as it exists and is useful now, is probably already baked into your businesses software supply chain.

related (1), The HMEC Principle: Finding the Sweet Spot for Generative AI / Chris Gorgolewski, Medium (6 minute read)

Generative AI is most helpful when assisting humans by solving problems where solutions are Hard to Make, but Easy to Check (HMEC).

related (2), The New Language Model Stack — How companies are bringing AI applications to life / Sequoia Capital (12 minute read)
related (3), The call of LLMs is strong, we get to pick up the pieces later / Counting Stuff (7 minute read)

One thing is becoming quite clear in this space of “Let’s make an LLM generate DATA for us” efforts – regardless of whether it actually works on a conceptual or practical level, people who make decisions will most definitely make attempts to apply the technology to every domain they can. Stopping them ahead of time is likely an exercise in futility until they see it break themselves.

🎓 EDUCATION

Conversation with Edison Durán Lucena: Enhancing Education with AI-Powered Books — EdTech startup secures $16,000 prize, plans to revolutionize publishing with augmented reality and AI for K-12 education. / Contxto (5 minute read)

Akadimia — Step into the future of learning with Akadimia AI, an immersive educational platform that brings history’s icons as your AI mentor

iOS only
character.ai with historical people like MLK & Dalí to support interactive learning (?)

Coach — AI-powered career development for every learner

collegiate (?) and career pathways; seemingly designed for pre-postsecondary students

AI Tutors Don't Know When to Stop Shutting Up — The chatbots don't know when to start chatting with learners. / Dan Meyer, Substack (sorry) (5 minute read)

mostly describing a UX problem

📊 DATA & TECHNOLOGY

Frontiers in synthetic data / Nathan Lambert, Interconnects, Substack (sorry) (13 minute read)

related (1), Report - State of Data AI 2024 / Hakkoda (21 minute read)

Correlations are also beginning to emerge between organization size, revenue, and rates of AI deployment—though the largest and highest-grossing organizations aren’t necessarily the most successful early AI adopters. Rather, the right blend of raw investment power and an agile data strategy seems to be the single greatest predictor of success.

related (2), Data Is An Agenda. / Off Kilter (8 minute read)

In other words, if we rely on black box systems, thinking the error percentage looks pretty good while having no idea what that error percentage covers or how the system even works, then we owe it to ourselves to understand just how big the risk potentially is.

Announcing the AI Forecasting Benchmark Series / Metaculus (8 minute read)

the contest is uninteresting (to us) but the things to think about when testing prompt engineering are quite useful
related (1), 𝜏-bench: Benchmarking AI Agents for the Real-World / Sierra (10 minute read)

Drawing on our experience with live agents in production, we distilled the requirements for a realistic agent benchmark to three key points. First and foremost, most real-world settings require agents to interact seamlessly with both humans and programmatic APIs over long horizons, in order to incrementally gather information and solve complex problems.

related (2), Open-LLM performances are plateauing, let’s make the leaderboard steep again / Hugging Face Blog (7 minute read)
related (3), Salesforce debuts gen AI benchmark for CRM — The software company’s new generative AI evaluation tool for CRM will help businesses choose the best LLM for a given job. / CIO Magazine (4 minute read)

Building a personalized code assistant with open-source LLMs using RAG Fine-tuning / Together Blog (12 minute read)

Furthermore, beyond their benefits in quality, when you build and deploy the RAG fine-tuned models in the Together platform, they also offer significant economic advantages, up to 150x cheaper, while it’s 3.7x faster during inference.

AI-Flow — Open-source platform for creating custom AI tools through a simple drag and drop interface, designed for innovators and creators

quite useful; complicated to set up locally

AI scaling myths — Scaling will run out. The question is when. / AI Snake Oil, Substack (sorry) (12 minute read)

🎉 FUN and/or PRACTICAL THINGS

How to use AI prompt chains for better results / Geeky Gadgets (9 minute read)

At their core, prompt chains are carefully crafted sequences of prompts designed to break down intricate processes into manageable steps. By decomposing a complex task into a series of interconnected prompts, each building upon the output of the previous one, prompt chains enable the AI to generate more coherent and contextually relevant outputs.

useful prompting advice
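
a rough sketch of the prompt-chain idea quoted above, assuming a chain is just an ordered list of prompt templates where each step receives the previous step's output. `run_chain`, `echo_model`, and the `{previous}` placeholder are hypothetical names made up for illustration (they are not from the article), and `echo_model` is a stand-in for whatever LLM call you actually use:

```python
from typing import Callable, List


def run_chain(steps: List[str], model: Callable[[str], str], task: str) -> List[str]:
    """Run prompt templates in order, feeding each output into the next prompt."""
    outputs = []
    previous = task  # the first step works on the original task text
    for template in steps:
        prompt = template.format(previous=previous)
        previous = model(prompt)  # this output becomes the next step's input
        outputs.append(previous)
    return outputs


def echo_model(prompt: str) -> str:
    # Hypothetical stand-in for a real model call; it just wraps the prompt.
    return f"[answer to: {prompt}]"


steps = [
    "List the key points in: {previous}",
    "Draft an outline from these points: {previous}",
    "Write a short summary following this outline: {previous}",
]

results = run_chain(steps, echo_model, "notes from this week's AI reads")
for number, text in enumerate(results, start=1):
    print(number, text)
```

swapping `echo_model` for a real client call is the only change needed to try this against an actual model; keeping every intermediate output in `results` makes it easier to see which step went wrong.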

Pixlyze — Detect AI Images

Recognizing affective states from the expressive behavior of tennis players using convolutional neural networks / Science Direct (59 minute read)

Our CNN-based models demonstrate an accuracy rate of up to 68.9%, outperforming or matching human observers in many instances. Intriguingly, both the machine learning models and human observers exhibited a shared propensity to more effectively identify negative affective states, which may be attributed to the more intense and straightforward expression of these states.

having played tennis semi-competitively, we can say “expressive behavior” is rarely vague, particularly for negative affective states; the accuracy rate seems really low for both models and humans (source: observed many racquets being broken mid-set)
regardless, the techniques used are interesting

Ario — a consumer AI assistant that initially focuses on the everyday tasks that are universal to so many of us: managing time, schedules, and family obligations

iOS only
free; and no data collection

🧿 AI-ADJACENT

Asking AI to generate the Tour de France — Nailed it. / crit.meme, Instagram (sorry) (1 minute video)

reminder: the Tour de France starts tomorrow, 29 June, in Florence

⋄