That AI Thing
weekend ai reads for 2024-05-24

May 24, 2024

📰 ABOVE THE FOLD: SCARLETT JOHANSSON & OPENAI

only because so much happened, in case you missed it or purposely tuned it out, the Scarlett Johansson story (so far):

May 13:

OpenAI released “Sky” last week, and Sam Altman even not-so-cryptically tweeted just the word “her”, probably referencing the 2013 Spike Jonze film, ‘Her’, in which the lead character develops a relationship with an AI companion, voiced by Scarlett Johansson (may be spoilers at the second link; also, the movie is over 10 years old so does it really still warrant a spoiler-warning?)

May 20:

Scarlett Johansson ‘Angered’ After ChatGPT Used ‘Eerily Similar’ Voice — OpenAI claimed it used the voice of a different actress to voice “Sky” / Rolling Stone (4 minute read)
Scarlett Johansson’s official statement; tl;dr is she says they asked to use her voice, she declined, and she believes they released a voice that sounds like hers anyway / Bobby Allyn (NPR), Threads
OpenAI released a statement in response, quoted in the same Threads link above:
Sky is not Scarlett Johansson’s, and it was never intended to resemble hers. We cast the voice actor behind Sky’s voice before any outreach to Ms. Johansson. Out of respect for Ms. Johansson, we have paused using Sky’s voice in our products. We are sorry to Ms. Johansson that we didn’t communicate better.

May 21:

OpenAI did suspend “Sky” while they sort this out: OpenAI suspends ChatGPT voice ‘that sounds like Scarlett Johansson’ / Sky News (2 minute read)

May 22:

as OpenAI are wont to do, they took this to the court of public opinion, sharing recordings of an actress hired to provide the voice for “Sky” with the Washington Post: OpenAI didn’t copy Scarlett Johansson’s voice for ChatGPT, records show / Washington Post (6 minute read)

and next:

OpenAI are still going to get sued, and they’re going to lose, based on Midler v. Ford Motor Co. and Waits v. Frito-Lay, Inc.; these cases ruled and upheld, respectively, that if a famous person’s voice is a distinctive characteristic of that person, then it is unlawful to imitate that voice without express written consent (disclaimer: not a lawyer)
OpenAI came off looking poorly, again: The Scarlett Johansson Incident Makes OpenAI Look Desperate / New York Magazine (6 minute read)
Setting aside the legal questions here, such behavior would align with some of the harshest criticism of Sam Altman and OpenAI — that it’s a company with little regard for the value of creative work led by a scheming, untrustworthy operator. This episode also complicates the company’s preferred narrative of unstoppable inevitability: You’re either the company harnessing the barely controlled phenomenon of imminent self-replicated machine intelligence, leading humanity into its next technological epoch, or you’re a mid-stage start-up that for some reason really needs to copy that voice from that movie to market an incremental product upgrade.

📻 QUOTE OF THE WEEK

You want to go grab the tub of ice cream and give up. Even though it felt like it was my fault, it really wasn’t. The system was working against job seekers.

Victor Schwartz (source: ‘You’re Fighting AI With AI’: Bots Are Breaking the Hiring Process / Wall Street Journal)

🏗️ FOUNDATIONS & CULTURE

Introducing Copilot+ PCs / Microsoft Blog (18 minute read)

Copilot+ PCs are the fastest, most intelligent Windows PCs ever built. With powerful new silicon capable of an incredible 40+ TOPS (trillion operations per second), all–day battery life and access to the most advanced AI models, Copilot+ PCs will enable you to do things you can’t on any other PC.

related (1), Microsoft’s AI chatbot will ‘recall’ everything you do on its new PCs / The Guardian (3 minute read)
related (2), Microsoft Team Copilot: A virtual team member to run meetings and projects / VentureBeat (3 minute read)
maybe Ray Dalio really was ahead of his time (e.g., Ray Dalio: It’s ‘Fantastic’ When We Play Conversations We’ve Recorded Back To Our Employees, 2014)

Faking William Morris, Generative Forgery, and the Erosion of Art History — Buying fake William Morris prints on Etsy and other early signs of epistemological collapse / Maggie Appleton (6 minute read)

To confound matters, the Etsy stores selling these generated images also sell genuine prints by Morris, Monet, Klimt, and Matisse. These sit alongside their modern expansion packs, blending right in. But which are which? You get coerced into playing an art historian forgery spotter.

Nvidia says 20K AI startups are building on its platform / VentureBeat (9 minute read)

“The next company who reaches the next major plateau gets to announce a groundbreaking AI, and the second one after that gets to announce something that’s 0.3% better,” Huang explained. “Time to train matters a great deal. The difference between time to train that is three months earlier is everything.”

related, Ten Gifts Nvidia Gave Its Investors / The Information ($) (6 minute read)
Even if competitors are able to build better chips, Huang reminded investors of the dominance of Cuda, the Nvidia-made software that AI app developers use in conjunction with its chips, essentially locking developers into Nvidia GPUs. Kress and Huang said recent improvements to Cuda improved inference performance of its H100 chips by three times. “That kind of tells you something about the richness of our architecture and the richness of our software,” Huang said. That, and app developers don’t want the hassle of switching to new software if they use another chip!

via renee, Women Leaders in Tech Are Paving the Way in GenAI / Boston Consulting Group (12 minute read)

🎓 EDUCATION

Does Emory University really not get how technology works? / AI Log, Substack (sorry) (8 minute read)

Emory University faces a lawsuit involving a student who built an AI study tool; the university awarded the tool a $10,000 prize and then suspended him for cheating for using a Canvas API (?)
this post is one perspective but Emory does not look great here

AI Robot Gives Graduation Speech at Buffalo’s D’Youville University / New York Times (6 minute read)

Andrew Fields, a D’Youville University student who started the petition, wrote in the petition that many students “feel disrespected” by the university’s decision to have a robot address them, especially those who could not attend their high school graduations in 2020 because of the coronavirus pandemic.

Microsoft and Khan Academy offer a free AI assistant to all US teachers — With Khanmigo for Teachers, K-12 educators can spend less time prepping lessons and more time with students. / Zdnet (3 minute read)

related, 5 great things to read or watch this summer / Bill Gates, Gates Notes (5 minute read)
referring to Sal Khan’s new book, Brave New Words: How AI Will Revolutionize Education (and Why That's a Good Thing)
Sal argues that AI will radically improve both outcomes for students and the experiences of teachers, and help make sure everyone has access to a world-class education. He’s well aware that innovation has had only a marginal impact in the classroom so far but makes a compelling case that AI will be different.

Opinion: What Is Higher Ed’s Role in Providing AI Job Skills? / Government Technology (7 minute read)

In addition to programming and technical skills, the next generation of AI developers may also need training in subjects traditionally aligned with liberal-arts education, such as ethics, problem-solving and communication.

related, Computer-Science Majors Graduate Into a World of Fewer Opportunities / Wall Street Journal ($) (4 minute read)

📊 DATA & TECHNOLOGY

MOMENT: A Family of Open Time-series Foundation Models / arXiv (57 minute read)

we are bullish on time-series foundation models

Linearly probing MOMENT achieves near state-of-the-art performance on most datasets and horizons, and is only second to PatchTST which generally achieves the lowest MSE. On many datasets and horizons, forecasting models based on LLMs– TimeLLM and GPT4TS perform worse than MOMENT. Notably, N-BEATS outperforms several recent methods, emphasizing the importance of comparing forecasting performance beyond transformer-based approaches.

AI’s Black Boxes Just Got a Little Less Mysterious / New York Times (6 minute read)

Researchers at the A.I. company Anthropic claim to have found clues about the inner workings of large language models, possibly helping to prevent their misuse and to curb their potential threats.

For example, they discovered that if they forced a feature linked to the concept of sycophancy to activate more strongly, Claude would respond with flowery, over-the-top praise for the user, including in situations where flattery was inappropriate.

related (1), from Anthropic: Mapping the Mind of a Large Language Model / Anthropic Blog (11 minute read)
related (2), the paper: Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet (132 minute read)

Introducing the Frontier Safety Framework / Google DeepMind Blog (6 minute read)

related (1), the report: Frontier Safety Framework [PDF]
related (2), Responsible Generative AI Toolkit / Google for Developers Blog

GPT-4o’s Chinese token-training data is polluted by spam and porn websites — The problem, which is likely due to inadequate data cleaning, could lead to hallucinations, poor performance, and misuse. / MIT Technology Review (11 minute read)

(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts / arXiv (111 minute read)

the researchers used a series of agents to replicate what a real translation company would do
We establish a virtual multi-agent translation company, TRANSAGENTS, featuring a diverse range of employees including a CEO, senior editors, junior editors, translators, localization specialists, and proofreaders. When a human client assigns a book translation task, a team of selected agents from TRANSAGENTS collaborates to translate the book. This paradigm simulates the entire book translation process, where agents with different roles work together to ensure that the translation maintains high quality and consistency throughout.
and the agents produced outputs that were preferred over human-written translations
Empirical findings indicate that despite lower d-BLEU scores, translations from TransAgents are preferred by both human evaluators and LLMs over human-written references, particularly in genres requiring domain-specific knowledge.
we are also bullish on agents

🎉 FUN and/or PRACTICAL THINGS

Google's AI Feature Suggested Using Glue to Keep Cheese on a Pizza / Business Insider (4 minute read)

A screenshot of the summary it generated, shared on X, shows it responded with “cheese can slide off pizza for a number of reasons,” and that the user could try adding “about ⅛ cup of non-toxic glue to the sauce to give it more tackiness.”

avoid the rush: Ten Blue Links — How to Turn Off AI Overview in Google and Set “Web” as Default
or just add “udm=14” to the search URL
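a minimal sketch of that second option, assuming the standard https://www.google.com/search endpoint and its usual "q" query parameter; the helper name web_only_search_url is made up for illustration, and Google could change how it treats "udm=14" at any time:

```python
from urllib.parse import urlencode


def web_only_search_url(query: str) -> str:
    """Build a Google search URL that asks for the plain "Web" results view.

    Assumption: "udm=14" is the parameter mentioned above; "q" carries the
    search terms. The function name is hypothetical, not part of any library.
    """
    params = {"q": query, "udm": "14"}
    # urlencode escapes the query (spaces become "+") and joins pairs with "&"
    return "https://www.google.com/search?" + urlencode(params)


if __name__ == "__main__":
    print(web_only_search_url("keep cheese on pizza"))
```

the same pattern, with %s in place of the query, is what you would typically paste into a browser's custom search engine settings to make this the default.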

Dupe.com

uses vision AI to find lower-cost items similar to some highfalutin one

Arrange — Generate a plan for anything and add it to your calendar in seconds.

looks promising; waitlist

Infinite Wonderland — This is an AI experiment where the timeless classic Alice’s Adventures in Wonderland is endlessly reimagined by artists, AI and you.

Wikipedia Citation Needed / Chrome Web Store

The Future Audiences team at the Wikimedia Foundation built an experimental new AI tool to check what Wikipedia has to say about what you’re reading, no matter where you are on the web, and we’d love your feedback.

🧿 AI-ADJACENT

Black Games Archive — Context, Culture, and Representation

Black Games Archive is a multimedia, public-facing database of games, digital resources, accessible scholarship, and designer interviews that are relevant to the intersections between Black culture, games, and play.

⋄