That AI Thing
Posts
weekend ai reads for 2024-01-26

weekend ai reads for 2024-01-26

January 26, 2024

THAT AI THING HAS MOVED

If you haven’t noticed, we’ve moved to Beehiiv.

A co-founder of Substack, Hamish McKenzie, has basically said that Substack intends to support and protect Nazis on their platform. That seems like plenty of reason to remove our even nominal association with Substack.

Going forward, we will regrettably have links to content on Substack. We will acknowledge that with an apology, just like we apologize when we send you to Twitter. We will continue to prioritize authors and publishers that don’t align themselves with Nazis and hate speech.

We have spent time and money to move everything over. It took a while but well worth it. We have some grand(ish) plans for 2024; some of those may take a little longer because of this annoying distraction, but we’re excited to share those with you soon.

On to the show …

📰 ABOVE THE FOLD: TEACHING & STUFF

Teaching CS50 with AI — Leveraging Generative Artificial Intelligence in Computer Science Education [PDF] Harvard University

In this paper, we detail how AI tools have augmented teaching and learning in CS50, specifically in explaining code snippets, improving code style, and accurately responding to curricular and administrative queries on the course’s discussion forum. Additionally, we present our methodological approach, implementation details, and guidance for those considering using these tools or AI generally in education

Economist Tyler Cowen on How ChatGPT Is Changing Your Job (1:07:26) Every, YouTube

short portion (~7:00) on “Using ChatGPT in university classrooms”

Google announces new AI-powered features for education TechCrunch

The company added that the Practice sets feature, which uses AI to create answers and general hints, is now available in over 50 languages. Plus, educators can turn a Google Form into a practice set.

Miko Mini — GPT-Powered Conversational Learning Robot For Kids

Miko Mini is the perfect companion for kids, offering meaningful interactions, realistic reactions, impressive dance moves, and engaging educational STEAM content. It is the ideal teacher and companion for kids.

A Technologist Spent Years Building an AI Chatbot Tutor. He Decided It Can’t Be Done. EdSurge News

To Nitta, the stronger role is to serve as an assistant to experts rather than a replacement for an expert tutor. In other words, instead of replacing, say, a therapist, he imagines that chatbots can help a human therapist summarize and organize notes from a session with a patient.

Hologram lecturers thrill students at trailblazing UK university The Guardian

The university’s pro-vice-chancellor, Prof Rachel Thomson, said the technology could help the university achieve its sustainability strategy by reducing the need to fly in guest speakers and by facilitating international research collaborations, as well as by reducing the amount of material used by students building prototypes in engineering, design and the creative arts.

📻 QUOTE OF THE WEEK

I’m sure some of them are legitimately concerned about safety, but it’s a hell of a thing how much it lines up with the strategy.

Mark Zuckerberg (source)

🏗️ FOUNDATIONS & CULTURE

ChatGPT is an engine of cultural transmission Programmable Mutter, Substack (sorry)

Our capacity as humans to get things right or wrong depends on our relationship to base reality, and our ability to try to solve the “inverse problem” of mapping how this reality works. LLMs don’t have that opportunity to explore and try to figure out what causes what.

Beyond AI Exposure: Which Tasks are Cost-Effective to Automate with Computer Vision? [PDF] Massachusetts Institute of Technology, Computer Science & Artificial Intelligence Laboratory

Because computer vision, as it stands today, only has an economic advantage in 23% of vision tasks at the firm-level and barriers to AI-as-a-service deployments exist, there will most likely need to be a sharp reduction in cost for computer vision to replace human labor.

Even with a 50% annual cost decrease, it will take until 2026 before half of the vision tasks have a machine economic advantage and by 2042 there will still exist tasks that are exposed to computer vision, but where human labor has the advantage. At a 10% annual system cost decrease, computer vision market penetration will still be less than half of exposed task compensation by 2042.

Now decides next: Insights from the leading edge of generative AI adoption [PDF] Deloitte

Organizations that report very high expertise in generative AI tend to feel more positive about it—but also more pressured and threatened

and

Most organizations are still primarily relying on off-the-shelf generative AI solutions.

and

Talent, governance and risk are critical areas where generative AI preparedness is lacking

OpenAI Bans Use of AI Tools for Campaigning, Voter Suppression Wall Street Journal

The company said it also banned applications that discouraged voting—by claiming a vote was meaningless, for example. Questions about how and where to vote would be addressed with a link to CanIVote.org, operated by the National Association of Secretaries of State.

related, Most Top News Sites Block AI Bots. Right-Wing Media Welcomes Them Wired

UK government says AI-reproduced copyrighted works violate law without license or exception Decoder

The UK government has decided not to introduce broad copyright exceptions for text and data mining for AI systems. In addition, the reproduction of content by AI systems violates copyright law.

🎓 EDUCATION

NC public schools agency recommends educators ‘rethink’ plagiarism in the age of ChatGPT (Charlotte) News & Observer

the report [PDF] with a good bike analogy

This analogy demonstrates that without AI, some students’ struggles will inhibit learning, like a mountain bike; while with too much reliance on and lack of understanding of AI is unpredictable and can even be harmful like a motorcycle. Ideally, AI would be used like an E bike, with the human in control.

AI and Higher Education — Implications for Learning, Curricula, and Institutions American Association of Colleges and Universities

recording at the link for AAC&U members

Graide — AI grading. Increase feedback and save time.

Digital SAT Math Prep

Digital SAT Math Prep powered by Active Recall AI

100,000+ Personalized Test-Like Questions

Loneliness and suicide mitigation for students using GPT3-enabled chatbots Nature Mental Health Research

The combination of conversational ability, embodiment, and deep user engagement shows a pathway for generalist Intelligent Social Agents to aid students in informal contexts, scaffolding their stress and mental health and even countering suicidal ideation.

📊 DATA & TECHNOLOGY

via adam, More than an OpenAI Wrapper: Perplexity Pivots to Open Source The New Stack

But over the past year, Perplexity has evolved rapidly. It now has its own search index and has built its own LLMs based on open source models. They’ve also begun to combine their proprietary technology products.

paraphrasing adam:
- Developers are getting better at steering models to cite sources and thus, hallucination mitigation
- A few LLM-first orgs are finding product-market fit; “our product is LLM-assisted search summaries” is a clear, succinct statement of what they're trying to do
- Most importantly, open source models (Mistral, Llama) are being used in production to complement GPT-4 and make real money

The Modern AI Stack: Design Principles for the Future of Enterprise AI Architectures Menlo Ventures

Today, we’re excited to share our thesis for how AI development will evolve, as well as the core infrastructure components that will combine to create the modern AI stack—the new runtime architecture that will drive AI applications for the coming decade.

and

Four Key Design Principles for the New AI Infrastructure Stack

1. The Majority of Spend Is for Inference vs. Training

2. We Live in a Multi-Model World

3. RAG Is the Dominant Architectural Approach

4. All Developers Are Now AI Developers

Sampling for Text Generation Chip Huyen

To understand why AI’s responses are probabilistic, we need to understand how models generate responses, a process known as sampling (or decoding).

primer on sampling

OpenAI Quietly Scrapped a Promise to Disclose Key Documents to the Public Wired

From its founding, OpenAI said its governing documents were available to the public. When WIRED requested copies after the company’s boardroom drama, it declined to provide them.

the “open” part might be becoming more conceptual

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text arXiv

Over a wide range of document types, Binoculars detects over 90% of generated samples from ChatGPT (and other LLMs) at a false positive rate of 0.01%, despite not being trained on any ChatGPT data.

may have implications for “plagiarism” detection

ArtificialAnalysis.ai — Independent analysis of AI models and hosting providers - choose the best model and API hosting provider for your use-case

One of the world's largest AI training datasets is about to get bigger and ‘substantially better’ VentureBeat

But far from stopping their dataset work, EleutherAI is now building an updated version of the Pile dataset, in collaboration with multiple organizations including the University of Toronto and the Allen Institute for AI, as well as independent researchers.

A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism arXiv

Multi-way parallel, machine generated content not only dominates the translations in lower resource languages; it also constitutes a large fraction of the total web content in those languages.

the widespread use of low-quality machine translation (MT) for generating content in multiple languages suggests that training models, such as multilingual large language models, on data scraped from the web may result in less fluent models with more errors
additionally, this paper shows evidence of a selection bias in the type of content that is translated into many languages, indicating that the data may be of lower quality even before considering MT errors

Coding on Copilot: 2023 Data Suggests Downward Pressure on Code Quality (incl 2024 projections) [PDF] GitClear

Code churn -- the percentage of lines that are reverted or updated less than two weeks after being authored -- is projected to double in 2024 compared to its 2021, pre-AI baseline. We further find that the percentage of “added code” and “copy/pasted code” is increasing in proportion to “updated,” “deleted,” and “moved” code.

🎉 FUN and/or PRACTICAL THINGS

Runway had to go and ruin my day. Rory Flynn, Twitter (sorry)

demos animating images from Midjourney in Runway

Lumiere — A Space-Time Diffusion Model for Video Generation Google Research

examples at the link

Calorieasy — Track calories in seconds with AI

tracks calories from photos of meals

Same Energy

“visual search engine”; returns many images similar to the one selected
seemingly endless library

Emergent Mind

review of AI/ML papers on arXiv with AI-generated summaries

ArxivPaperAI — Summarize papers in seconds, and chat more insight with the power of chatgpt

free tier is limited to one paper at a time (you can delete one before uploading a new one), and five questions per day

Google Chrome is getting AI features for organizing tabs and custom themes The Verge

Google is adding its “Help me write” feature to every site on the web; you just right-click on any text box anywhere, select the feature, and Google’s AI will ask you what you want to write and then generate a first draft for you. Google suggests you might use “Help me write” to write reviews and emails or RSVP to parties.

Test Yourself: Which Faces Were Made by A.I.? New York Times

🧿 AI-ADJACENT

Discover how designers create fantasy apps for TV and film WePresent

But what’s the aim of [Fictional UI]? In some cases, it’s simply to make phones seem real or narrative settings function as a whole. Rather than foreground the tech, it’s often used in the background, sometimes designed specifically for the film or show due to copyright reasons. Finder-Spyder, for example, is a bogus Google featured in countless major films.

For now, though, sit back and immerse yourself in the fictional interfaces of five futuristic shows and films—“Black Mirror”, “Her”, “Extrapolations”, “Ted Lasso” and “Red Rose”—hearing the creative process from the—thankfully for now, human—designers.

⋄