That AI Thing
Posts
weekend ai reads for 2024-06-14

weekend ai reads for 2024-06-14

June 14, 2024

📰 ABOVE THE FOLD: DIGITAL TRANSFORMATION

AI Intensifies Digital Transformation — Findings from the State of Entperise AI and the Enterprise Cloud Index reveal why IT leaders are enabling AI applications and leveraging AI to manage their complex data center operations. / Nutanix (14 minute read)

Forerunner's 2024 Consumer Trend Report / Forerunner Ventures

In this year’s report we discuss the latest developments on 3 values shifts and 1 technological shift, including the potential upsides and downsides, the innovative companies spurring the shift, and white space opportunity.

PDF at link; requires email

What Generative AI Means for Business / Gartner

The State of AI at Work [PDF] / Asana with Anthropic

📻 QUOTE OF THE WEEK

I’m the CEO of a two-month-old company and have been able to get meetings with CEOs of 160-year-old publishing companies

James Smith, CEO of Human Native AI (source)

🏗️ FOUNDATIONS & CULTURE

WWDC, Apple Intelligence, Apple Aggregates AI / Stratechery by Ben Thompson (15 minute read)

In other words, to the extent that Musk hates OpenAI, he should be happy about this partnership: Apple is clearly not sharing private data with OpenAI, and honestly the warnings it throws up every time you access the service are probably going to get pretty annoying pretty quickly; what the company is doing is providing a standardized interface for OpenAI to get access to potential customers for impressive yet commoditized use cases that Apple doesn’t need to spend resources on, because OpenAI and any of its would be competitors will be compelled to make the investment and accept Apple’s terms in an attempt to find some sort of sustainable advantage.

related (1), Apple's AI Strategy in a Nutshell / Enterprise AI Trends, Substack (sorry) (7 minute read)

on-edge LLM inference: a small, low latency AI model (3bn params) will be included in future iOS versions, and it will be able to understand user commands, the current screen, and take actions on apps.

private cloud compute: The on-device LLM may decide to offload certain complex tasks to more powerful models hosted in Apple’s data centers (called “private cloud compute”).

3rd party model inference: Users will also have the option to use OpenAI’s ChatGPT directly from Siri or certain iOS apps. Note, this is not the same as using ChatGPT as a replacement for Siri - which is what many thought the OpenAI partnership meant. Rather, ChatGPT is provided as an alternative to Apple’s models in certain situations (e.g. user is about to perform email revision, and ChatGPT’s response is offered as a choice).

related (2), Introducing Apple’s On-Device and Server Foundation Models / Apple Machine Learning Research (18 minute read)
related (3), Private Cloud Compute: A new frontier for AI privacy in the cloud / Apple Security Research (23 minute read)
related (4), How Siri Made Apple Cautious About AI / New York Magazine (10 minute read)

Situational Awareness — The Decade Ahead [PDF] / Leopold Aschenbrenner

only bringing this up because it got a lot of traction
we’re not convinced (a) that the doom scenarios are imminent and (b) the government really speaks for us
also, arbitrary markers like “elementary schooler” and “smart high schooler” on a graph raise our antennae
related, OpenAI Insider Estimates 70 Percent Chance That AI Will Destroy or Catastrophically Harm Humanity / Futurism (4 minute read)

Developing an LLM: Building, Training, Finetuning / Ahead of AI, Substack (sorry) (58:46 video at link)

If your weekend plans include catching up on AI developments and understanding Large Language Models (LLMs), I've prepared a 1-hour presentation on the development cycle of LLMs, covering everything from architectural implementation to the finetuning stages.

related (1), Generative AI for Beginners — Learn the fundamentals of building Generative AI applications with our 18-lesson comprehensive course by Microsoft Cloud Advocates. / Microsoft, Github
- good content, lot of reading and decent videos
- obviously Azure-centric
related (2), Say What You See — Learn the art of image prompting with the help of Google Al. / Google Arts & Culture
- a proper fun way to learn about prompting

🎓 EDUCATION

How Students are Actually Using Generative AI as Early Adopters — Discover how students are leveraging AI to transform learning / AI Supremacy, Substack (sorry) (22 minute read)

related, Harvard Undergraduate Survey on Generative AI / arXiv (31 minute read)

Predictive Analytics & the Business of College Admissions / AI Tool Report (51:06 Spotify podcast at link)

Education and Tech Companies Make Commitment to Responsible AI Development / Ed Week (5 minute read)

Personalized AI Tutoring as a Social Activity: Paradox or Possibility? / Educause Review (9 minute read)

Three features of AI Tutor Pro are designed to support a variety of learner characteristics. One feature allows learners to select the language complexity of a session by choosing whether they want to interact using language on the elementary school, high school, undergraduate, or professional level.

Inside Barnard’s pyramid approach to AI literacy — The New York institution’s unusual take on artificial intelligence could serve as a blueprint for others grappling with implementation. / Inside Higher Ed (6 minute read)

📊 DATA & TECHNOLOGY

AI in software engineering at Google: Progress and the path ahead / Google Research (11 minute read)

We achieved the highest impact with UX that naturally blends into users’ workflows. In all the above examples, a suggestion is presented to the user, taking them to the next step in their workflow with one tab or click. Experiments requiring the user to remember to trigger the feature have failed to scale.

related (1), 20 Top AI Coding Tools and Assistants / Built In (15 minute read)

Pricing: Costs vary depending on business size, model capabilities, usage and other factors. IBM also offers a free trial of a “lite” version of its watsonx Code Assistant for Red Hat Ansible Light Speed product.

related (2), An entirely open-source AI code assistant inside your editor / Ollama Blog (5 minute read)

The Rise of AI Agent Infrastructure / Madrona (11 minute read)

Today, many agents are almost entirely vertically integrated, without much managed infrastructure. That means: self-managed cloud hosts for the agents, databases for memory and state, connectors to ingest context from external sources, and something called Function Calling, Tool Use, or Tool Calling to use external APIs.

related, Hi, AI: Our Thesis on AI Voice Agents / Andreessen Horowitz (11 minute read)

For some companies/approaches, the LLM or a series of LLMs handles the conversational flow and emotionality. In other cases, there are unique engines to add emotion, manage interruptions, etc. “Full stack” voice providers offer this all in one place.

How Bias Hides in ‘Kitchen Sink’ Approaches to Data — In risk modeling, AI researchers take a more-is-better approach to training data, but a new study argues that a less-is-more approach may be preferable. / Human-Centered Artificial Intelligence, Stanford University (5 minute read)

🎉 FUN and/or PRACTICAL THINGS

Hadana: Your AI travel planner

requires registration

Transferscope – Synthesized Reality

Created by Christopher Pietsch, Transferscope is a working prototype that merges a handheld apparatus with artificial intelligence, allowing the user to view our physical world through an AI interpreter.

fascinating use of AI in a real-world device

Ancestry.com using AI to make new enslavement docs searchable / Axios (4 minute read)

GigaBrain - Search Reddit and Other Communities for Answers from Real People

AI Text to Sound Effects Generator — Generate any sound imaginable from a text prompt / ElevenLabs

5 Steps To Use ChatGPT To Craft The Perfect Resume / Forbes (6 minute read)

Advancing personal health and wellness insights with AI / Google Research (12 minute read)

The Personal Health Large Language Model (PH-LLM) is a fine-tuned version of Gemini, designed to generate insights and recommendations to improve personal health behaviors related to sleep and fitness patterns. By using a multimodal encoder, PH-LLM is optimized for both textual understanding and reasoning as well as interpretation of raw time-series sensor data such as heart rate variability and respiratory rate from wearables.

Luma Dream Machine

amazing generative video
free to try; requires sign-in

🧿 AI-ADJACENT

NotebookLM — Note Taking & Research Assistant Powered by AI

⋄