weekend ai reads for 2024-06-21

📰 ABOVE THE FOLD: AGENTS

AI Agents and Education: Simulated Practice at Scale / Lilach Mollick, Ethan Mollick, et al, Social Science Research Network (25 minute read)

Building AI Agents: Lessons Learned over the past Year / Patrick Dougherty, Medium (13 minute read)

Focus on giving the agent context and letting it “think”, instead of hoping it gets the answer right in one try.

More interestingly, the knowledge the doctor agents have acquired in Agent Hospital is applicable to real-world medicare benchmarks. After treating around ten thousand patients (real-world doctors may take over two years), the evolved doctor agent achieves a state-of-the-art accuracy of 93.06% on a subset of the MedQA dataset that covers major respiratory diseases. This work paves the way for advancing the applications of LLM-powered agent techniques in medical scenarios.

Building open source LLM agents with Llama 3 / Langchain, YouTube (17 minute video)

Build reliable AI agents with monitoring, testing, and replay analytics. No more black boxes and prompt guessing.

  • ample free tier for testing

 

📻 QUOTE OF THE WEEK

The fact that generative AI is advancing and evolving so quickly is why it’s important to be an active participant in defining the future for it.

Kyle Bowen, Deputy Chief Information Officer, Arizona State University (source)

 

🏗️ FOUNDATIONS & CULTURE

“People are using AI tools at work as they feel overwhelmed and under duress at work, and they want some relief,” Stallbaumer says, “It’s still early! Companies and people are on the journey…(but) there's a lot of light bulbs going on.”

She notes the recent Microsoft Work Trend Index found three quarters (75%) of respondents are now using AI at work in some way - double the number from six months ago, with the technology helping boost creativity and freeing up time to focus on crucial tasks.

Vinod Khosla on What to Build in AI — The prolific AI investor joins the gang to discuss OpenAI, Apple and the AI deals he is—and isn't—doing and why. Plus, what's in Vinod's ChatGPT history. / More or Less Podcast, Apple Podcasts (58 minute audio)

My Thoughts on Apple Intelligence — Leveling the Stakes & Betraying the Essence / Ignacio de Gregorio, Read Medium (11 minute read)

The Who’s Who in Responsible AI / MMC (13 minute read)

However, conversations with enterprise customers have convinced us that the demand for responsible deployments is real. We believe that Responsible AI is rapidly turning into an area of utmost importance, and we’re excited about investment opportunities in this area.

The popularisation of artificial intelligence (AI) has given rise to imaginaries that invite alienation and mystification. At a time when these technologies seem to be consolidating, it is pertinent to map their connections with human activities and more than human territories.

 

🎓 EDUCATION

Is AI Disrupting Higher Ed? / The Prof G Pod with Scott Galloway, Apple Podcast (21 minute audio, but the education part is about half that)

Scott speaks about ChatGPT Edu, specifically how it will affect higher education and the edtech industry.

Inncivio — An AI-Powered Education Platform for Businesses

 

📊 DATA & TECHNOLOGY

The Future of AI is Vertical / Euclid Insights, Substack (sorry) (22 minute read)

While Human-Machine Interface (HMI) traditionally referred to the panels and dashboards engineers use to control a piece of equipment or a factory-floor robot, the HMI we envision here will come in the form of an application software layer with a UI / UX tailored to the use-case.

It saddens me that “looking cool” seems to have become preferable to “useful and usable” in the minds of application and operating system developers.

  • related (2), Building AI products — How do we build mass-market products that change the world around a technology that gets things ‘wrong’? What does wrong mean, and how is that useful? / Benedict Evans (8 minute read)

AI and the “Why Now” of Data DAOs / Li Jin, Substack (sorry) (10 minute read)

Any individual data point might be negligible in value to a model’s performance, but collectively, a large group of users can aggregate novel data sets that are valuable for AI training. This is where the idea of data DAOs can fit in. With data DAOs, data contributors could see economic upside from contributing data as well as govern how that data is used and monetized.

What is LoRA?: A Visual Guide to Low-Rank Approximation for Fine-Tuning LLMs EfficientlyWhy LoRA Is Essential For Model Fine-Tuning / The Code Compass, Substack (sorry) (14 minute read)

Cost Of Self Hosting Llama-3 8B-Instruct / Lytix Blog (6 minute read)

You have a fixed cost of ~$100 a month, which results in a ‘profit’ of $57. To make up your initial server cost of $3,800, you’d need about 66 months or 5.5 years to benefit from this approach.

Our position is that open-endedness is a property of any ASI, and that foundation models provide the missing ingredient required for domain-general open-endedness. Further, we believe that there may be only a few remaining steps required to achieve open-endedness with foundation models.

 

🎉 FUN and/or PRACTICAL THINGS

Lil AI Gen - Drinking Gasoline (Official Video) / Natural Hours, YouTube (2 minute video)

  • just the best

  • also: will cause nightmares. seriously.

Anyone can create an AI persona, called a Butterfly, in minutes on the app. After that, the Butterfly automatically creates posts on the social network that other AIs and humans can then interact with.

  • feels like being on social media with your Tamagotchi

  • the embedded trump impregnating biden video has swear words (never thought we’d write that phrase)

How to Use ChatGPT to Find the Job of Your Dreams — AI can do more than help you write a cover letter and resume -- it can also strategize with you on how to find your ideal job. / Cnet (6 minute read)

  • seemingly practical advice

Dot — Need the power of language models but don't want to send away your documents and data?

  • works as advertised but is in beta so requires some patience

 

🧿 AI-ADJACENT

If different fonts were outfits / Wisdom Kaye, Twitter (sorry) (1 minute video)