weekend ai reads for 2024-03-01
📰 ABOVE THE FOLD: SAFETY
Accelerating Progress Toward Trustworthy AI / Mozilla Foundation
Microsoft’s AI Access Principles: Our commitments to promote innovation and competition in the new AI economy / Microsoft Blog
Rethinking Privacy in the AI Era: Policy Provocations for a Data-Centric World [PDF] / Stanford Institute for Human-Centered Artificial Intelligence
With elections looming worldwide, here’s how to identify and investigate AI audio deepfakes / Nieman Journalism Lab
She adds: “Analysis is more complex, and voice generation tools are more advanced than video generation tools. Even with voice samples and spectral analysis skills, it takes time, and there is no guarantee that the result will be accurate. In addition, there are many opportunities to fake audio without resorting to deepfake technology.”
PyRIT can generate thousands of malicious prompts to test a gen AI model, and even score its response. “For instance, in one of our red teaming exercises on a Copilot system, we were able to pick a harm category, generate several thousand malicious prompts, and use PyRIT’s scoring engine to evaluate the output from the Copilot system all in the matter of hours instead of weeks,” said Microsoft in the release.
📻 QUOTE OF THE WEEK
In fact, I would say that Nvidia’s business today is probably, if I were to guess, 40 percent inference, 60 percent training. The reason why that’s a good thing is because that’s when you realize AI is finally making it.
Jensen Huang, CEO, Nvidia (source)
🏗️ FOUNDATIONS & CULTURE
AI Is Like Water / NFX Blog
The formula your AI company actually needs has these multipliers: (Data + Model) x UX x (Distribution + Perceived Value to Customers) = Your new AI MVP
It is basically impossible to differentiate yourself now when it comes to data and model. Sure, there will be some sources of unstructured data that may give you an edge for a little while. But ultimately, data isn’t enough on its own. As for models, most will be interchangeable.
AI Tools Like Google Gemini Are Tailor-Made for Culture War — Google’s new image generator is yet another half-baked AI tool designed to provoke controversy. / New York Magazine
To explore these opportunities, I built a gpt wrapper using FastAPI and Postmark that executes different scripts depending on the inbound email address. You can think of this as a personal platform for email-based custom GPTs – though I call each app a "HaiHai."
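For readers curious what "different scripts depending on the inbound email address" might look like, here is a minimal sketch of the routing idea only. It is not the author's code: it leaves out FastAPI, Postmark, and the model call entirely, and the addresses, handler names, and `route_inbound` function are all hypothetical.

```python
from typing import Callable, Dict


def summarize(body: str) -> str:
    # Hypothetical app: a real version would send the body to a model.
    return f"[summary requested] {body[:40]}"


def translate(body: str) -> str:
    # Hypothetical app: a real version would send the body to a model.
    return f"[translation requested] {body[:40]}"


# Hypothetical mapping from inbound address to the script that handles it.
HANDLERS: Dict[str, Callable[[str], str]] = {
    "summarize@example.com": summarize,
    "translate@example.com": translate,
}


def route_inbound(to_address: str, body: str) -> str:
    """Pick a handler based on the address the email was sent to."""
    handler = HANDLERS.get(to_address.strip().lower())
    if handler is None:
        return "No app is registered for this address."
    return handler(body)


reply = route_inbound("Summarize@example.com", "Please condense this thread.")
print(reply)
```

In a real service, the inbound-email webhook would call something like `route_inbound` with the parsed recipient and message body, then email the reply back.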
AI race: Businesses still at the starting line / Australian Financial Review
Yearsely says she thinks this future will mean having AI trying to tackle business challenges through a lens that takes in more than just productivity and customer service, and will give organisations an opportunity to play a much more intrinsic and helpful role in their customers’ lives.
Tyler Perry Raises Alarm on AI, Puts $800M Studio Expansion on Hold — The actor, filmmaker and studio owner is raising the alarm about the impact of the tech, saying, “I feel like everybody in the industry is running a hundred miles an hour to try and catch up, to try and put in guardrails.” / Hollywood Reporter
related, When A.I. Can Make a Movie, What Does “Video” Even Mean? — Sora, the new text-to-video system from OpenAI, doesn’t make recordings—it renders ideas. / The New Yorker
Does Offering ChatGPT a Tip Cause it to Generate Better Text? An Analysis / Max Woolf's Blog
Looking at the results of both experiments, my analysis on whether tips (and/or threats) have an impact on LLM generation quality is currently inconclusive.
NonprofitAMA — Ask Me Anything About Nonprofits: Leadership, Management, Boards, Governance
NonprofitAMA is an AI chatbot built and trained on curated, high quality resources covering everything you want to know about nonprofits: leadership and management, governance, fundraising, marketing, incorporation, fiscal sponsorship, organizational design, strategy, advocacy, DEI, policies, etc.
We’re already using AI more than we realize (6:31) / Vox, YouTube
🎓 EDUCATION
However, one takeaway from this is that graphic design + web design jobs are still in demand, and not being replaced by AI tools yet.
Again, I think this is because tools like DALL-E and Midjourney require some knowledge and creativity.
Project FTK — Educators, Build Your Own AI
We're a couple of Microsoft developers that do this in our free time. Our journey began as volunteers, teaching courses at Purdue Polytechnic High School. Through this experience, we recognized the immense effort required to create lessons and the scarcity of easily accessible resources online.
Our inspiration? The open-source spirit prevalent in software development. We are driven to bring the same ethos to education, making it effortless for teachers to discover and create content that perfectly complements their teaching style and classroom needs.
AI Will Shake Up Higher Ed. Are Colleges Ready? / The Chronicle of Higher Education
Three presentations from Lance Eaton
Enhancing Tutoring With AI (Google Slides)
Exploring AI in Instructional Design (Google Slides)
Empowering Career Services with AI (Google Slides)
CourseFactory AI — Create Your Online Course with AI Assistance
The AI Influencers Selling Students Learning Shortcuts / Rhetorica, Substack
📊 DATA & TECHNOLOGY
Why Doesn’t My Model Work? / The Gradient
Take hidden variables. These are features that are present in data, and which happen to be predictive of class labels within the data, but which are not directly related to them. If your model latches on to these during training, it will appear to work well, but may not work on new data.
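A toy illustration of the hidden-variable problem described above, using only the standard library. Everything here is an assumption made for the sketch: the synthetic data, the 70% and 95% agreement rates, and the one-feature "model" that simply picks whichever feature best matches the training labels. It is not taken from the article.

```python
import random


def make_data(n, leak, rng):
    """Rows of (real_feature, hidden_feature, label), all 0 or 1."""
    rows = []
    for _ in range(n):
        label = rng.randint(0, 1)
        # Genuine signal: agrees with the label 70% of the time.
        real = label if rng.random() < 0.7 else 1 - label
        # Hidden variable: agrees with the label with probability `leak`.
        hidden = label if rng.random() < leak else 1 - label
        rows.append((real, hidden, label))
    return rows


def accuracy(rows, feature_index):
    """Accuracy of predicting the label directly from one feature."""
    return sum(r[feature_index] == r[2] for r in rows) / len(rows)


def fit(rows):
    """Toy 'model': keep the single feature that best matches the labels."""
    return max((0, 1), key=lambda i: accuracy(rows, i))


rng = random.Random(0)
train = make_data(2000, leak=0.95, rng=rng)  # hidden variable tracks the label
test = make_data(2000, leak=0.50, rng=rng)   # in new data it is pure noise

chosen = fit(train)
train_acc = accuracy(train, chosen)
test_acc = accuracy(test, chosen)
print(f"chosen feature: {chosen}, train: {train_acc:.2f}, test: {test_acc:.2f}")
```

The model latches onto the hidden feature because it looks best in training, then drops to roughly coin-flip accuracy once that accidental correlation disappears.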
ChatGPT for Data Analytics: Beginner Tutorial (1:07:09) / Luke Barousse, YouTube
How many news websites block AI crawlers? / Reuters Institute for the Study of Journalism
Examining the 15 most widely used online news sources in ten countries, we find that by the end of 2023, 48% of top news websites across ten countries were blocking OpenAI’s crawlers. Around half as many (24%) were blocking Google’s AI crawler.
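One common way a site blocks a crawler is a `robots.txt` rule, so a rough sketch of checking for that is below. It assumes the user-agent tokens `GPTBot` (OpenAI) and `Google-Extended` (Google), which are not named in the excerpt above, and it parses a made-up sample file rather than fetching a real site. It is not the study's methodology.

```python
from urllib.robotparser import RobotFileParser

# Made-up robots.txt for illustration: blocks GPTBot, allows everyone else.
SAMPLE_ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def blocked_agents(robots_txt, agents, url="https://example.com/"):
    """Return the agents that this robots.txt forbids from fetching `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [agent for agent in agents if not parser.can_fetch(agent, url)]


blocked = blocked_agents(SAMPLE_ROBOTS_TXT, ["GPTBot", "Google-Extended"])
print(blocked)
```

To check a live site, `RobotFileParser` can also be pointed at a URL with `set_url(...)` followed by `read()`. Note that `robots.txt` is a request to crawlers, not an enforcement mechanism.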
My benchmark for large language models / Nicholas Carlini
Existing benchmarks tend to focus on solving typical problems that might be assigned to a student as homework. But the types of questions that are assigned to students are different from the types of questions I want to ask a language model to solve for me.
DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. / Data Dreamer, GitHub
DataDreamer is a powerful open-source Python library for prompting, synthetic data generation, and training workflows. It is designed to be simple, extremely efficient, and research-grade.
The Aravind Srinivas Interview: How Perplexity Is Revolutionizing The Future Of Search (56:47) / Aarthi and Sriram, YouTube
🎉 FUN and/or PRACTICAL THINGS
explore any subject through a hierarchy
model generates images as you type the prompt
usual flaws (faces, hands) but amazing speed
the paper: SDXL-Lightning: Progressive Adversarial Diffusion Distillation
Think smarter with AI — A thoughtfully composed collection of GPTs, built to empower your thinking, enhance your logic, and make better decisions. / Tonki Labs
Pogi — Interact with your virtual pet.
play with, feed, and chat with it
Book Pecker — 14509 books summarized in 5 bullet points.
unrelated, Read Swiftly - AI Powered Book Recommendations
🧿 AI-ADJACENT
A new estimate from researchers at Harvard and the University of Toronto finds that the worldwide value of all open-source software is about $8.8 trillion. To place that in perspective, about $3.4 trillion was spent on commercial software globally in 2020.
⋄