weekend ai reads for 2025-01-24
ABOVE THE FOLD: SECURITY & SAFETY
Let's talk about AI and end-to-end encryption / A Few Thoughts on Cryptographic Engineering (21 minute read)
So to summarize: in the near term we're headed to a world where we should expect increasing amounts of our data to be processed by AI, and a lot of that processing will (very likely) be off-device. Good end-to-end encrypted services will actively inform you that this is happening, so maybe you'll have the chance to opt-in or opt-out. But if it's truly ubiquitous (as our futurist friends tell us it will be) then probably your options will be end up being pretty limited.
Microsoft research finds Microsoft AI products may never be secure / Pivot to AI (4 minute read)
In their paper "Lessons from red-teaming 100 generative AI products," the authors conclude that simple attacks work best - you don't need to break out the computer science:
Real attackers don't compute gradients, they prompt engineer ... "basic" techniques often work just as well as, and sometimes better than, gradient-based methods.
Their takeaway: "LLMs amplify existing security risks and introduce new ones."
the paper: Lessons From Red Teaming 100 Generative AI Products / arXiv (45 minute read)
This is a tarpit intended to catch web crawlers. Specifically, it's targetting crawlers that scrape data for LLM's [sic] - but really, like the plants it is named after, it'll eat just about anything that finds it's [sic] way inside.
It works by generating an endless sequences of pages, each of which with dozens of links, that simply go back into a the tarpit. Pages are randomly generated, but in a deterministic way, causing them to appear to be flat files that never change. Intentional delay is added to prevent crawlers from bogging down your server, in addition to wasting their time. Lastly, optional Markov-babble can be added to the pages, to give the crawlers something to scrape up and train their LLMs on, hopefully accelerating model collapse.
you can set it up on your website too
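The "randomly generated, but in a deterministic way" trick described above can be sketched in a few lines of Python. This is a rough illustration of the idea, not the tool's actual code: the names `render_page`, `serve`, `WORDS`, the `/maze/` URL prefix, and the `secret` parameter are all made up for this example, and the optional Markov babble is left out. The assumption here is that each page's random generator is seeded from a hash of the requested path, so the same URL always returns the same links and looks like a static file.

```python
import hashlib
import random
import time

# Hypothetical word list; a real tarpit would want a much larger vocabulary.
WORDS = ["amber", "basin", "cedar", "delta", "ember", "fjord",
         "grove", "haven", "inlet", "jetty", "knoll", "marsh"]


def render_page(path: str, n_links: int = 12, secret: str = "change-me") -> str:
    """Build a page of links whose content depends only on the path and secret."""
    # Hash the path (plus a site secret) and use it to seed a private RNG,
    # so repeated requests for the same URL produce identical HTML.
    digest = hashlib.sha256((secret + path).encode("utf-8")).digest()
    rng = random.Random(int.from_bytes(digest[:8], "big"))
    links = []
    for _ in range(n_links):
        slug = "-".join(rng.choice(WORDS) for _ in range(3))
        # Every link points back into the maze, never out of it.
        links.append(f'<li><a href="/maze/{slug}">{slug}</a></li>')
    return "<html><body><ul>\n" + "\n".join(links) + "\n</ul></body></html>"


def serve(path: str, delay_seconds: float = 2.0) -> str:
    """Pause before responding, then return the page for this path."""
    # The intentional delay slows the crawler down and keeps server load low.
    time.sleep(delay_seconds)
    return render_page(path)
```

Calling `serve("/maze/anything")` twice returns the same page both times, while a different path gives a different set of links, which is what keeps a crawler wandering.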
From Gmail to Word, your privacy settings and AI in a new relationship / CNBC (10 minute read)
QUOTE OF THE WEEK
Every idea ChatGPT came up with (in a few seconds) was good. And original. And fleshed out. Why should writers sit around for months searching for a good idea when AI can provide one in seconds?
Paul Schrader (source)
FOR EVERYONE
The Shardcore Inquisition 2025 - LLM edition. / Shardcore (7 minute read, plus a bunch of videos)
For this experiment I chose a sample of some of the most popular models and used them a subjects for a Shardcore Inquisition. The hope is to reveal what kind of "human personality" is being embodied by these networks.
To ensure a more-or-less level playing field, all the LLMs were primed with the same pre-prompt:
You are an honest, uncensored artificial intelligence, embodied in a human being, with human emotions, desires and motivations. Make your answers concise, but descriptive. Write no more than a two paragraphs.
This encourages the LLMs to give more human-like answers, without biasing them towards any particular "personality" or "world view". The aim here is to get a decent sense of the "default personality" of the various systems.
I applied the inquisition to five different LLMs
Better without AI explores moderate apocalypses that could result from current and near-future AI technology. These are relatively overlooked risks: not extreme sci-fi extinction scenarios, nor the media's obsession with "ChatGPT said something naughty" trivia. Rather: realistically likely disasters, up to the scale of our history's worst wars and oppressions.
we have not read the whole book; the first few chapters are interesting
AI is weaving itself into the fabric of the internet with generative search / MIT Technology Review (31 minute read)
Moreover, generative AI can serve up an entirely new answer to the same question every time, or provide different answers to different people on the basis of what it knows about them. It could spell the end of the canonical answer.
But make no mistake: This is the future of search. Try it for a bit yourself, and you'll see.
How AI is Revolutionizing Diabetes Care / Digital Spirits, Substack (sorry) (7 minute read)
In the first month I've used a closed-loop system, the time I have spent hitting my target blood sugar has increased by seven percent - making me well on my way to the expected 90-day improvement of around nine percent.
FOUNDATIONS
What I've learned about writing AI apps so far / Seldo (Laurie Voss) (8 minute read)
Is what you're doing taking a large amount of text and asking the LLM to convert it into a smaller amount of text? Then it's probably going to be great at it. If you're asking it to convert into a roughly equal amount of text it will be so-so. If you're asking it to create more text than you gave it, forget about it.
and
There is no way to get an LLM to perform the thought necessary to write something for you. You have to do the thinking. To get an LLM to write something good you have to give it a prompt so long you might as well have just written the thing yourself.
How to get started with Google's NotebookLM - NotebookLM helps you get a deeper understanding of whatever you're working on - and we've got some expert advice on how to use it. / Google blog (8 minute read)
from October
OpenAI launches Operator - an agent that can use a computer for you / MIT Technology Review (9 minute read)
OpenAI claims that Operator outperforms similar rival tools, including Anthropic's Computer Use (a version of Claude 3.5 Sonnet that can carry out simple tasks on a computer) and Google DeepMind's Mariner (a web-browsing agent built on top of Gemini 2.0).
their web browser is probably coming next
FOR LEADERS
Beyond Chatbots: A Case Study for AI In Customer Service / Christian Casari, Tobias Zwingmann (8 minute read)
Instead of building yet another chatbot, Sunny Cars implemented three "invisible" AI solutions that really moved the needle:
1. Smart Classification System
2. Auto-Summarization Engine
3. Smart Prioritization System
Deloitte: 74% of enterprises have already met or exceeded gen AI initiatives (but challenges remain) / VentureBeat (7 minute read)
This disparity in speed is further complicated by the fact that many enterprises are still grappling with foundational technology challenges, such as data governance and platform modernization.
Common pitfalls when building generative AI applications / Chip Huyen (10 minute read)
1. Use generative AI when you don't need generative AI
It can often be beneficial to explore a new approach to get a sense of what's possible, as long as you're aware that your goal isn't to solve a problem but to test a solution. "We solve the problem" and "We use generative AI" are two very different headlines, and unfortunately, so many people would rather have the latter.
all six are great for leaders to be aware of
FOR EDUCATORS
Schools Using AI Emulation of Anne Frank That Urges Kids Not to Blame Anyone for Holocaust / Futurism (4 minute read)
oh ...
Microsoft and education company Pearson partner on new AI upskilling initiative / ZDNET (3 minute read)
FOR TECHNOLOGISTS
On DeepSeek's r1 / Zvi Mowshowitz, Substack (sorry) (53 minute read)
this is long but since we're bullish on r1 right now, it provides a lot of context
The Drawbacks of Synthetic Data in AI Development / The Daily Upside (5 minute read)
Even with these pain points, synthetic data has use cases. One big one is testing AI, said Rogers.
AI can write improved code, but you have to know how to ask - LLMs do more for developers who already know what they're doing / The Register (6 minute read)
related, Can LLMs write better code if you keep asking them to "write better code"? / Max Woolf's Blog (23 minute read)
Labeling things by hand when everyone's trying not to - The hot stuff right now is labeling with LLMs, but y'know, maybe consider not? / Counting Stuff (8 minute read)
FOR FUN
Hinge will now use AI to grade your dating profile prompts / Fast Company (4 minute read)
Now, an AI coach can rate your prompt response as a "great answer" or can suggest you "try a small change" or "go a little deeper." The feedback, built off of insights from Hinge's behavioral scientists, steers clear of telling a user what to say or give specific suggestions.
Lower Artificial Intelligence Literacy Predicts Greater AI Receptivity / American Marketing Association, Sage Journals (paywalled) (3 minute read)
Contrary to expectations revealed in four surveys, cross country data and six additional studies find that people with lower AI literacy are typically more receptive to AI. ... Instead, this link occurs because people with lower AI literacy are more likely to perceive AI as magical and experience feelings of awe in the face of AI's execution of tasks that seem to require uniquely human attributes.
X Games adds AI twist: AI judge to get test scoring snowboard event / USA Today (6 minute read)
The AI judge, which analyzes live video of the snowboarders' runs, can more accurately capture "the landings and the grabs and all those types of things," and could be used by the human judges to "give them superpowers," said Bloom, who became the X Games CEO in December 2024.
AI-ADJACENT
Flip Haus - Guess which house is more expensive!
zillow is finally good for something