Weekly Extract

The LLM week, compressed.

A focused weekly brief of the AI model, research, safety, and product updates worth reading. Built from LLMgram's canonical AI Signal pipeline, ranked for source quality, event relevance, and usefulness to builders.

Signals selected: 10
Ranking window: 7d
Generated: May 21, 2026 · 23:45 UTC
AI Signal data: fresh source
Source refreshed: May 21, 2026 · 22:30 UTC
Top 10 This Week
01
Simon Willison · model · May 19, 2026

llm-gemini 0.32

Release: llm-gemini 0.32. New model gemini-3.5-flash for Gemini 3.5 Flash. See also my notes on Gemini 3.5 Flash, and the pelican I drew using this upgrade to the plugin. Tags: llm, gemini

02
Google AI · model · May 19, 2026

Gemini 3.5: frontier intelligence with action

At Google I/O we released Gemini 3.5, our latest series of models combining frontier intelligence with action.

03
OpenAI · model · May 20, 2026

How Ramp engineers accelerate code review with Codex

How Ramp engineers use Codex with GPT-5.5 to review code and ship improvements, allowing them to get substantive feedback in minutes instead of hours.

04
arXiv cs.CL · model · May 21, 2026

Under Pressure: Emotional Framing Induces Measurable Behavioral Shifts and Structured Internal Geometry in Small Language Models

arXiv:2605.20202v1. Abstract: I study whether emotionally framed evaluation follow-ups change both the behavior and the calm-relative internal representations of small, locally deployed language models. Our main benchmark…

05
arXiv cs.AI · model · May 21, 2026

Evaluating the Utility of Personal Health Records in Personalized Health AI

arXiv:2605.18937v1. Abstract: Patient-managed Personal Health Records (PHRs) promise to empower patients to better understand their health, but information in the record is complex, potentially hindering insights. In thi…

06
arXiv cs.LG · model · May 20, 2026

Compositional Literary Primitives in Instruction-Tuned LLMs: Cross-Architectural SAE Features for Self, Style, and Affect

arXiv:2605.18808v1. Abstract: We characterize a compositional architecture of literary primitives in two instruction-tuned large language models (Llama 3.1 8B-Instruct and Gemma 2 9B-IT) via sparse autoencoders on mid-dep…

07
Ethan Mollick · model · May 19, 2026

Also had some early access to Gemini 3.5 Flash. Very fast for a flash model and very capable, though not as powerful as a full frontier model. I added it to the gallery of procedurally generated one-shot towns (it made one error that it co…

08
LessWrong · model · May 16, 2026

Trying to use NLAs to find out how Qwen 2.5 7B does multiplication

Neural language autoencoders were just introduced by Anthropic. In a fascinating paper, they showed that you can take the residual stream activations of a language model and then train two instantiations of that same model (an encoder and…

09
Bindu Reddy (X) · model · May 20, 2026

TBH, Kimi 2.6 beats Gemini Flash 3.6. Plus, it is 10x cheaper. So, yes, open source is still winning.

10
Jeff Dean (X) · model · May 19, 2026

Highly capable models that are fast are super important. Our new Gemini 3.5 Flash model is a great mix of fast and capable.

Sundar Pichai (@sundarpichai): Just off stage at #GoogleIO, some highlights from this morning 🧵 Gemini 3.5 Flash is…