Weekly Extract

The LLM week, compressed.

A focused weekly brief of the AI model, research, safety, and product updates worth reading. Built from LLMgram's canonical AI Signal pipeline, ranked for source quality, event relevance, and usefulness to builders. Click any item to open its full AI Signal card without leaving LLMgram.

10 signals selected
7d ranking window
Generated May 28, 2026 · 23:45 UTC
Fresh source: AI Signal data
Source refreshed May 28, 2026 · 22:02 UTC

Top 10 This Week
01
OpenAI · model · May 27, 2026

Warp’s big bet on building open source with GPT-5.5

Warp uses GPT-5.5 and other OpenAI models to coordinate coding agents across local, cloud, and open-source development workflows.

02
LessWrong · model · May 26, 2026

Are Mythos' Cyber Capabilities Overstated? - Yes and No

TL;DR: Anthropic restricted access to Claude Mythos Preview, citing a major leap in vulnerability discovery and exploitation capability. I review the 3 most common arguments from skeptics: (1) AISLE Security’s paper showing cheaper models…

03
AWS ML · model · May 28, 2026

Claude Opus 4.8 is now available on AWS

This post covers Opus 4.8's improvements and practical guidance for AI engineers integrating the model into agentic systems and production inference workloads on Amazon Bedrock.

04
arXiv cs.CL · model · May 25, 2026

Memorization Dynamics of Fill-in-the-Middle Pretraining

arXiv:2605.22981v1. Fill-in-the-middle (FIM) is a pretraining objective widely used to equip causal language models with infilling ability, yet its effect on verbatim memorization remains underexplored. We study…

05
Bindu Reddy (X) · model · May 28, 2026

🚨 Opus 4.8 Still Trails Behind GPT 5.5 And Is A Very Incremental Release. Opus 4.8 barely inches past 4.7 on benchmarks but lags behind GPT 5.5 considerably!! Anthropic may be stalling a bit given its last two releases. OpenAI has a huge…

06
Qwen (X) · model · May 28, 2026

📢Qwen3.7-Max just hit #3 on ITbench-AA — a fresh benchmark testing how well models handle real-world enterprise IT tasks, agentic-style. 🔧Agentic era, go with Qwen.🏃🏃 Artificial Analysis (@ArtificialAnlys) Artificial Analysis and IBM Resea…

07
Gary Marcus · model · May 22, 2026

I have to eat crow on this, in light of further information. whatever OpenAI spent on Erdos using a new model, apparently you can get GPT 5.5 to do something similar; @emollick’s presumably estimates more or less apply there (even if not…

08
Perplexity (X) · model · May 28, 2026

Claude Opus 4.8 is now available for Max subscribers on Perplexity and Computer.

09
Cursor (X) · model · May 28, 2026

Claude Opus 4.8 is now available in Cursor. On CursorBench, it's able to work much more efficiently than Opus 4.7. We've also found it to be more persistent on harder tasks.

10
swyx (X) · model · May 28, 2026

"Developers can update Claude’s instructions mid-task without breaking the prompt cache or routing the update through a user turn" wtf? how?? Claude (@claudeai) Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more…