Weekly Extract

The LLM week, compressed.

A focused weekly brief of the AI model, research, safety, and product updates worth reading. Built from LLMgram's canonical AI Signal pipeline, ranked for source quality, event relevance, and usefulness to builders. Click any item to open its full AI Signal card without leaving LLMgram.

10signals selected
7dranking window
Jun 03, 2026 · 23:15 UTCgenerated
fresh sourceAI Signal data
Jun 03, 2026 · 22:02 UTCsource refreshed
Top 10 This Week
01
LessWrong · model · Jun 02, 2026

Claude Opus 4.8: Capabilities and Reactions

You need a lot of data points to understand a new model, and what you have. Trying to gauge from a few benchmarks is misleading. But if you have dozens of them, from a variety of sources, and you put them together with the model card tests…

02
TheSequence · model · Jun 03, 2026

The Sequence AI of the Week #871: Inside the Loop with Claude Opus 4.8

Might seem like a minor release. But it really isn't.

03
arXiv cs.CL · model · Jun 03, 2026

Linear Probes Detect Task Format, Not Reasoning Mode in Language Model Hidden States

arXiv:2606.02907v1 Announce Type: new Abstract: Linear probing of large language model (LLM) hidden states is widely used to claim that models learn distinct representations for different reasoning types. We test this by probing Qwen3-14B…

04
arXiv cs.LG · model · Jun 03, 2026

Hallucination Is Linearly Decodable from Mid-Layer Hidden States in Quantized LLMs

arXiv:2606.02628v1 Announce Type: new Abstract: We investigate whether open-source LLMs encode a linearly separable truthfulness signal in their hidden states, and at which network depth this signal is strongest. Across three $7$B--$8$B in…

05
Google AI · model · May 29, 2026

9 demos of Gemini Omni and Gemini 3.5 in action

Watch 9 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026.

06
Simon Willison · model · Jun 02, 2026

datasette-agent-micropython 0.1a0

Release: datasette-agent-micropython 0.1a0 I want Datasette Agent to be able to generate and execute Python code safely. This alpha is looking promising so far. GPT-5.5 has so far failed to break out of the sandbox! Tags: python , sandboxi…

07
arXiv cs.AI · model · Jun 02, 2026

From "Weak" Signals to Strong Models: Preference Delta Aggregation with LoRA Merging

arXiv:2606.00357v1 Announce Type: new Abstract: Training strong large language models (LLMs) requires high-quality supervision, which is often scarce. Recent work shows that paired preference data from weak-weaker model pairs (e.g., Qwen3…

08
Bindu Reddy (X) · model · Jun 03, 2026

Open Source AI has gone up by 500% in 2 months - Gemma 4 12B just dropped - Deepseek Flash is used in production workloads - Kimi 2.6 for coding - Minimax in a…

Open Source AI has gone up by 500% in 2 months - Gemma 4 12B just dropped - Deepseek Flash is used in production workloads - Kimi 2.6 for coding - Minimax in always-on agents Bigger growth rate than Anthropic 😉 🚀

09
AWS ML · model · Jun 01, 2026

OpenAI models and Codex on Amazon Bedrock are now generally available

GPT-5.5, GPT-5.4, and Codex are now generally available on Amazon Bedrock. Deploy them in production applications and agents today, on Bedrock’s high performance inference engine.

10
AI News · model · May 29, 2026

Anthropic releases Claude Opus 4.8

Anthropic has released Claude Opus 4.8, an upgrade to Claude Opus 4.7 that the company says brings improved results for coding, agent work, reasoning, and knowledge work. The platform can be used through claude.ai, Claude Code and the Clau…