Weekly Extract

The LLM week, compressed.

A focused weekly brief of the AI model, research, safety, and product updates worth reading. Built from LLMgram's canonical AI Signal pipeline, ranked for source quality, event relevance, and usefulness to builders. Click any item to open its full AI Signal card without leaving LLMgram.

10 signals selected
7d ranking window
Jun 06, 2026 · 23:15 UTC generated
fresh source · AI Signal data
Jun 06, 2026 · 22:02 UTC source refreshed
Top 10 This Week
01
LessWrong · model · Jun 02, 2026

Claude Opus 4.8: Capabilities and Reactions

You need a lot of data points to understand a new model, and what you have. Trying to gauge from a few benchmarks is misleading. But if you have dozens of them, from a variety of sources, and you put them together with the model card tests…

02
OpenAI · model · Jun 03, 2026

How Wasmer used Codex to build a Node.js runtime for the edge

See how Wasmer used Codex with GPT-5.5 to build a Node.js runtime for the edge, accelerating development 10x to 20x and shipping in weeks instead of months.

03
TheSequence · model · Jun 03, 2026

The Sequence AI of the Week #871: Inside the Loop with Claude Opus 4.8

Might seem like a minor release. But it really isn't.

04
arXiv cs.CL · model · Jun 03, 2026

Linear Probes Detect Task Format, Not Reasoning Mode in Language Model Hidden States

arXiv:2606.02907v1. Abstract: Linear probing of large language model (LLM) hidden states is widely used to claim that models learn distinct representations for different reasoning types. We test this by probing Qwen3-14B…

05
arXiv cs.LG · model · Jun 03, 2026

Hallucination Is Linearly Decodable from Mid-Layer Hidden States in Quantized LLMs

arXiv:2606.02628v1. Abstract: We investigate whether open-source LLMs encode a linearly separable truthfulness signal in their hidden states, and at which network depth this signal is strongest. Across three 7B–8B in…

06
Simon Willison · model · Jun 02, 2026

datasette-agent-micropython 0.1a0

Release: datasette-agent-micropython 0.1a0. I want Datasette Agent to be able to generate and execute Python code safely. This alpha is looking promising so far. GPT-5.5 has so far failed to break out of the sandbox! Tags: python, sandboxi…

07
Ethan Mollick · model · Jun 06, 2026

The Gemini Pro models do not seem to be iterating anywhere near as quickly as Claude or GPT (last release was 3.1 Pro in February). It's causing a growing performance gap between Google and the other two labs, and the Gemini 3.5 Flash model…

08
AWS ML · model · Jun 04, 2026

NVIDIA Nemotron 3 Ultra now available on Amazon SageMaker JumpStart

Deploy NVIDIA Nemotron 3 Ultra on Amazon SageMaker JumpStart. Get 5x faster inference and 30% lower cost for agentic AI workloads with this frontier reasoning model.

09
Bindu Reddy (X) · model · Jun 04, 2026

Mythos will be timed to launch with GPT 5.6 and Gemini 3.5. The only problem is that it costs $70 per 1M output tokens 😲

10
arXiv cs.AI · model · Jun 02, 2026

From "Weak" Signals to Strong Models: Preference Delta Aggregation with LoRA Merging

arXiv:2606.00357v1. Abstract: Training strong large language models (LLMs) requires high-quality supervision, which is often scarce. Recent work shows that paired preference data from weak-weaker model pairs (e.g., Qwen3…