Weekly Extract

The LLM week, compressed.

A focused weekly brief of the AI model, research, safety, and product updates worth reading. Built from LLMgram's canonical AI Signal pipeline, ranked for source quality, event relevance, and usefulness to builders.

10 signals selected
7d ranking window
Jun 07, 2026 · 23:15 UTC generated
fresh source AI Signal data
Jun 07, 2026 · 22:02 UTC source refreshed
Top 10 This Week
01
LessWrong · model · Jun 02, 2026

Claude Opus 4.8: Capabilities and Reactions

You need a lot of data points to understand a new model, and what you have. Trying to gauge from a few benchmarks is misleading. But if you have dozens of them, from a variety of sources, and you put them together with the model card tests…

02
OpenAI · model · Jun 03, 2026

How Wasmer used Codex to build a Node.js runtime for the edge

See how Wasmer used Codex with GPT-5.5 to build a Node.js runtime for the edge, accelerating development 10x to 20x and shipping in weeks instead of months.

03
Bindu Reddy (X) · model · Jun 07, 2026

🚨 Multi-Agent - Lite Agent Swarms - Optimize Cost On Large Agentic Loops After a lot of experimentation we have open-source AI agent swarms live!! - Opus 4.8 and GPT 5.5 do the planning - Deepseek flash and Gemma do the work - Perfect for…

04
TheSequence · model · Jun 03, 2026

The Sequence AI of the Week #871: Inside the Loop with Claude Opus 4.8

Might seem like a minor release. But it really isn't.

05
arXiv cs.CL · model · Jun 03, 2026

Linear Probes Detect Task Format, Not Reasoning Mode in Language Model Hidden States

arXiv:2606.02907v1. Abstract: Linear probing of large language model (LLM) hidden states is widely used to claim that models learn distinct representations for different reasoning types. We test this by probing Qwen3-14B…

06
arXiv cs.LG · model · Jun 03, 2026

Hallucination Is Linearly Decodable from Mid-Layer Hidden States in Quantized LLMs

arXiv:2606.02628v1. Abstract: We investigate whether open-source LLMs encode a linearly separable truthfulness signal in their hidden states, and at which network depth this signal is strongest. Across three 7B–8B in…

07
Gary Marcus · model · Jun 07, 2026

Blast from the past 3.5 years ago; some things have changed (esp. coding and math, via neurosymbolic techniques) but many haven’t: Gary Marcus (@GaryMarcus) Bottom line: From the outset Large Language Models like GPT-3 have been great at genera…

08
Simon Willison · model · Jun 02, 2026

datasette-agent-micropython 0.1a0

Release: datasette-agent-micropython 0.1a0. I want Datasette Agent to be able to generate and execute Python code safely. This alpha is looking promising so far. GPT-5.5 has so far failed to break out of the sandbox! Tags: python, sandboxi…

09
Ethan Mollick · model · Jun 06, 2026

The Gemini Pro models do not seem to be iterating anywhere near as quickly as Claude or GPT (last release was 3.1 Pro in February). It's causing a growing performance gap between Google and the other two labs, and the Gemini 3.5 Flash model…

10
AWS ML · model · Jun 04, 2026

NVIDIA Nemotron 3 Ultra now available on Amazon SageMaker JumpStart

Deploy NVIDIA Nemotron 3 Ultra on Amazon SageMaker JumpStart. Get 5x faster inference and 30% lower cost for agentic AI workloads with this frontier reasoning model.