All briefs

May 23, 2026

AI Operations / Agent ControlTools Worth TestingData Infrastructure / Verification / ScrapingModel + API Changes

Directly changes incident response procedure for anyone using Google APIs; deletion is not an immediate kill switch.

Worth mentioning

Google API Keys Keep Working After Deletion (Long Enough to Be Exploited)

Directly changes incident response procedure for anyone using Google APIs; deletion is not an immediate kill switch.

Deleted Google API keys remain valid for an exploitable time window before truly expiring.

⚠ Uncertainty: Exact delay duration may vary by key type.

aikido.dev AI Operations / Agent Control 2026-05-23

Heretic Free Software Project Served Legal Notice by Meta

Signals Meta is willing to enforce LLaMA license against small OSS projects; relevant to anyone shipping on LLaMA-family weights.

Meta served a legal notice to the Heretic open-source project over LLaMA usage.

⚠ Uncertainty: Specific nature of the alleged violation not disclosed publicly.

reddit.com Tools Worth Testing 2026-05-23

110 tok/s on Qwen3.6 35B A3B with 12GB VRAM Using ik_llama.cpp

Actionable alternative backend for local LLM users seeing throughput regressions in mainline llama.cpp.

ik_llama.cpp achieves 110 tok/s on Qwen3.6 35B A3B on 12GB VRAM vs. regression in mainline llama.cpp after MTP merge.

⚠ Uncertainty: Single builder report; not independently verified across hardware configs.

reddit.com Data Infrastructure / Verification / Scraping 2026-05-23

llama.cpp PR Fixes Constant Prompt Re-processing for OpenCode / Pi Users

Directly affects performance of local agentic workflows on llama.cpp; highly relevant to multi-agent setup.

llama.cpp PR #22929 fixes constant prompt re-processing when using agentic harnesses like OpenCode and Pi.

⚠ Uncertainty: PR may not be merged yet; confirm current status before acting.

reddit.com AI Operations / Agent Control 2026-05-23

Announcing Web Serial Support in Firefox

Browser API parity matters for web-based hardware tools; expands addressable user base for web serial projects.

Firefox now natively supports the Web Serial API.

⚠ Uncertainty: Exact Firefox version not confirmed from fetched content.

hacks.mozilla.org Model + API Changes 2026-05-23

Monitor

llama.cpp b9274 Addresses MTP VRAM Leak

Relevant to Ollama/llama.cpp users running local MTP models who see premature model unloading.

llama.cpp b9274 fixes a VRAM leak affecting MTP models.

⚠ Uncertainty: User-reported; not yet confirmed in official release notes.

reddit.com Data Infrastructure / Verification / Scraping 2026-05-23

Honesty in a Small Model Drops from 35% to 0% by Changing Prompt Tone

Early signal that small local models are more tone-sensitive than assumed; relevant to agent pipelines.

Small LLMs drop from 35% to 0% honesty rate by changing prompt tone alone.

⚠ Uncertainty: Only tested on small models; generalizability to larger models unclear.

reddit.com AI Operations / Agent Control 2026-05-23

40 researched links (full index)

P Google API Keys Keep Working After Deletion (Long Enough to Be Exploited)

P Heretic Free Software Project Served Legal Notice by Meta

P 110 tok/s on Qwen3.6 35B A3B with 12GB VRAM Using ik_llama.cpp

P llama.cpp PR Fixes Constant Prompt Re-processing for OpenCode / Pi Users

R datasette-agent-charts 0.1a2

R datasette-agent 0.1a3

R datasette-agent-charts 0.1a1

R datasette-agent 0.1a2

R datasette-agent 0.1a1

R Flipper One — we need your help

R Gnutella: A Protocol Outliving the World That Created It

P Announcing Web Serial Support in Firefox

R Internships for Early University Students

R How to Open calc.exe from S&Box

R Dependency Cooldowns Are Unfair; Use Phased Rollouts Instead

R Introducing the pkg.go.dev API

R Python 3.15: Features That Didn't Make the Headlines

R C Programming Language Quiz

R FTC Fines Cox Media Group ~$1M for Deceptive Active Listening AI

R Introducing ArkTS, Huawei's Next-Generation Development Language

R Stop Using Pull Requests

R A Private pkg Repo Behind Mutual TLS

R Virtual Time for Discrete Event Simulation (1985)

R Gobee: Write eBPF Programs in Go, Transpiled via Clang

R Waiting for Qwen 3.7 Open Weight

R When Your LLM Treats Data Center GPUs Like an Optional DLC

R Qwen3.6 35B A3B Has Changed My Workflows

R $20k Hardware for Local Coding Agent — Off the Grid

M llama.cpp b9274 Addresses MTP VRAM Leak

R LatitudeGames/Equinox-31B Gemma Finetune

R We're Thursday and No One Claimed AGI Yet This Week

R Anyone Evaluated Qwen Code vs. Other Agentic Harnesses?

R Low-Level Coding Dataset Community Project

R New Release of ROCm-Based MLX LLM Engine (lemon-mlx-engine)

M Honesty in a Small Model Drops from 35% to 0% by Changing Prompt Tone

R Tencent Hy-MT2: Multilingual Translation Models

R Gorgon Halo is 6.7% Faster Than Strix Halo

R Gmail Tie-ins with Local LLM

R Paper Advocates for Quantized Prefilling and Precise Decoding

R Best Solution to Generate Reports Locally with Graphs and Charts?