blogs

Blog Roundup

Blog Roundup - March 13, 2026

Friday · March 13, 2026

34 Scanned

17 Headlines

Foundation Models & LLM Orchestration

00 Updates on context scaling, foundational model delays, and API pricing architectures impacting massive RAG and CDS deployments.

1M Context GA for Claude Opus 4.6 and Sonnet 4.6 simonwillison.net

Anthropic has moved its 1-million-token context window into General Availability for Opus 4.6 and Sonnet 4.6. Crucially, they are applying standard flat pricing across the entire window, avoiding the long-context premium taxes currently imposed by OpenAI (after 272k) and Gemini (after 200k), shifting the unit economics favorably for high-volume, long-document clinical RAG pipelines.

Meta Delays "Avocado" Foundational Model daringfireball.net

Meta has delayed its highly anticipated "Avocado" model to at least May 2026 due to unexpected performance gaps in reasoning and coding. While Avocado outperforms Llama 4 and Google's Gemini 2.5, it falls short of Gemini 3.0 and current OpenAI/Anthropic leaders, prompting Meta leadership to explore licensing Google’s Gemini to bridge the gap.

MedTech Infrastructure & Security

00 High-signal incidents affecting healthcare data systems, medical devices, and clinical workflows.

Massive Wiper Attack on Medtech Firm Stryker krebsonsecurity.com

The Iran-backed threat actor Void Manticore executed a devastating data-wiping attack on Medtech giant Stryker, wiping roughly 200,000 systems across 79 countries. By exploiting Microsoft Intune rather than deploying standard malware, attackers issued remote wipe commands that disrupted hospital surgical supplies and forced the disconnection of LifeNet EKG systems, highlighting severe vulnerabilities in managed healthcare IT infrastructure.

Engineering Internals & Model Evaluation

00 Deep-dives into LLM training stability, optimization techniques, and the limits of automated formal verification.

Learning Rate Interventions in GPT-2 Small Training gilesthomas.com

A detailed analysis of learning rate scheduling when training a GPT-2 small model from scratch using AdamW. The experiment underscores why fixed learning rates fail in complex architectures, emphasizing that Chinchilla-style cosine cycle schedules are necessary to navigate non-convex, V-shaped loss landscapes during late-stage convergence.

LLMs and Formal Specification Failures (TLA+/Alloy) buttondown.com/hillelwayne

While LLM adoption for formal specifications is rising, models persistently fail to capture subtle multi-step bad behavior, concurrency, and nondeterminism in TLA+ and Alloy. LLMs easily handle obvious invariants but struggle to generate compiling code for complex state machines, risking a false sense of verification for less experienced engineers.

AI Hardware & Market Economics

00 Compute bottlenecks, scaling limitations, and the emerging divide between AI capital expenditure and recurring SaaS revenue.

Deep Dive: Logic, Memory, and Power Bottlenecks dwarkesh.com

A breakdown of the physical and economic constraints on AI scaling, with the Big 4 hyperscalers hitting $600B in combined CapEx. Structural bottlenecks include High Bandwidth Memory (HBM) shortages, ASML lithography throughput, and TSMC N2/N3 allocation squeezes, driving inference rental costs to $10B–$13B per gigawatt and pushing companies like OpenAI toward non-standard energy providers.

The AI Revenue Gap (SaaSpocalypse) wheresyoured.at

Analysis pointing to a growing disparity between massive AI funding and actual high-margin software revenue. Notable metrics include Anthropic generating $5B in revenue against $60B in funding, and Adobe's AI-first revenue sitting at just 1% of total quarterly earnings, signaling potential friction for enterprise AI monetization.

Production Backend & Dev Tools

00 Frameworks, tooling, and database optimization patterns for async backends and AI agents.

Introduction to SQLAlchemy 2 In Practice miguelgrinberg.com

Miguel Grinberg is open-sourcing his comprehensive guide to SQLAlchemy 2.0 implementation patterns. This serves as an essential reference for engineers building highly optimized, async Python backends (like FastAPI) that require strict data modeling and ORM performance.

Forge: Unified Git Forge CLI nesbitt.io

A new Go-based tool that provides a unified, standardized interface wrapping the disparate APIs of GitHub, GitLab, and Bitbucket. By homogenizing commands for PRs, CI pipelines, and secrets, it creates a much cleaner environment for autonomous AI coding agents to operate across different repositories.

The AI Workforce & Paradigm Shifts

00 Impact of LLMs on software engineering culture, macroeconomics, and digital content saturation.

What do coders do after AI? anildash.com

Anil Dash examines the shift from idiomatic coding to AI-orchestrated 'software factories' where generation is 10,000x cheaper than human labor. The paradigm is shifting toward high-volume 'slop' iteration where code is rapidly generated, tested, and discarded, forcing senior engineers to transition from practitioners to conductors.

Changing my mind on UBI geohot.github.io

George Hotz outlines a contrarian stance supporting Universal Basic Income not as a safety net, but as an inflationary mechanism to 'burn down' the $210B growth in mandatory entitlement spending. He predicts this will force the productive engineering class to migrate to a parallel, scarce 'gold economy' to avoid subsidizing devalued fiat.

Hacker News AI Saturation Study lcamtuf.substack.com

Using the Pangram detection model, analysis reveals that AI-related or LLM-generated content dominates the top 5 slots of Hacker News daily. The study highlights the increasing prevalence of a 'quasi-deterministic' LLM voice in technical essays.

Quick Mentions

00 Brief technical notes, security updates, and historical retrospectives.

Btw: Software, turnkey, beheerd, as a service berthub.eu

The Dutch Tax Authority is outsourcing its core VAT processing to fully-managed US SaaS providers, raising major concerns over European critical infrastructure and financial data sovereignty.

Weekly Update 494 troyhunt.com

Have I Been Pwned logged a massive spike in credential exfiltration, ingesting data from five major breaches (including Odido and KomikoAI) in just 48 hours.

Tinyweb: The Web in 1000 Lines of C maurycyz.com

A fully functional, minimal web browser implementation in under 1,000 lines of C, featuring OpenSSL for HTTPS and a recursive HTML parser for formatting and link numbering.

MIPS Stack Probing in Windows devblogs.microsoft.com

A technical retrospective on how the MIPS chkstk implementation handles user vs. kernel mode stack pointer testing entirely differently than x86.

An odd font rendering bug in Firefox and Safari shkspr.mobi

Firefox and Safari fail to render combining characters (like ẹ́) correctly in fonts such as Helvetica Now Display at normal weights, while Chrome successfully smushes them.

← Older

Daily Digest Mar 13, 2026

Newer →

Daily Digest Mar 15, 2026