Skip to content

Founding Editor

Aditya Marin Gasga

Aditya covers the whole AI surface area for Signal — frontier models, agent infrastructure, the economics of inference, and the policy decisions that quietly shape what everyone else can build. He writes for operators who need a calibrated view of what's actually shipping versus what's keynote theatre.

By Aditya Marin Gasga

News

Anthropic ships Sonnet 4.7 with native code execution

Sonnet 4.7 shipped this morning with Python execution baked directly into the API — no separate sandbox, no harness wiring, one new boolean parameter. Same model pricing as 4.6 plus a per-execution surcharge.

Aditya Marin Gasga · 5 min read
Models

Your stated cache hit rate is probably lying to you

Your dashboard says 90% cache hits. Your bill says otherwise. The gap is almost always three specific patterns inside the system prompt that don't show up in any log — and you can fix them in an afternoon.

Aditya Marin Gasga · 8 min read
Explainers

What is an embedding, really?

A from-scratch explainer of what embeddings actually are, how they're compared, why they make modern search possible, and which model to pick in 2026 — in about 15 minutes.

Aditya Marin Gasga · 9 min read
Tools

The 7 AI agents worth testing in 2026 — compared

Seven AI agents tested on the same task suite — Manus, Devin, Operator, GPT Agent Mode, Gemini Workspace, Replit, Claude Computer Use. Two are genuinely useful today; the rest need a specific use case.

Aditya Marin Gasga · 8 min read
News

OpenAI's trillion-dollar IPO, explained

OpenAI filed confidentially for what could be the largest tech IPO ever — at a valuation near $1 trillion, while losing more than a dollar for every dollar it earns. Here's what's actually going on.

Aditya Marin Gasga · 4 min read
Models

Grok 4.3: xAI's bet on cheap and fast over best

Grok 4.3 isn't trying to top the leaderboards. It's trying to be the model you can afford to run at scale — and on that bet, it mostly delivers. Here's where it fits and where it doesn't.

Aditya Marin Gasga · 4 min read
Models

Your Opus 4.8 cache misses are self-inflicted

A 70% versus 95% cache hit rate isn't a model difference — it's whether you treat the system prompt as immutable code. Standardize the prefix, push every dynamic value into the user turn, and the misses disappear.

Aditya Marin Gasga · 5 min read
Models

Why Opus 4.8 changes the cost-per-token math

Opus 4.8 lists at $5/MTok in and $25 out — a third of the prior Opus generation — with cache hits at $0.50. Here's what that does to the real per-request math for production teams.

Aditya Marin Gasga · 4 min read
Explainers

What is an LLM, really?

A from-scratch explainer of how large language models actually work — tokens, attention, the inference loop, and what to make of it all — in 12 minutes.

Aditya Marin Gasga · 6 min read
⚡ Built with Signal →