engineering.
42 writings found
Latest Archives
DeepSeek V4: The Frontier Model That Costs Almost Nothing
DeepSeek drops V4 models that rival GPT and Gemini at a fraction of the cost. The efficiency gains are staggering, and the models might even run on my laptop.
Claude's System Prompt Evolution: What Opus 4.7 Tells Us About AI Behavior Design
Anthropic's latest system prompt reveals a shift toward proactive AI behavior. I dig into what these changes mean for developers building with Claude.
PyCon US 2026: Why This Matters for Python and AI Engineering
PyCon returns to California with new AI and security tracks. What this shift means for the Python community and the future of technical conferences.
When Benchmarks Break: A Laptop Model Drew Better Pelicans Than Claude Opus
A quantized 21GB model running locally outperformed Anthropic's flagship on SVG generation. What this tells us about AI benchmarks and model comparison.
When Benchmark Performance Stops Meaning What We Think It Means
A quantized local model outdraws Claude Opus 4.7 at pelicans on bicycles. What does that tell us about AI benchmarks? Probably nothing good.
Meta's Muse Spark: A Tooled-Up Return to Frontier Models
Meta launches Muse Spark with 16 built-in tools, visual grounding, and Code Interpreter. But where's the open source promise?
Meta's Muse Spark: They're Back in the Frontier Game (And the Tools Are Wild)
Meta drops Muse Spark with 16 powerful tools including visual grounding, Python sandbox, and Meta content search. Are they back in the race?
Meta's Muse Spark: A Developer's First Look at the Tool Arsenal
Meta returns to frontier models with Muse Spark. I got my hands dirty with its 16 tools, from visual grounding to Python sandboxes, and here's what matters.
Meta's Muse Spark: A Tool-Heavy Return to the Frontier Model Race
Meta drops Muse Spark with 16 tools, Code Interpreter, visual grounding, and Meta content search. But is a hosted-only model what we really wanted?
The Axios Attack: When Social Engineering Becomes Your Supply Chain's Weakest Link
A sophisticated social engineering attack compromised Axios maintainer credentials through fake job interviews. Every open source maintainer needs to know this.
The Axios Attack: Why Social Engineering Is Now the Biggest Threat to Open Source
A sophisticated supply chain attack on Axios used fake job interviews to install malware. Every open source maintainer needs to understand this threat.
Building macOS Apps Without Knowing Swift: What Vibe Coding Actually Teaches Us
I built two monitoring tools for my M5 MacBook using Claude and GPT without writing Swift myself. The results work, but should they?
Starlette 1.0 and the Problem of Training Data Obsolescence
Starlette finally hits 1.0, but breaking changes expose a fascinating problem: how do you make LLMs generate code for frameworks they weren't trained on?
Starlette 1.0 and the Problem of Teaching AI New Tricks
Starlette finally hits 1.0, but breaking changes expose a fascinating challenge: how do you get LLMs to generate code for versions they weren't trained on?
Starlette 1.0 and the Curious Case of Teaching AI New Tricks
Starlette finally hits 1.0, but what happens when your LLM was trained on outdated code? Claude's new skills feature might just solve that problem.