Open post modify

Clive Can Now Modify Itself

We've been building clive — a CLI Live Environment that gives an LLM a terminal and a keyboard. The core idea is simple: instead of wrapping tools in APIs and schemas, you let the agent read the screen and type. The terminal becomes the interface. The loop becomes the protocol. We've added something that takes...

Open post interleaved thinking

The AI That Pauses to Think: How Interleaved Reasoning Is Reshaping Autonomous Agents

When Moonshot AI demonstrated its Kimi K2 model tackling a PhD-level mathematics problem in hyperbolic geometry, according to examples published in their technical documentation, the AI didn't just compute an answer. It embarked on a 23-step journey: searching academic literature, running calculations, reconsidering its approach based on results, querying databases again, and iterating until it...

Open post cloudflare

The LLM Whisperers: How Cloudflare and Anthropic Cracked the Code on AI Agent Efficiency

There's a delicious irony at the heart of modern AI development. We've spent years training large language models on every scrap of code humanity has ever written—Stack Overflow answers, GitHub repositories, programming textbooks, documentation—teaching them to become fluent in Python, JavaScript, TypeScript, and dozens of other languages. Then, when it comes time to actually use...

Open post locally

The Complete Guide to Running LLMs Locally: Hardware, Software, and Performance Essentials

For years, the language model arms race seemed to belong exclusively to cloud providers and their API keys. But something remarkable has happened in the past eighteen months: open-weight models have matured to the point where sophisticated, capable AI can now run entirely on consumer hardware sitting under your desk. The implications are profound. Your...

Open post skills

Claude’s Modular Mind: How Anthropic’s Agent Skills Redefine Context in AI Systems

If you've been building with large language models, you've hit this wall: every API call requires re-explaining your entire workflow. Financial reports need 500 tokens of formatting rules. Code generation needs another 300 tokens for style guides. Multiply this across thousands of requests, and you're paying twice—once in API costs, once in context window exhaustion....

Scroll to top