What is...

Why the future of agentic AI is all about the harness

Ben Dickson

Scaling LLMs hits limits when dealing with agentic AI tasks. For that, we need to look at the harness and the system built around the model(s).

The evolution of LLM tool-use from API calls to agentic applications

Ben Dickson

A look at the evolution of LLM tool-use, from supervised fine-tuning to Reinforcement Learning (RLVR) and agentic applications in large and specialized models.

What makes DeepSeek-V3.2 so efficient?

Ben Dickson

DeepSeek-V3.2 is a top-5 LLM, sitting next to the likes of Grok 4 and GPT-5. But what is more impressive is its efficiency.

What to know about Claude Opus 4.5

Ben Dickson

Anthropic responds to OpenAI and Google with Claude Opus 4.5, a model that prioritizes coding dominance, cost-efficiency, and user-controlled reasoning.

OpenAI’s GPT-5: A reality check for the AI hype train

Ben Dickson

OpenAI's GPT-5 is finally here, but a rocky rollout and mixed reviews have divided the community, creating a reality check for AI hype.

OpenAI’s grand return to open source: unpacking the gpt-oss release

Ben Dickson

The new gpt-oss open-weight models undercut OpenAI's own closed LLMs, marking a strategic pivot designed to reshape the competitive AI market.

What to know about Gemini 2.5 Deep Think

Ben Dickson

A look inside Google’s Gemini 2.5 Deep Think, the AI that uses extended "slow thinking" to solve complex math and code problems.

How OpenAI’s ChatGPT agent redefines AI capability and risk

Ben Dickson

OpenAI's powerful new ChatGPT Agent redefines AI capabilities while introducing new risk and attack vectors in security and data integrity of AI systems.

How Google’s Agent2Agent can boost AI productivity through inter-agent communication

Ben Dickson

Google's new A2A framework lets different AI agents chat and work together seamlessly, breaking down silos and improving productivity across platforms.

Demystifying vibe coding: Hype, reality, and why you still need to code

Ben Dickson

There is a lot of hype surrounding "vibe coding." But there is a darker reality to letting AI write your entire code and ignoring fundamental software skills.

Why LLMs should stop thinking out loud (and what comes after…

Beyond vibe coding: How Codev 3.0 engineers the AI-powered dev team

How Cursor’s Composer 2.5 uses self-distillation to beat the frontier LLMs…

Vertical integration as AI infrastructure: What 21D’s full arch implant system…

Why sandboxing OpenClaw doesn’t stop data exfiltration

Applied ML: When ‘perfect’ becomes the enemy of ‘good’

AI can’t replace software engineers yet, but here is how to…

How to turbocharge your product and market research with DeepSearch

How looking differently at data can save your machine learning project

Building a solid data foundation for generative AI applications

Demystifying loop engineering: Get more from AI agents, avoid loopmaxxing

Why the future of agentic AI is all about the harness

The evolution of LLM tool-use from API calls to agentic applications

What makes DeepSeek-V3.2 so efficient?

What to know about Claude Opus 4.5

AI is writing your code, but who’s reviewing it?

Machine learning in space: Building intelligent systems for the harshest environments

Decoding the brain, inspiring AI: How Rahul Biswas is bridging neuroscience…

The cash flow conundrum: How technology is reshaping small business finance

What to know about the security of open-source machine learning models

Demystifying loop engineering: Get more from AI agents, avoid loopmaxxing

Why the future of agentic AI is all about the harness

The evolution of LLM tool-use from API calls to agentic applications

What makes DeepSeek-V3.2 so efficient?

What to know about Claude Opus 4.5

OpenAI’s GPT-5: A reality check for the AI hype train

OpenAI’s grand return to open source: unpacking the gpt-oss release

What to know about Gemini 2.5 Deep Think

How OpenAI’s ChatGPT agent redefines AI capability and risk

How Google’s Agent2Agent can boost AI productivity through inter-agent communication

Demystifying vibe coding: Hype, reality, and why you still need to code