Grok 3
Grok 3 storms the AI scene with strong benchmark results. Here's everything to know about this new large language model (LLM) and large reasoning model (LRM) from xAI.
LLM ensemble
LLM ensembles combine the outputs of multiple models to improve response quality. Mixture-of-agents (MoA), a more advanced technique, takes ensembles to the next level.
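The core ensemble idea can be shown in a toy sketch: query several models and aggregate their answers. The stand-in functions below are hypothetical placeholders for real LLM calls, and majority voting is just one simple aggregation strategy (MoA instead feeds candidate answers to an aggregator model):

```python
from collections import Counter

# Stand-in "models": in practice each would be a call to a different LLM.
def model_a(question):
    return "Paris"

def model_b(question):
    return "Paris"

def model_c(question):
    return "Lyon"

def ensemble_answer(question, models):
    # Simple ensemble: ask every model, then take the majority vote.
    answers = [m(question) for m in models]
    return Counter(answers).most_common(1)[0][0]

print(ensemble_answer("What is the capital of France?", [model_a, model_b, model_c]))
# majority of the three stand-in models answers "Paris"
```

Voting works for short factual answers; for open-ended generation, ensembles typically use a judge or aggregator model instead of exact-match counting.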
DeepSeek R1
There is a lot of hype and confusion around DeepSeek-R1. Here is what you need to know about how this reasoning model works and what makes it special.
OpenAI ChatGPT
OpenAI's o3-mini is faster, cheaper, and smarter than o1. It is also a bid to reclaim dominance amid the rising threat of DeepSeek.
Robot solving Rubik's cube
OpenAI o1 and o3 are very effective at math, coding, and reasoning tasks. But they are not the only models that can reason.
multi-modal language model
GPT-4 Vision is an impressive model that can create new user experiences. Fortunately, there are open-source alternatives. But they come with caveats.
baby llama llm compression
Large language models (LLMs) require huge memory and computational resources. LLM compression techniques make models more compact and able to run on memory-constrained devices.
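One common compression technique is quantization, which stores weights at lower precision. A minimal NumPy sketch of symmetric 8-bit quantization (the array sizes and random weights are illustrative, not from any real model):

```python
import numpy as np

rng = np.random.default_rng(0)
w = rng.normal(size=1000).astype(np.float32)  # stand-in "pretrained" weights

# Symmetric 8-bit quantization: keep int8 values plus a single float scale.
scale = np.abs(w).max() / 127
q = np.round(w / scale).astype(np.int8)       # 4x smaller than float32 storage

# Dequantize at inference time; rounding error is bounded by scale / 2.
w_hat = q.astype(np.float32) * scale
print(float(np.abs(w - w_hat).max()))
```

Real LLM quantization schemes (per-channel scales, group-wise quantization, 4-bit formats) build on this same store-low-precision, rescale-at-inference idea.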
3D gradient descent
Gradient descent is the main technique for training machine learning and deep learning models. Read all about it.
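The technique fits in a few lines: repeatedly step the parameters against the gradient of the loss. A minimal sketch for a 1-D quadratic loss f(w) = (w - 3)^2, with an illustrative learning rate and step count:

```python
def gradient_descent(lr=0.1, steps=100, w0=0.0):
    """Minimize f(w) = (w - 3)^2, whose gradient is f'(w) = 2 * (w - 3)."""
    w = w0
    for _ in range(steps):
        grad = 2 * (w - 3)  # analytic gradient of the loss at w
        w -= lr * grad      # step in the direction that decreases the loss
    return w

print(gradient_descent())  # converges toward the minimum at w = 3
```

In deep learning the gradient is computed by backpropagation over millions of parameters rather than by hand, but the update rule is the same.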
swirls abstract data
Everything to know about LLM fine-tuning: supervised fine-tuning, reinforcement learning from human feedback (RLHF), and parameter-efficient fine-tuning (PEFT).
vector abstract background
Low-rank adaptation (LoRA) is a technique that cuts the cost of fine-tuning large language models (LLMs) to a fraction of what full fine-tuning requires.
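The savings come from freezing the pretrained weight matrix W and learning only a low-rank update B @ A. A minimal NumPy sketch under illustrative dimensions (a real LoRA setup applies this to attention weight matrices and trains A and B with backprop):

```python
import numpy as np

rng = np.random.default_rng(0)
d_out, d_in, r = 64, 64, 4                 # rank r is much smaller than d

W = rng.normal(size=(d_out, d_in))         # frozen pretrained weights
A = rng.normal(size=(r, d_in)) * 0.01      # trainable low-rank factor
B = np.zeros((d_out, r))                   # zero-initialized, so W is unchanged at start

def lora_forward(x):
    # Effective weight is W + B @ A, but only A and B are trained.
    return W @ x + B @ (A @ x)

x = rng.normal(size=(d_in,))
# At initialization the LoRA branch contributes nothing:
assert np.allclose(lora_forward(x), W @ x)

# Full fine-tuning would update d_out * d_in = 4096 parameters;
# LoRA updates only r * (d_in + d_out) = 512.
print(A.size + B.size, "trainable vs", W.size, "frozen")
```

After training, B @ A can be merged into W, so LoRA adds no inference latency.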