Recursive Language Models: A new framework for infinite context in LLMs

By

Ben Dickson

-

January 26, 2026

Microsoft’s new Rho-alpha model brings tactile sensing to robotics

By

Ben Dickson

-

January 23, 2026

While large language models (LLMs) have mastered the art of processing text and images, they remain largely confined to the digital realm. Moving from generating code to folding laundry requires a fundamental shift in how AI perceives the world. Microsoft is attempting to bridge this gap with Rho-alpha (⍴ɑ), a new robotics foundation model designed to bring adaptivity to physical tasks.

Rho-alpha falls under the category of Vision-Language-Action (VLA) models. These systems ingest visual data and natural language commands to output robot arm actions. However, standard VLAs often struggle with precision tasks where vision is obstructed or insufficient, such as manipulating a slippery object or inserting a plug behind a desk. Rho-alpha addresses this by integrating tactile sensing directly into its decision-making process, a capability Microsoft refers to as “VLA+.”

Continue

Vulnerability in Perplexity’s BrowseSafe shows why single models can’t stop prompt injection

By

Ben Dickson

-

January 19, 2026

Lasso Security has discovered significant prompt injection vulnerabilities in BrowseSafe, a new open-source tool from Perplexity designed to protect AI browsers against prompt injection attacks. Despite marketing that promised developers could “immediately harden their systems,” Lasso’s red team achieved a 36% bypass rate using standard encoding techniques. The findings show that relying on a single model for security can create dangerous blind spots, leaving agentic browsers vulnerable to hijacking.

Continue

How test-time training allows models to ‘learn’ long documents instead of just caching them

By

Ben Dickson

-

January 12, 2026

VL-JEPA is a lean, fast vision-language model that rivals the giants

By

Ben Dickson

-

January 3, 2026

This article is part of our coverage of the latest in AI research.

Researchers at Meta have introduced VL-JEPA, a vision-language model built on a Joint Embedding Predictive Architecture (JEPA). Unlike traditional models that focus on generating text word-by-word, VL-JEPA focuses on predicting abstract representations of the world.

This approach makes the model significantly more efficient and capable; it achieves stronger performance than standard vision-language models (VLMs) while using only 50% of the trainable parameters. Beyond its efficiency, the model supports a wide range of applications without requiring architectural modifications. VL-JEPA represents a fundamental shift in model design, moving beyond simple token prediction to a system capable of understanding representations and modeling the physical world.

Continue

The evolution of LLM tool-use from API calls to agentic applications

By

Ben Dickson

-

December 29, 2025

URM shows how small, recurrent models can outperform big LLMs in reasoning tasks

By

Ben Dickson

-

December 22, 2025

This article is part of our coverage of the latest in AI research.

Researchers at Ubiquant have proposed a new deep learning architecture that improves the ability of AI models to solve complex reasoning tasks. Their architecture, the Universal Reasoning Model (URM), refines the Universal Transformer (UT) framework used by other research teams to tackle difficult benchmarks such as ARC-AGI and Sudoku.

While recent models like the Hierarchical Reasoning Model (HRM) and Tiny Recursive Model (TRM) have highlighted the potential of recurrent architectures, the Ubiquant team identified key areas where these models could be optimized. Their resulting approach substantially improves reasoning performance compared to these existing small reasoning models, achieving best-in-class results on reasoning benchmarks.

Continue

The hidden architecture behind AI systems that don’t break under growth

By

Contributor

-

December 21, 2025

By Purusoth Mahendran

Most engineering teams build systems that work today, but the best teams build systems that survive orders of magnitude growth. The difference becomes apparent when transaction volume shifts from millions to billions, rigid workflows give way to conversational interfaces, and batch processing evolves into real-time intelligence.

The gap between these approaches isn’t about writing better code; it’s about understanding that software architecture must account for operational reality, data quality constraints, and inevitable business evolution. Real scalability depends on architecture, data quality, and organizational design.

Continue

A few interesting observations on Gemini 3 Flash

By

Ben Dickson

-

December 20, 2025

Google has just released Gemini 3 Flash, a lightweight, efficient tool optimized for speed and low latency, capable of delivering performance comparable to the larger Gemini 3 Pro at a fraction of the cost. Google brands it as the democratization of frontier intelligence. On the surface, Gemini 3 Flash appears to be a standard upgrade in the race for efficient AI: a smaller, faster model distilled from its larger sibling.

However, a closer look at independent benchmarks and leaked architectural details suggests that Gemini 3 Flash is not simply a small model. We are likely looking at a massive, trillion-parameter architecture behaving like a lightweight agent through extreme sparsity, a design choice that brings unprecedented power but introduces specific tradeoffs in token efficiency and reliability. (Lots of speculation incoming.)

Continue

How Nvidia changed the open source AI game with Nemotron 3

By

Ben Dickson

-

December 16, 2025

Nvidia has released the Nemotron 3, a family of open source language models designed for reasoning and multi-agent tasks. Available in Nano, Super, and Ultra sizes, the models feature a hybrid mixture-of-experts (MoE) architecture that delivers high throughput and a massive 1-million-token context window.

Unlike typical open-weight releases, Nvidia has open-sourced the entire development stack, including training data, recipes, and reinforcement learning environments. As an affordable and easy-to-use model, Nemotron 3 might redefine the model landscape and provide Nvidia the chance to crown itself as the king of open-source AI.

Continue

TechTalks

How GhostClaw malware targets the OpenClaw AI agent boom

Why Meta’s V-JEPA 2.1 model is a massive step forward for…

Multi-level AI prompt engineering: A new tool for scientific discovery

Why AI won’t kill SaaS

How C-JEPA is teaching AI the physics of the physical world

Applied ML: When ‘perfect’ becomes the enemy of ‘good’

AI can’t replace software engineers yet, but here is how to…

How to turbocharge your product and market research with DeepSearch

How looking differently at data can save your machine learning project

Building a solid data foundation for generative AI applications

The evolution of LLM tool-use from API calls to agentic applications

What makes DeepSeek-V3.2 so efficient?

What to know about Claude Opus 4.5

OpenAI’s GPT-5: A reality check for the AI hype train

OpenAI’s grand return to open source: unpacking the gpt-oss release

AI is writing your code, but who’s reviewing it?

Machine learning in space: Building intelligent systems for the harshest environments

Decoding the brain, inspiring AI: How Rahul Biswas is bridging neuroscience…

The cash flow conundrum: How technology is reshaping small business finance

What to know about the security of open-source machine learning models

Recursive Language Models: A new framework for infinite context in LLMs

Microsoft’s new Rho-alpha model brings tactile sensing to robotics

Vulnerability in Perplexity’s BrowseSafe shows why single models can’t stop prompt injection

How test-time training allows models to ‘learn’ long documents instead of just caching them

VL-JEPA is a lean, fast vision-language model that rivals the giants

The evolution of LLM tool-use from API calls to agentic applications

URM shows how small, recurrent models can outperform big LLMs in reasoning tasks

The hidden architecture behind AI systems that don’t break under growth

A few interesting observations on Gemini 3 Flash

How Nvidia changed the open source AI game with Nemotron 3

Subscribe to continue reading

Subscribe to continue reading

Subscribe to continue reading