The pulse, right now.
A continuously-updating read on what the AI industry is doing, shipping, buying, and saying.
appeal 10.0Ukraine Deploys AI Logistics Drones
Ukraine's AI-enabled Hornet drones disrupt Russian supply lines, forcing shorter convoys and limiting heavy equipment movement. This directly impacts Russia's ability to sustain offensive operations, pressuring defence procurement teams to counter AI-driven precision strikes.
The tech landscape is rapidly evolving, with AI integration becoming paramount. From Meta's AI pendant to advanced LLM inference engines and specialised AI chips, efficiency and performance are key. Simultaneously, ethi…
The stream
- PRODUCTMeta Meta develops AI pendant for testing7.0
- OPENSOURCEjmaczan Releases tiny-vLLM engine on GitHub8.0
- RELEASEKog AI Kog AI launches Inference Engine tech preview9.0
- PRODUCTUkraine military Deploys AI drones against Russian convoys10.0
- RELEASELiquid.ai Liquid.ai Releases On-Device MoE Model8.0
- PRODUCTMistral AI Mistral AI launches full AI stack9.0
- FUNDINGXCENA XCENA raises $135M for AI memory chip8.0
- SECURITYClaude Code Undocumented features exposed in package7.0
- LEADERSHIPSam Altman AI leaders reverse job loss warnings8.0
- FUNDINGGlean Glean hits $300M ARR, cuts AI costs9.0
- RELEASEDataHub DataHub Launches Context Intelligence Layer7.0
- PRODUCTClaude Code Claude Code Gains Dynamic Workflows for AI8.0
- + more stories in the last 24 hoursAll stories →
Subjects
The tech landscape is rapidly evolving, with AI integration becoming paramount.
A weekly synthesis of the most material AI-industry moves — what shipped, what was bought, what regulators said, and how the pattern is shifting.
Capital availability is no longer the primary constraint — return justification is.
Departments
All stories →
jmaczan Releases tiny-vLLM Engine
Developer jmaczan released `tiny-vllm`, an open-source C++/CUDA LLM inference engine and course. It provides a practical guide to implementing high-performance techniques like PagedAttention and continuous batching, lowering the barrier for engineers to optimise LLM inference on GPU hardware.

Kog AI Accelerates GPU Inference
Kog AI's new Inference Engine achieves 3,000 tokens/s on standard datacenter GPUs, significantly reducing agentic AI workflow times. This shifts the inference bottleneck to memory bandwidth, allowing platform engineers to leverage existing hardware for real-time LLM performance.

Mistral AI Builds Full AI Stack
Mistral AI expands beyond models to offer a full AI stack, including compute and on-prem solutions. This provides European enterprises with a sovereign alternative for sensitive workloads, addressing data residency and regulatory compliance concerns.

Liquid.ai Releases On-Device MoE Model
Liquid.ai's LFM2.5-8B-A1B model enables complex agentic workflows directly on consumer hardware, reducing cloud dependency. Its expanded context, improved multilingual support, and reduced hallucinations offer platform engineers and product teams a more reliable foundation for on-device AI applications.

XCENA Raises $135M for AI Memory Chip
XCENA secured $135 million for its MX1 chip, designed to integrate compute directly into DRAM via CXL. This aims to drastically cut AI inference infrastructure costs by reducing data round trips, offering hyperscalers a path to significant operational savings and efficiency gains.

Glean Hits $300M ARR, Cuts AI Costs
AI budget-cutting is now a primary driver for enterprise AI adoption. Glean reports $300M in ARR and annualized revenue run rate, driven by its "context graph" reducing AI computing costs, demonstrating a market shift towards solutions that optimise operational expenditure for CTOs and procurement teams.
Every Tuesday. 5-minute read. 33,525 unique readers.
A tight brief for CTOs, CFOs, product leads, and founders. No hype. No hedging. No filler.