
Million-Token Reasoning: Inside the GPT-5.4 Architecture for Enterprise Memory
OpenAI has released GPT-5.4, featuring an unprecedented 1-million-token context window and a new reasoning kernel optimized for long-running autonomous workflows.
8 articles

OpenAI has released GPT-5.4, featuring an unprecedented 1-million-token context window and a new reasoning kernel optimized for long-running autonomous workflows.

NVIDIA's Nemotron 3 Super sets a new benchmark for open-weight models, specifically optimized for high-density reasoning and autonomous agent orchestration.

OpenAI officially replaces the 5.2 series with GPT-5.4 'Thinking,' a model that prioritizes cognitive density and steerable reasoning budgets over raw parameter counts.

OpenAI releases GPT-OSS-120B, a 117-billion parameter mixture-of-experts model under Apache 2.0, bringing frontier-level reasoning to every developer's local machine.

In a shocking move, OpenAI releases GPT-OSS-120B, its first major open-weight LLM, signaling a new competitive strategy against Meta and Alibaba.

Rakuten releases its most powerful AI model to date, boasting 700B parameters and state-of-the-art Japanese language capabilities under an open-source license.

Bigger isn't always better in the world of AI. Discover why small language models (SLMs) are becoming the secret weapon for startups looking for cost-efficiency, low latency, and total control.

Stop guessing and starting engineering. A technical guide to the principles of reliable prompt design for AI agents.