State of Frontier AI
The frontier model landscape at a glance - release cadence, the geography of labs and the countries behind them, and how the field splits between open-weight and closed models.
Release cadence by lab
Release cadence by country
Release cadence by access
The geography of frontier AI
Open vs closed
Monthly users: AI apps vs Google Search
Run it yourself: best open weights + local hardware
- GLM-5.2 / Kimi K2.7 Code / DeepSeek V4 - top-tier agentic coding (long-horizon tasks, tool-call accuracy). 1T-class MoE; need a server.
- Qwen3-Coder (32B) / Devstral Small - the practical local picks; strong coding on a single GPU.
- DeepSeek V4 / Qwen 3.6 / MiniMax M3 - frontier-class quality, MoE cost advantage when self-hosting at scale.
- Gemma 4 (26B-A4B / 31B) / Mistral Small / Llama - efficient, permissive licensing, easy local deployment.
Dense models up to ~32B and small MoE at Q4: Qwen3-Coder 32B, Gemma 4 31B, DeepSeek 32B distill. A capable local coding copilot.
70B dense and mid-size MoE; 100B+ MoE on a high-memory Mac. Small-team self-hosting via vLLM tensor parallelism.
The 1T-class MoE flagships (DeepSeek V4, Kimi K2.7, GLM-5.2) at INT4/FP8 for production batch throughput. e.g. ~630 GB for a 1T model at INT4.
The agent landscape
Autonomous multi-file edits with run / test / debug loops across a repo. Claude Code, Cursor, Devin, Codex, Copilot.
Multi-source web research, synthesis, and cited reports over long horizons. ChatGPT, Gemini, Perplexity deep research.
Drive a real browser or desktop - click, type, fill forms, finish web tasks. ChatGPT Agent, Perplexity Computer, computer-use models.
Resolve tickets end to end, with context from the systems where data already lives. Sierra, Ada, Intercom Fin, Agentforce.
Query, join, and reason over structured and unstructured data on demand. Analytics agents, LlamaIndex, warehouse-native agents.
Handle live phone calls at scale - the fastest-moving frontier, already taking millions of contact-center calls. Genesys, PolyAI, Yellow.ai.
Prospect, qualify, personalize outreach, and book meetings against your CRM. 11x, Artisan, Clay, Agentforce SDR.
Draft, optimize, and ship content and campaigns - increasingly tuned for AI search (GEO). Jasper, Writer, HubSpot agents.
Triage alerts, investigate threats, and draft remediations across the security stack. OpenAI Daybreak, Anthropic Glasswing, CrowdStrike.
Manage inbox, calendar, and tasks - the everyday agent embedded across consumer apps. ChatGPT, Gemini, Copilot, Meta AI.
Track and grow a brand's presence across SERPs and AI search, then act on the gaps. DemandSphere Agents.
Chain steps across apps into reliable back-office pipelines with humans in the loop. n8n, Zapier agents, sub-agent orchestration.
Nous Research - open model + agent collective; Hermes models, Forge and Atropos RL/agent tooling.
LangGraph - stateful multi-agent orchestration from LangChain; the most-deployed open agent layer.
CrewAI - role-based multi-agent "crews" with shared goals and tools.
Microsoft AutoGen - conversational multi-agent framework for cooperating agents.
OpenAI Agents SDK - lightweight open SDK for handoffs and tool use (successor to Swarm).
OpenHands - open autonomous software-engineering agent (formerly OpenDevin).
Pi - minimal MIT-licensed coding-agent harness (Earendil); 15+ model providers, plugins and skills.
smolagents - Hugging Face's minimal framework for code-writing agents.
Claude Code - Anthropic's terminal-native coding agent.
Codex - OpenAI's cloud software-engineering agent.
Devin - Cognition's autonomous software engineer.
Cursor - Anysphere's agentic IDE with background agents.
Gemini CLI - Google's open-source agentic terminal client.
Jules - Google's asynchronous coding agent.
Manus - general-purpose autonomous task agent.
Monetization & commerce
Anthropic passed OpenAI in run-rate and filed to IPO near ~$47B annualized; OpenAI targets $30B full-year. Google's AI revenue is bundled into Cloud / Workspace and not cleanly isolable; xAI is off a smaller base.
OpenAI's ChatGPT ad pilot launched Feb 2026 (~$100M annualized, ~$60 CPM, $200K minimum). Google now shows ads alongside ~25% of AI Overviews plus shopping ads in AI Mode. Perplexity dropped ads entirely. The AI-ad market is estimated at $15-25B, growing 35-50% a year.
Agentic commerce lets agents discover, compare, and buy inside the chat. OpenAI's Agentic Commerce Protocol powers ChatGPT Instant Checkout (Stripe-settled); Google's Universal Commerce Protocol pairs with shopping in AI Mode. Merchants expose product feeds; the agent handles the buy.
Both leaders filed to go public in mid-2026 (xAI + SpaceX reportedly combined ~$1.25T). Enterprise is where the margins are - Anthropic wins roughly 70% of head-to-head enterprise deals against OpenAI - while consumer subscriptions and ads drive volume.
Agent, skill & connector marketplaces
OpenAI GPT Store / Apps - custom GPTs and apps inside the ChatGPT ecosystem.
Salesforce AgentExchange - enterprise agents and actions for Agentforce.
Microsoft Agent Store - agents inside Microsoft 365 Copilot.
Google Agentspace & Agent Garden - prebuilt enterprise agents and ADK samples.
AWS Bedrock AgentCore - build, deploy, and buy agents on AWS.
Anthropic Agent Skills - reusable skill folders that extend Claude.
Claude Code plugin marketplaces - community plugin/skill registries (300K+ developer visits a month).
SkillsLLM - 1,600+ security-vetted, cross-model skills.
Hugging Face Spaces - open hub of runnable agent apps and demos.
Anthropic MCP directory - official catalog of Model Context Protocol servers.
Smithery - large community registry of MCP servers.
Glama - curated MCP server directory and host.
Composio - managed tool and connector layer for agents.
Definitions
A model whose trained weights are publicly downloadable, so anyone can run, fine-tune, or self-host it (DeepSeek, Qwen, GLM). It does not always mean the training data is open.
A model offered only through an API or product, with weights not released (GPT, Gemini, Claude).
An architecture where only a subset of parameters ("experts") activate per token, so a model can hold trillions of total parameters but compute like a much smaller one.
A model that spends extra compute "thinking" (chain-of-thought) before answering, trading latency for accuracy on hard problems.
The maximum amount of text, in tokens, a model can consider at once - including the prompt and its own output.
Compressing a model's weights to fewer bits (e.g. 4-bit) to cut memory use with modest quality loss - key to running large models on limited hardware.
An AI system that plans and takes multi-step actions toward a goal using tools - combining autonomy, reasoning, tool use, and memory.
An open standard for connecting AI models to external tools, data, and services - the common interface for agent integrations.
Letting AI agents discover, compare, and buy products on a user's behalf inside the chat (e.g. ChatGPT Instant Checkout).
Answer Engine Optimization / Generative Engine Optimization: making content easy for AI search engines to surface and cite.
FAQ
More than 80 frontier models from 17 labs across the United States, China, France, and Canada, updated as new models ship.
The United States leads by tracked model count, with China a clear second; together they account for the large majority of frontier models. France (Mistral) and Canada (Cohere) round out the current set.
Close to half. Open-weight (downloadable) and closed (API-only) models are split roughly evenly, and Chinese labs skew open-weight while US labs skew closed.
ChatGPT and Meta AI each report around a billion monthly users, with Google Gemini close behind - though Google Search still operates at several times that scale, and Google's AI Overviews alone reach billions of people monthly.
For top-tier agentic coding, GLM-5.2, Kimi K2.7 Code, and DeepSeek V4 lead but need server-class GPUs. For a single 24-32 GB GPU, Qwen3-Coder 32B and Devstral Small are the practical local picks.
Subscriptions and API usage dominate, with Anthropic and OpenAI each at multi-billion-dollar annualized run-rates. Advertising (ChatGPT and Google) and agentic commerce with in-chat checkout are fast-growing newer channels.
Terms of citation
- Attribution: credit "DemandSphere - State of Frontier AI" and link back to this page.
- License: CC BY-NC 4.0 (attribution, non-commercial). No commercial use without permission.
- Underlying data: model figures come from the AI Frontier Model Tracker, also available via the free JSON API and MCP server. Usage and traffic figures are third-party or company-disclosed estimates, cited inline.
Commercial use or questions: contact DemandSphere.
Data: AI Frontier Model Tracker · JSON API · CC BY-NC 4.0