Industry Insights · 10 min read

Why AI Agents Need Structured Commerce Data

On January 23, 2025, OpenAI launched Operator -- a browser-controlling AI agent capable of navigating websites, filling out forms, and completing purchases on behalf of users. Priced at $200/month as part of the ChatGPT Pro plan, Operator marked a turning point.

The Shopping Agent Revolution

On January 23, 2025, OpenAI launched Operator -- a browser-controlling AI agent capable of navigating websites, filling out forms, and completing purchases on behalf of users. Priced at $200/month as part of the ChatGPT Pro plan, Operator marked a turning point. For the first time, a frontier AI lab explicitly positioned an autonomous agent as a shopping interface.

Operator was not alone. Amazon's Rufus, integrated into the main Amazon app, had already reached over 250 million users, with Amazon reporting a 60% higher conversion rate among shoppers who engaged the assistant. Google's Project Mariner began navigating Chrome tabs to execute multi-step web tasks. Perplexity launched its "Buy with Pro" feature, enabling direct purchases from search results. Each of these products represents a different approach to the same thesis: the next generation of e-commerce runs through AI agents, not through humans scrolling product pages.

Yet beneath this wave of innovation lies a structural problem. The vast majority of agent-driven commerce still depends on screen-scraping -- AI models parsing rendered HTML the same way a human would read a webpage. By most industry estimates, 99% of agent transactions happen via scraping rather than structured APIs. This works, barely, but it is slow, fragile, and inaccurate.

The question is no longer whether AI agents will shop. They already do. The question is whether your store's data is ready for them.

What AI Agents Actually See

When a human visits a product page, they see a rich visual layout: hero images, color swatches, a price in bold, customer reviews with star ratings, and a prominent "Add to Cart" button. The brain processes this information instantly, drawing on years of learned conventions about how e-commerce works.

An AI agent sees something entirely different. It receives raw HTML -- a soup of <div> tags, CSS classes, JavaScript bundles, and interleaved advertising scripts. There is no inherent semantic meaning. A price might live inside a <span class="pdp-price__main"> on one site and a <div data-testid="offer-price"> on another. The agent must guess, infer, and often hallucinate.

Research from Princeton and Stanford's WebArena benchmark illustrates the gap. When GPT-4 was tested on web navigation tasks using unstructured HTML, it achieved only 16% end-to-end accuracy. When the same model was given structured data -- clean, labeled fields with explicit semantics -- accuracy jumped to 54%. That is a 3.4x improvement from data formatting alone, with no change to the underlying model.

This accuracy gap has real commercial consequences. A scraping agent that misreads a price, misidentifies a product variant, or fails to detect that an item is out of stock does not just create a bad user experience. It erodes trust in the entire agent-commerce paradigm.

The Rise of Structured Commerce Data

The solution is not a new AI model. It is better data.

Schema.org -- the collaborative vocabulary maintained by Google, Microsoft, Yahoo, and Yandex -- provides a standardized way to describe products, offers, reviews, and brands in machine-readable format. JSON-LD (JavaScript Object Notation for Linked Data) is the preferred encoding, embedded directly in a page's <head> tag where any crawler or agent can extract it without parsing the visual DOM.

A JSON-LD Product markup tells an agent exactly what it needs to know: the product name, SKU, price, currency, availability, brand, aggregate rating, and review count. No guessing. No scraping heuristics. No fragile CSS selectors that break when a site redesigns.

Google has long used structured data for rich search results. What changed in 2024 and 2025 is that AI agents -- not just search crawlers -- began consuming this data as their primary information source. Schema.org Product markup is now the baseline for AI discoverability. Stores that lack it are functionally invisible to the growing fleet of shopping agents.

The Protocol Ecosystem

Structured data on the page is necessary but not sufficient. AI agents also need standardized protocols for interacting with commerce systems -- searching catalogs, managing carts, initiating payments, and coordinating with other agents. Over the past eighteen months, four major protocols have emerged to address different layers of this stack.

MCP (Model Context Protocol) was open-sourced by Anthropic in November 2024. MCP defines a universal standard for connecting AI models to external tools and data sources. It is the connectivity layer -- the mechanism by which an LLM discovers what capabilities a server offers and invokes them. Think of MCP as USB-C for AI: a single interface that works regardless of which model or which data source is on either end.

ACP (Agentic Commerce Protocol) was introduced by OpenAI in partnership with Stripe in September 2025. ACP focuses on the payment and transaction layer -- how an AI agent securely initiates a purchase, handles payment authorization, and manages order state. It builds on Stripe's payment infrastructure to provide a trusted checkout mechanism for agent-driven transactions.

UCP (Universal Commerce Protocol) was co-developed by Google and Shopify, announced in January 2025. UCP addresses the full shopping journey: product discovery, catalog browsing, cart management, and checkout flow. It defines a standardized API contract for how AI agents interact with merchant storefronts, from initial search to order confirmation.

A2A (Agent-to-Agent) was released by Google in April 2025. While the other protocols handle agent-to-server communication, A2A addresses agent-to-agent collaboration. It defines how one AI agent can delegate tasks to another, share context, and coordinate complex multi-step workflows -- such as a personal shopping agent coordinating with a shipping logistics agent.

Each protocol serves a distinct purpose. MCP provides the plumbing. UCP structures the shopping experience. ACP handles the money. A2A enables teamwork between agents. Together, they form the emerging infrastructure of agentic commerce.

The Market is Moving Fast

The numbers tell an unambiguous story. The global AI agents market reached an estimated $7.3 billion in 2025, growing at a compound annual rate between 43% and 50%. McKinsey projects that AI-influenced retail spending in the United States alone could reach $1 trillion by 2030.

Consumer behavior is shifting in lockstep. Adobe Analytics reported a 4,700% year-over-year increase in AI-driven traffic to retail websites during the 2024 holiday season. A separate study found that 26% of US adults used AI for product discovery or shopping recommendations in 2025 -- up from single digits just a year prior.

The platform players are responding with structural changes. In the summer of 2025, Shopify activated default MCP endpoints on every store in its ecosystem, making millions of merchants instantly accessible to AI agents via structured protocol. This was not a beta feature or a developer preview. It was a platform-wide default, signaling that Shopify views agent-readiness as fundamental infrastructure, not optional enhancement.

Google's AI Mode in Search, Microsoft's Copilot shopping features, and ChatGPT's native shopping experience all rely on structured product data to populate results. These are not experimental features anymore. They are primary surfaces where consumers discover and evaluate products.

The Invisibility Problem

Here is the strategic reality merchants face: if your product data is not structured and protocol-accessible, AI agents cannot reliably find you.

This is different from traditional SEO. In organic search, a page with mediocre metadata could still rank on quality content, backlinks, and domain authority. AI agents operate differently. They do not "browse" the web the way a search crawler does. They query structured endpoints, consume JSON-LD, and invoke protocol-defined tools. If a merchant's data is locked inside rendered HTML with no structured layer, the agent either skips that merchant entirely or attempts to scrape with low confidence -- producing results that the agent's own ranking logic will de-prioritize.

The parallel is instructive. In the early days of mobile commerce, merchants who did not optimize for mobile screens saw traffic and conversion collapse as smartphone usage grew. The same dynamic is now playing out with AI agents. Structured commerce data is the mobile-responsive design of the agentic era.

And the window for action is narrowing. As agent-driven shopping grows from 26% to majority adoption, merchants without structured data infrastructure will find themselves excluded from an increasing share of purchase decisions.

Building for Agent-Readiness

What does it take to make a store agent-ready? The requirements span three levels.

Level 1: Structured markup. Every product page needs valid JSON-LD with Schema.org Product, Offer, Brand, AggregateRating, and Review types. This is the minimum for AI discoverability. Validation tools like Google's Rich Results Test can verify markup accuracy.

Level 2: Protocol endpoints. Beyond static markup, stores need live API endpoints that agents can query in real time -- searching products by natural language, checking inventory, creating carts, and initiating checkout. MCP, UCP, and ACP each define specifications for these interactions.

Level 3: Data quality and freshness. Structured data is only valuable if it is accurate. Prices must reflect current pricing. Inventory counts must update in near-real time. Product descriptions must be complete enough for an agent to make informed recommendations without hallucinating missing details.

This is where infrastructure platforms like ORBEXA operate. Rather than requiring each merchant to independently implement multiple protocols, normalize their data, and maintain structured endpoints, ORBEXA provides a multi-protocol infrastructure layer that transforms existing store data into agent-ready formats. The platform supports MCP, UCP, ACP, and A2A through shared multi-tenant infrastructure with custom domain support, enabling merchants of any size to become agent-accessible.

What Comes Next

The transition from human-browsed to agent-queried commerce is not a speculative future. It is happening now, with production-grade agents from OpenAI, Google, Amazon, Microsoft, and a growing ecosystem of vertical players. The protocols are standardizing. The consumer behavior is shifting. The platform defaults are changing.

Merchants who invest in structured commerce data and protocol infrastructure today are positioning themselves for a market where AI agents influence an increasing share of purchasing decisions. Those who wait risk the same fate as businesses that ignored mobile optimization a decade ago -- not sudden irrelevance, but a slow erosion of visibility and market share that compounds over time.

The data layer is the new storefront. The question is whether yours is open for business.

← Back to News