Stainless Human-Dependent

Name: Agent Native Offer Audit — Stainless
Item: Stainless
Rating: 54
Author: Agent Native Offers

AUDIE Score: 54/100 · Audited 2026-04-13 · Website: https://stainless.com · Machine-readable: JSON

Pillar Scores

P1 Signal Architecture — 12/25

P2 Clarity Stack — 15/25

P3 Trust Envelope — 9/20

P4 Velocity Triggers — 8/10

P5 Gravity Design — 10/20

Audit History

Scores are never overwritten — every re-audit is appended as a new dated version. Machine-readable: history JSON.

Version	Date	Score	Tier	Rubric
v1	2026-04-13	54/100	Human-Dependent	v1-2026-04

Executive Summary

Stainless scores 54/100 (Human-Dependent) — a score that understates its actual strategic position in the agent ecosystem. The company builds the infrastructure that makes APIs agent-accessible for OpenAI, Anthropic, Google, and Cloudflare, yet its own offer infrastructure is entirely human-configured: no llms.txt, no structured data, no native log API. The platform's gravity is exceptional (auto-regenerating SDKs and MCP servers create some of the highest switching costs audited to date), and velocity is strong (free tier, no friction, quantified performance signals). The gap is entirely in Signal Architecture and Trust infrastructure — the tools an agent needs to independently discover, evaluate, and manage Stainless. The top recommendation is publishing llms.txt immediately: a one-hour fix that would raise the score by 5 points and correct the most ironic gap in the dataset — an AI infrastructure company that hasn't told AI where to find it.

Strongest Signals

Best-in-class Integration Gravity (P5-A: 4/5) Stainless is deeply embedded in the API development lifecycle. Auto-regeneration on every OpenAPI spec change means the MCP server, SDKs, and docs all update in lockstep. Switching away requires rebuilding the entire developer interface stack — one of the highest switching costs of any platform audited.
Exceptional Velocity Triggers (P4: 8/10) Free tier, no credit card, self-serve, and quantified decision metrics (94–97% accuracy, 3x fewer tool calls, 100k+ tokens saved) give agents everything needed to independently evaluate and activate. This is the highest P4 score in the audit portfolio.
Compounding Value Through Auto-Regeneration (P5-D: 3/5) As the underlying API evolves, so does the generated MCP server — automatically. This creates an agent-visible compounding value loop that most platforms lack entirely.

Critical Gaps

No llms.txt (P1-C: 0/5) A platform whose entire value proposition is enabling agents to use APIs has no /llms.txt. This is the single most ironic gap in the audit — a company building AI agent infrastructure that hasn't told AI agents how to find or understand it.
No Structured Data (P1-A: 2/5) No schema.org markup found on any page. For a company whose homepage is the front door for agent discovery, the absence of Organization, Product, and Offer schema is a first-impression failure.
No Substitution or Fallback Policy (P2-C: 0/5) Zero documentation on what happens when the code execution sandbox fails, when the MCP server generator is unavailable, or when generated servers degrade. Production agents have no guidance for degraded states.
Audit Logs Not Natively Accessible (P3-C: 3/5) Stainless's monitoring product forwards to external platforms rather than exposing a first-party log API. Agents can't query Stainless directly for execution history — they need access to Splunk, Azure Monitor, or similar.

All 20 Criteria

P1-A Structured Data — 2/5
robots.txt allows all crawlers and two sitemaps are present (sitemap.xml + docs/sitemap-index.xml), signaling crawlability. However, direct search for schema.org/JSON-LD on stainless.com returned no results. Homepage is React-rendered marketing with no detectable rich structured data markup.

P1-B Machine-Readable Pricing — 3/5
Pricing page at stainless.com/pricing has highly granular HTML-structured pricing: $79/$499/Enterprise per generator per month, with endpoint count limits (50/100/100+), preview build limits (300/1000/custom), and explicit overage rates ($1/1000 builds; $5/mo per endpoint per SDK). Not encoded in schema.org/Offer format.

P1-C llms.txt / Agent Layer — 0/5
No /llms.txt file found at stainless.com/llms.txt. The URL returned information about the company but no structured agent-facing content file. No evidence of any equivalent agent identity layer.

P1-D API / MCP Availability — 4/5
The product IS an MCP server generator. Outputs: NPM packages, Cloudflare Workers deployments, Docker containers, self-hosted options. Two-tool architecture (search_docs + execute) confirmed. Multiple deployment paths. Score capped at 4 because Stainless's own management API (for project creation, spec updates) is not publicly documented.

P1-E Discoverability (GEO) — 3/5
Two sitemaps, open crawling, well-structured docs at app.stainless.com. Named as infrastructure for Anthropic, OpenAI, Google, and Cloudflare — strong LLM ecosystem presence. Blog posts explicitly addressing agent experience. No llms.txt and no explicit AI retrieval optimization.

P2-A Offer Completeness — 4/5
What (SDK + docs + MCP server generation from OpenAPI specs), who (API companies, developer teams), how much (clear tier pricing with per-generator monthly cost). All findable on a single /pricing page with feature breakdowns by tier.

P2-B Scope & Limits — 4/5
Explicit, specific limits documented: 5 generators (Free), 25–50 endpoints per generator, 100/300/1,000 preview builds/month per tier, $1 per 1,000 overage builds, $5/mo per additional endpoint per SDK at Enterprise. All quantified and published.

P2-C Substitution Rules — 0/5
No documentation on what happens when the service is unavailable, when code execution sandbox fails, or when generated MCP servers degrade. No fallback routing, graceful degradation policy, or alternative guidance found anywhere.

P2-D Conditional Logic — 3/5
Most conditions are disclosed on the pricing page: tier gates for webhooks, OAuth 2.0, dedicated support, and AI chat clearly tied to plan level. Enterprise requires custom pricing (disclosed). No opaque "contact sales" wall on standard tiers.

P2-E Semantic Precision — 4/5
Strong use of specific metrics: "94–97% task accuracy," "3x fewer tool calls," "100k+ tokens saved on complex tasks." Named enterprise customers (Anthropic, OpenAI, Google, Cloudflare, Modern Treasury). Exact pricing numbers throughout. Minimal vague superlatives.

P3-A Verifiable Performance — 2/5
Self-reported performance benchmarks (94–97% accuracy, 3x tool call reduction) on the MCP product page. Notable customer logos (Anthropic, OpenAI) serve as implicit verification. No public status page found, no third-party G2/Trustpilot reviews indexed, no SLA document located. $25M Series A (Dec 2024) signals institutional credibility but not operational reliability.

P3-B Scoped Permissions — 2/5
Authentication via environment variable API keys — effectively all-or-nothing access per key. Code execution runs in Cloudflare Workers sandbox (security benefit), but no granular agent-specific permission model. No time-bounded, amount-bounded, or action-bounded scoping documented for agent operations.

P3-C Audit Trail — 3/5
"Real-Time MCP Monitoring and Logging" product confirmed (dedicated page on stainless.com/mcp/). Structured telemetry per tool call: timestamps, tool names, parameters, response times, success/error, request IDs. Logs forward to external observability platforms (Splunk, Azure Monitor, Tinybird). Not natively accessible through a Stainless API — agents must query through external platforms.

P3-D Behavioral Consistency — 2/5
Company founded by Stripe's SDK infrastructure team; pedigree suggests stability. Auto-regeneration on OpenAPI spec changes implies disciplined versioning of output artifacts. But no formal API versioning contract, no documented change notice period, no SLA, no version-controlled terms found.

P4-A Friction-Free Activation — 4/5
Free tier with no credit card required. Self-serve signup. MCP server generation begins immediately from the Free tier. No approval workflow or human gate in the documented flow. Some multi-step configuration (create project, add SDKs, configure MCP, set environment variables) prevents a 5/5.

P4-B Agent Decision Signals — 4/5
Free tier provides zero-commitment entry. Quantified performance metrics (94–97% accuracy, 3x fewer tool calls, 100k+ token savings) are explicit, agent-legible signals for whether to integrate. Clear pricing ladder enables upgrade decision without human involvement.

P5-A Integration Depth — 4/5
SDKs embed in customer codebases. MCP server integrates into agent workflows and IDE configurations. Documentation deploys to customer-owned domains. Auto-regeneration ties Stainless deeply into the API development lifecycle — every spec update flows through Stainless. High switching cost: migrating away means rebuilding SDKs, docs, and MCP servers from scratch.

P5-B Agent Memory Layer — 2/5
MCP server is customized per-project via OpenAPI spec, creating implicit personalization. The docs search tool queries project-specific documentation. But no persistent agent memory layer — no API for agents to read/write contextual state between sessions. Every interaction queries static docs fresh.

P5-C Programmatic Renewal — 1/5
Standard subscription billing only. No agent-accessible endpoint to check subscription status, remaining credits, or trigger renewal. No evidence of any programmatic subscription management API.

P5-D Compounding Value — 3/5
Auto-regeneration on spec changes is a genuine compounding value signal visible to agents: as the API grows, the MCP server gains new tools, the docs expand, and the execute tool covers more capabilities. Agents using the generated MCP server observe increasing capability over time. The value compounds with every API update — a clear differentiator.

Rubric: v1-2026-04. Scores reflect the company's state on the audit date and may have improved since.