AGENT NATIVE OFFERS

← Leaderboard

Stainless Human-Dependent

AUDIE Score: 54/100 · Audited 2026-04-13 · Website: https://stainless.com · Machine-readable: JSON

Pillar Scores

P1 Signal Architecture — 12/25
P2 Clarity Stack — 15/25
P3 Trust Envelope — 9/20
P4 Velocity Triggers — 8/10
P5 Gravity Design — 10/20

Executive Summary

Stainless scores 54/100 (Human-Dependent) — a score that understates its actual strategic position in the agent ecosystem. The company builds the infrastructure that makes APIs agent-accessible for OpenAI, Anthropic, Google, and Cloudflare, yet its own offer infrastructure is entirely human-configured: no llms.txt, no structured data, no native log API. The platform's gravity is exceptional (auto-regenerating SDKs and MCP servers create some of the highest switching costs audited to date), and velocity is strong (free tier, no friction, quantified performance signals). The gap is entirely in Signal Architecture and Trust infrastructure — the tools an agent needs to independently discover, evaluate, and manage Stainless. The top recommendation is publishing llms.txt immediately: a one-hour fix that would raise the score by 5 points and correct the most ironic gap in the dataset — an AI infrastructure company that hasn't told AI where to find it.

Strongest Signals

Critical Gaps

Priority Actions

    All 20 Criteria

    P1-A Structured Data — 2/5
    robots.txt allows all crawlers and two sitemaps are present (sitemap.xml + docs/sitemap-index.xml), signaling crawlability. However, direct search for schema.org/JSON-LD on stainless.com returned no results. Homepage is React-rendered marketing with no detectable rich structured data markup.
    P1-B Machine-Readable Pricing — 3/5
    Pricing page at stainless.com/pricing has highly granular HTML-structured pricing: $79/$499/Enterprise per generator per month, with endpoint count limits (50/100/100+), preview build limits (300/1000/custom), and explicit overage rates ($1/1000 builds; $5/mo per endpoint per SDK). Not encoded in schema.org/Offer format.
    P1-C llms.txt / Agent Layer — 0/5
    No /llms.txt file found at stainless.com/llms.txt. The URL returned information about the company but no structured agent-facing content file. No evidence of any equivalent agent identity layer.
    P1-D API / MCP Availability — 4/5
    The product IS an MCP server generator. Outputs: NPM packages, Cloudflare Workers deployments, Docker containers, self-hosted options. Two-tool architecture (search_docs + execute) confirmed. Multiple deployment paths. Score capped at 4 because Stainless's own management API (for project creation, spec updates) is not publicly documented.
    P1-E Discoverability (GEO) — 3/5
    Two sitemaps, open crawling, well-structured docs at app.stainless.com. Named as infrastructure for Anthropic, OpenAI, Google, and Cloudflare — strong LLM ecosystem presence. Blog posts explicitly addressing agent experience. No llms.txt and no explicit AI retrieval optimization.
    P2-A Offer Completeness — 4/5
    What (SDK + docs + MCP server generation from OpenAPI specs), who (API companies, developer teams), how much (clear tier pricing with per-generator monthly cost). All findable on a single /pricing page with feature breakdowns by tier.
    P2-B Scope & Limits — 4/5
    Explicit, specific limits documented: 5 generators (Free), 25–50 endpoints per generator, 100/300/1,000 preview builds/month per tier, $1 per 1,000 overage builds, $5/mo per additional endpoint per SDK at Enterprise. All quantified and published.
    P2-C Substitution Rules — 0/5
    No documentation on what happens when the service is unavailable, when code execution sandbox fails, or when generated MCP servers degrade. No fallback routing, graceful degradation policy, or alternative guidance found anywhere.
    P2-D Conditional Logic — 3/5
    Most conditions are disclosed on the pricing page: tier gates for webhooks, OAuth 2.0, dedicated support, and AI chat clearly tied to plan level. Enterprise requires custom pricing (disclosed). No opaque "contact sales" wall on standard tiers.
    P2-E Semantic Precision — 4/5
    Strong use of specific metrics: "94–97% task accuracy," "3x fewer tool calls," "100k+ tokens saved on complex tasks." Named enterprise customers (Anthropic, OpenAI, Google, Cloudflare, Modern Treasury). Exact pricing numbers throughout. Minimal vague superlatives.
    P3-A Verifiable Performance — 2/5
    Self-reported performance benchmarks (94–97% accuracy, 3x tool call reduction) on the MCP product page. Notable customer logos (Anthropic, OpenAI) serve as implicit verification. No public status page found, no third-party G2/Trustpilot reviews indexed, no SLA document located. $25M Series A (Dec 2024) signals institutional credibility but not operational reliability.
    P3-B Scoped Permissions — 2/5
    Authentication via environment variable API keys — effectively all-or-nothing access per key. Code execution runs in Cloudflare Workers sandbox (security benefit), but no granular agent-specific permission model. No time-bounded, amount-bounded, or action-bounded scoping documented for agent operations.
    P3-C Audit Trail — 3/5
    "Real-Time MCP Monitoring and Logging" product confirmed (dedicated page on stainless.com/mcp/). Structured telemetry per tool call: timestamps, tool names, parameters, response times, success/error, request IDs. Logs forward to external observability platforms (Splunk, Azure Monitor, Tinybird). Not natively accessible through a Stainless API — agents must query through external platforms.
    P3-D Behavioral Consistency — 2/5
    Company founded by Stripe's SDK infrastructure team; pedigree suggests stability. Auto-regeneration on OpenAPI spec changes implies disciplined versioning of output artifacts. But no formal API versioning contract, no documented change notice period, no SLA, no version-controlled terms found.
    P4-A Friction-Free Activation — 4/5
    Free tier with no credit card required. Self-serve signup. MCP server generation begins immediately from the Free tier. No approval workflow or human gate in the documented flow. Some multi-step configuration (create project, add SDKs, configure MCP, set environment variables) prevents a 5/5.
    P4-B Agent Decision Signals — 4/5
    Free tier provides zero-commitment entry. Quantified performance metrics (94–97% accuracy, 3x fewer tool calls, 100k+ token savings) are explicit, agent-legible signals for whether to integrate. Clear pricing ladder enables upgrade decision without human involvement.
    P5-A Integration Depth — 4/5
    SDKs embed in customer codebases. MCP server integrates into agent workflows and IDE configurations. Documentation deploys to customer-owned domains. Auto-regeneration ties Stainless deeply into the API development lifecycle — every spec update flows through Stainless. High switching cost: migrating away means rebuilding SDKs, docs, and MCP servers from scratch.
    P5-B Agent Memory Layer — 2/5
    MCP server is customized per-project via OpenAPI spec, creating implicit personalization. The docs search tool queries project-specific documentation. But no persistent agent memory layer — no API for agents to read/write contextual state between sessions. Every interaction queries static docs fresh.
    P5-C Programmatic Renewal — 1/5
    Standard subscription billing only. No agent-accessible endpoint to check subscription status, remaining credits, or trigger renewal. No evidence of any programmatic subscription management API.
    P5-D Compounding Value — 3/5
    Auto-regeneration on spec changes is a genuine compounding value signal visible to agents: as the API grows, the MCP server gains new tools, the docs expand, and the execute tool covers more capabilities. Agents using the generated MCP server observe increasing capability over time. The value compounds with every API update — a clear differentiator.

    Rubric v1 (April 2026). Scores reflect the company's state on the audit date and may have improved since.