All 20 Criteria
P1-A Structured Data — 0/5
No evidence of schema.org markup, JSON-LD, or Organization/Offer schema on agentops.ai. Homepage is a standard React/Next.js marketing site with no detectable structured data in search result snippets or web search structured data queries.
P1-B Machine-Readable Pricing — 1/5
Pricing tiers (Free: ~50K events/mo, Pro: ~$40–49/mo, Enterprise: custom) are mentioned on the homepage, but the dedicated /pricing page returns a 404 error. No schema.org Offer markup or clean JSON pricing structure detected. Pricing numbers inconsistent across sources (5K vs 50K events on free tier).
P1-C llms.txt / Agent Layer — 4/5
A docs index file exists at `docs.agentops.ai/llms.txt`, which is explicitly referenced in their documentation as a structured index for LLM/agent discovery of available pages and integrations. This is a meaningful agent identity layer, though it is docs-scoped rather than a product-level llms.txt at the root domain.
P1-D API / MCP Availability — 3/5
Python SDK (`pip install agentops`) and TypeScript/JavaScript SDK available. MCP Docs Server is available for IDE integration (`npx mint-mcp add agentops`). Supports 25+ agent frameworks and 400+ LLMs. However, no OpenAPI spec was found; the API is SDK-first rather than REST-first, limiting direct agent programmatic consumption.
P1-E Discoverability (GEO) — 3/5
Well-represented in AI developer directories (aiagentslist.com, aimultiple.com, etc.), ~4,000 GitHub stars, developer-focused blog, and strong integration with frameworks that AI agents are built on. Some SEO-quality content present, but no explicit AI retrieval optimization (no structured content for LLM answer engines).
P2-A Offer Completeness — 2/5
The "what" (agent observability/monitoring) and "how much" (tier pricing) are findable on the homepage, but the dedicated pricing page is broken (404). Pricing numbers are inconsistent across sources, making machine-parseable verification impossible without cross-referencing third-party review sites.
P2-B Scope & Limits — 2/5
Event limits per tier are mentioned in marketing copy (Free: ~50K events/mo, Pro: ~500K events/mo), but these are stated in prose, not in a structured format. No explicit rate limits, API call caps, or SLA caps documented in discoverable form.
P2-C Substitution & Fallback Rules — 0/5
No guidance found on what happens if the service is unavailable, no SLA fallback, no downtime compensation policy. No FAQ or documentation addressing service substitution or interruption.
P2-D Conditional Logic Transparency — 1/5
Enterprise pricing requires "contact sales" with no disclosed conditions, minimums, or feature gates. Pro vs Free distinction exists but conditions for Enterprise access are fully opaque.
P2-E Semantic Precision — 2/5
Technical language is often precise ("Time Travel Debugging," "point-in-time replay," "token counting") but marketing copy includes vague claims like "leading developer platform" and "best-in-class reliability." Pricing inconsistency between sources (5K vs 50K free tier events) undermines semantic precision.
P3-A Verifiable Performance Data — 1/5
No public status page accessible (status.agentops.ai returned connection refused). No third-party verified uptime data found. Enterprise logos cited (Microsoft, Google, Samsung, Meta, J&J, Accenture, Deloitte) but not independently verifiable via case studies or third-party reviews. G2 and Trustpilot listings not confirmed.
P3-B Scoped Permission Model — 2/5
API key-based authentication with tier-based access (Free/Pro/Enterprise). No agent-scoped permissions found — no time-bounded, amount-bounded, or action-bounded access controls designed for autonomous agent evaluation and use.
P3-C Audit Trail / Transaction Log — 1/5
Ironically, audit trail generation is AgentOps' core product feature — it creates audit trails FOR agents being monitored. However, there is no machine-accessible audit log for agents interacting WITH the AgentOps platform itself (i.e., no API endpoint logging agent API calls made to AgentOps).
P3-D Behavioral Consistency Signals — 0/5
No versioned terms of service found (/terms returned 404). No change notice policy. No documented stability track record or changelog for platform behavior. Open-source GitHub repo shows version history for the SDK, but the cloud platform has no visible versioning or change policy.
P4-A Friction-Free Activation — 4/5
Activation is genuinely low-friction: `pip install agentops`, get an API key from the dashboard, add 2 lines of code. No sales call required for Free or Pro tiers. Sign-up appears to be self-serve. Loses 1 point because Enterprise tier requires human contact.
P4-B Agent Decision Signals — 2/5
Free tier provides a programmatic trial signal, and cost tracking is a core platform feature (agents can observe their own costs via AgentOps). However, there are no explicit agent-legible signals for when to upgrade, what triggers tier changes, or programmatic cost-threshold notifications usable by an autonomous agent making purchasing decisions.
P5-A Integration Depth / Switching Cost — 3/5
Deep integration with 25+ agent frameworks (CrewAI, LangChain, AutoGen, OpenAI Agents, Google ADK, etc.) creates meaningful switching friction. Historical session data, traces, and fine-tuning datasets accumulate over time and are not portable. However, AgentOps monitors agents rather than storing their operational data — agents could switch monitoring providers with relatively low operational disruption.
P5-B Agent Memory / Personalization Layer — 0/5
AgentOps is a monitoring and observability platform, not a memory platform. No agent-accessible memory layer exists. Each session starts from scratch from the agent's perspective. Session data exists for human developers to review, not for agents to consume.
P5-C Programmatic Renewal Signals — 0/5
No evidence of an agent-accessible renewal API, programmatic billing management, or automated tier upgrade mechanism. Standard SaaS billing through a human-facing dashboard.
P5-D Compounding Value Signal — 1/5
Historical session data and the ability to fine-tune specialized LLMs on saved completions represent latent compounding value. However, this value is accessible only to human developers, not readable by autonomous agents. No agent-facing signals of accumulated value over time.
Rubric v1 (April 2026). Scores reflect the company's state on the audit date and may have improved since.