Overview

The Kimchi console gives you a centralized view of how your organization uses AI models — who's making requests, what it costs, where the spend is going, and whether you're staying within budget. These reporting pages are designed for team leads, engineering managers, and anyone responsible for AI adoption and cost management.

What data feeds into reporting

Two types of activity appear in the console:

  • API requests — every request sent through Kimchi's serverless endpoints (llm.kimchi.dev) is tracked automatically. No setup required. Token counts, model usage, latency, and cost are recorded for each call.
  • (Coming soon!) Coding tool telemetry — usage from AI coding tools like Claude Code and OpenCode requires OpenTelemetry (OTel) configuration. Once configured, session-level metrics (tokens, cost, tool calls) flow into the console alongside API data. See Report coding tool metrics to Kimchi for setup instructions.

Pages

The console organizes these across five pages, each serving a different question:

PageWhat it answersPrimary audience
Adoption reportIs AI usage growing? Where is the spend going?Engineering leadership, finance
UsageWho is using what, and how much does it cost?Team leads, engineering managers
AnalyticsHow are our API calls performing? Are there errors or latency issues?Platform engineers, team leads
TeamsHow are users organized, and who belongs to which team?Org admins, team leads
BudgetsAre we staying within spending limits?Finance, engineering managers

Adoption report

The Adoption report tracks organization-wide trends: active user count, total cost, request volume, and how those numbers evolve over time. It breaks down spend and usage by provider and by harness (Kimchi, Claude Code, OpenCode), and ranks models by cost. Use it to justify AI investment, spot adoption plateaus, or identify which tools and models drive the most value.

Usage

The Usage page shows who is consuming what. Switch between a by team view and a by user view. The team view shows aggregated metrics per team — requests, tokens, cost, and which models each team favors. The user view drills into individual developers, showing their team affiliation, model preferences, and personal cost footprint. Live Mode keeps the data current as requests flow in.

Analytics

The Analytics page is the operational dashboard. It tracks total cost, API call volume, average request duration, and error rate — with trend indicators showing whether each metric is improving or degrading. Dig into usage by model, reliability metrics, and cost breakdowns. Filter by model, API key, tags, and time range. Use it to catch latency regressions, identify expensive models, and correlate errors with deployments.

Teams

The Teams page lets you organize users into named groups. Teams can be created manually or attributed automatically based on how users are provisioned. Team membership flows into Usage and Analytics filtering — once users belong to teams, you can slice all reporting data by team.

Budgets

The Budgets page lets you set spending limits at multiple levels — organization-wide per-user caps (hard and soft), per-API-key limits, and per-team limits (pooled or per-user). Each budget tracks total spend against its limit with a progress bar and status indicator. Use budgets to control costs from org-wide guardrails down to individual developer allowances.

Related resources