Argovaa — Token Cost of Ownership (TCO)

Cloud cost intelligence

TOKEN COST OF OWNERSHIP

Instrument your service, pull AWS cost data, add LLM token spend, and get the true total cost per request — so you know exactly what each AI call actually costs your business.

Infrastructure cost

AWS EC2, ECS, Lambda, networking from CUR

LLM token cost

OpenAI, Anthropic, Azure input/output tokens

Service overhead

Latency, retries, storage, observability

=

TRUE COST PER REQUEST

Step 1 — Instrument your service

ADD THE ARGOVAA SDK TO YOUR SERVICE

Copy one of the snippets below into your service. The SDK tracks token usage, latency, and request metadata, and sends them to Argovaa for cost blending.

Python: pip install argovaa-sdk

import argovaa
from openai import OpenAI

# Initialise with your Argovaa API key
argovaa.init(
  api_key="arg_live_xxxxxxxxxxxx",
  service_name="my-ai-service",
  environment="production"
)

# Wrap your existing LLM client
client = argovaa.wrap(OpenAI())

# Use exactly as before — tracking is automatic
response = client.chat.completions.create(
  model="gpt-4o",
  messages=[{"role": "user", "content": prompt}],
  # Argovaa captures: tokens, latency, cost, model
)

# Optionally tag requests for cost attribution
with argovaa.trace(
  user_id=user_id,
  feature="document-summary",
  tenant=tenant_id       # for multi-tenant billing
):
  result = client.chat.completions.create(...)

Node.js: npm install @argovaa/sdk

import { Argovaa } from '@argovaa/sdk';
import OpenAI from 'openai';

// Initialise Argovaa
const argovaa = new Argovaa({
  apiKey: 'arg_live_xxxxxxxxxxxx',
  serviceName: 'my-ai-service',
  environment: 'production',
});

// Wrap your OpenAI client
const openai = argovaa.wrap(new OpenAI());

// Tracking is fully automatic from here
const response = await openai.chat.completions.create({
  model: 'gpt-4o',
  messages: [{ role: 'user', content: prompt }],
});

// Tag for multi-tenant cost attribution
await argovaa.trace({
  userId,
  feature: 'document-summary',
  tenant: tenantId,
  customCost: 0.002, // optional fixed overhead
}, async () => {
  // Request options elided; pass the same shape as the call above
  return openai.chat.completions.create({ /* ... */ });
});

What gets tracked

Per-request, zero-config

Metric	Source
Input tokens	LLM API
Output tokens	LLM API
Model name	LLM API
Token cost ($)	Calculated
Latency (ms)	SDK
Error rate	SDK
User / tenant ID	SDK
Feature / endpoint	SDK
AWS infra cost	CUR upload
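
Tagging each request with a user, tenant, and feature is what makes cost attribution possible later. The sketch below shows the idea on a handful of per-request records. The record layout and the field names (`tenant`, `feature`, `token_cost_usd`) are hypothetical and chosen only for illustration; they are not the SDK's documented export schema, and the figures are made up.

```python
from collections import defaultdict


def cost_by_tag(records, tag):
    """Sum per-request token cost for each value of `tag` (e.g. "tenant")."""
    totals = defaultdict(float)
    for record in records:
        # Requests without the tag are grouped under "untagged".
        totals[record.get(tag, "untagged")] += record["token_cost_usd"]
    return dict(totals)


# Illustrative records, not real data.
records = [
    {"tenant": "acme", "feature": "document-summary", "token_cost_usd": 0.004},
    {"tenant": "acme", "feature": "chat", "token_cost_usd": 0.001},
    {"tenant": "globex", "feature": "document-summary", "token_cost_usd": 0.002},
]

per_tenant = cost_by_tag(records, "tenant")
per_feature = cost_by_tag(records, "feature")
```

The same function groups by any tag, so one pass over the records can answer both "what does each tenant cost" and "what does each feature cost".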

Your Argovaa API key

arg_live_●●●●●●●●●●●●

Use this key to initialise the SDK. Keep it secret and never commit it to version control. If it is compromised, rotate it from Settings.
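
One common way to keep the key out of version control is to read it from an environment variable at startup. This is a minimal sketch, assuming a variable named `ARGOVAA_API_KEY`; that name and the helper `load_argovaa_api_key` are hypothetical and not something the SDK is documented to read or provide.

```python
import os


def load_argovaa_api_key(env=os.environ, var_name="ARGOVAA_API_KEY"):
    """Return the API key from the environment, or raise if it is missing."""
    key = env.get(var_name, "").strip()
    if not key:
        # Fail fast so a missing key is noticed at startup, not at first request.
        raise RuntimeError(f"{var_name} is not set")
    return key
```

The returned value can then be passed as `api_key` to `argovaa.init(...)` in place of the literal string shown in the Step 1 snippet.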

Step 2 — AWS cloud cost

PULL YOUR AWS COST DATA

Upload your AWS Cost and Usage Report (CUR) CSV file. We extract EC2, ECS, Lambda, and networking costs and blend them with your LLM token usage.

Upload AWS CUR file

CSV format · monthly export

Drop your CUR CSV here

or click to browse · .csv or .csv.gz supported

Or enter manually

For quick estimates

EC2 monthly cost ($)

ECS / Fargate ($)

Lambda ($)

Networking / Data transfer ($)

RDS / Database ($)

S3 / Storage ($)

Total monthly requests served

Used to calculate per-request infra cost
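
For the manual path, the per-request infrastructure figure is simply the sum of the monthly line items divided by the monthly request count. A minimal sketch follows; the function name and dictionary keys are illustrative, and the exact calculation Argovaa performs may differ.

```python
def infra_cost_per_request(monthly_costs, monthly_requests):
    """Divide the total monthly infrastructure spend by requests served."""
    if monthly_requests <= 0:
        raise ValueError("monthly_requests must be positive")
    return sum(monthly_costs.values()) / monthly_requests


# Illustrative figures, not real data: $1,500 of infra over 1.5M requests.
example = infra_cost_per_request(
    {"ec2": 1200.0, "lambda": 300.0},
    monthly_requests=1_500_000,
)
```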

How to export AWS CUR

1. Go to AWS Console → Billing → Cost & Usage Reports.
2. Click Create report, then enable Resource IDs and CSV format.
3. Set an S3 bucket for delivery, then wait for the first export (up to 24h for new reports).
4. Download the CSV from S3 and upload it here. Argovaa parses EC2, ECS, Lambda, and networking automatically.
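
To sanity-check an export before uploading it, the per-service totals can be approximated locally. The sketch below sums `lineItem/UnblendedCost` per `lineItem/ProductCode`, two columns found in CSV-format CUR exports. The mapping from product codes to the four layers is an assumption made for illustration (networking charges in particular often appear as data-transfer usage types under other products), so this is not a description of how Argovaa itself parses the file.

```python
import csv
import gzip
import io
from collections import defaultdict

# Assumed mapping from CUR product codes to the layers named on this page.
TRACKED_PRODUCTS = {
    "AmazonEC2": "EC2",
    "AmazonECS": "ECS",
    "AWSLambda": "Lambda",
    "AWSDataTransfer": "Networking",
}


def sum_cur_costs(rows):
    """Sum unblended cost per tracked layer from an iterable of CUR row dicts."""
    totals = defaultdict(float)
    for row in rows:
        label = TRACKED_PRODUCTS.get(row.get("lineItem/ProductCode", ""))
        if label is None:
            continue  # Skip services outside the tracked layers.
        totals[label] += float(row.get("lineItem/UnblendedCost") or 0.0)
    return dict(totals)


def read_cur(path):
    """Read a .csv or .csv.gz CUR file and return per-layer totals."""
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rt", newline="") as handle:
        return sum_cur_costs(csv.DictReader(handle))
```

`read_cur` handles both `.csv` and `.csv.gz`, matching the two upload formats listed above.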

AWS cost breakdown

Step 3 — LLM token cost

ENTER YOUR LLM TOKEN USAGE

Enter token usage from your LLM provider dashboard, or let the SDK populate it automatically once your service is instrumented.

LLM usage & pricing

Monthly totals

Provider & model

Input price ($/1M tokens)

Output price ($/1M tokens)

Monthly input tokens

Monthly output tokens

Avg input tokens / request

Avg output tokens / request

Pricing tier

LLM cost summary

Monthly input cost

$—

Monthly output cost

$—

Total LLM monthly

$—

LLM cost / request

$—
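
The four summary figures follow directly from the monthly token counts and the per-million-token prices entered above. A minimal sketch, with illustrative prices rather than any provider's actual rates; it assumes the per-request figure is the monthly total divided by the monthly request count from Step 2, which may not be exactly how the page derives it.

```python
def llm_cost_summary(input_tokens, output_tokens,
                     input_price_per_1m, output_price_per_1m,
                     monthly_requests):
    """Compute monthly input, output, and total LLM cost, plus cost per request."""
    if monthly_requests <= 0:
        raise ValueError("monthly_requests must be positive")
    input_cost = input_tokens / 1_000_000 * input_price_per_1m
    output_cost = output_tokens / 1_000_000 * output_price_per_1m
    total = input_cost + output_cost
    return {
        "monthly_input_cost": input_cost,
        "monthly_output_cost": output_cost,
        "total_llm_monthly": total,
        "llm_cost_per_request": total / monthly_requests,
    }


# Illustrative inputs: 200M input and 50M output tokens over 1M requests.
summary = llm_cost_summary(
    input_tokens=200_000_000,
    output_tokens=50_000_000,
    input_price_per_1m=2.50,
    output_price_per_1m=10.00,
    monthly_requests=1_000_000,
)
```

Output tokens are typically priced higher than input tokens, so a small output volume can account for as much spend as a much larger input volume, as it does in this example.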

Step 4 — Total cost of ownership

YOUR TRUE COST PER REQUEST

All three cost layers blended into a single TCO view — infrastructure, LLM tokens, and overhead per request.

Infrastructure / req

$—

AWS cloud cost

LLM tokens / req

$—

Input + output tokens

Overhead / req

$—

Storage, observability

Total TCO / request

$—

All layers combined
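
The blend behind these figures can be sketched as a sum of the three monthly layers divided by the request count, with each layer's share of the total reported alongside. The function name, layer labels, and output keys below are illustrative assumptions, and the monthly amounts are made up.

```python
def blend_tco(monthly_by_layer, monthly_requests):
    """Return per-layer rows and the total cost per request."""
    if monthly_requests <= 0:
        raise ValueError("monthly_requests must be positive")
    total_monthly = sum(monthly_by_layer.values())
    rows = []
    for layer, monthly in monthly_by_layer.items():
        # Share of the blended total; 0 when there is no spend at all.
        share = 100.0 * monthly / total_monthly if total_monthly else 0.0
        rows.append({
            "layer": layer,
            "monthly_usd": monthly,
            "per_request_usd": monthly / monthly_requests,
            "pct_of_tco": share,
        })
    return rows, total_monthly / monthly_requests


# Illustrative monthly figures, not real data.
rows, total_per_request = blend_tco(
    {"Infrastructure": 1500.0, "LLM tokens": 1000.0, "Overhead": 500.0},
    monthly_requests=1_000_000,
)
```

Each row carries the same columns as the attribution table further down (monthly cost, per-request cost, and percentage of TCO), so the per-request values sum to the total.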

Cost breakdown

Monthly TCO projection

Full cost attribution table

Per-layer breakdown with optimisation recommendations

Cost layer	Monthly ($)	Per request ($)	% of TCO	Status

Optimisation recommendations

AI-generated savings opportunities

Complete steps 1–3 to generate recommendations