Tag

#llm-ops

3 posts tagged llm-ops.

ops

LLM Cost & Latency Observability with OpenTelemetry

Token spend and tail latency are the two metrics that decide whether an LLM feature ships or gets killed. How to instrument both with OpenTelemetry so you can answer 'why did this cost double?' in a query, not a war room.
May 22, 2026
ops

Closing the Eval-Prod Gap: Online Evaluation as Observability

Offline eval scores are green and production is worse. The gap is not a measurement error; it is structural. Here is how to instrument online evaluation so production quality becomes observable.
May 9, 2026
ops

End-to-End Tracing for LLM Applications: What Belongs in a Span

Production LLM apps span multiple model calls, tool invocations, retrieval steps, and retries. A complete trace makes them debuggable; a sparse one leaves you guessing.
May 6, 2026