Skip to content

Document time-to-first-token surfaces on the LLMs page#2050

Draft
dmontagu wants to merge 3 commits into
mainfrom
dm/ttft-docs
Draft

Document time-to-first-token surfaces on the LLMs page#2050
dmontagu wants to merge 3 commits into
mainfrom
dm/ttft-docs

Conversation

@dmontagu

@dmontagu dmontagu commented Jul 3, 2026

Copy link
Copy Markdown
Contributor

Summary

Documents the new time-to-first-token surfaces shipping with pydantic/platform#23856 (UI) and pydantic/pydantic-ai#5967 (SDK emission) in docs/guides/web-ui/llms.md:

  • LLM detail page: headline cards updated five → six (adds First token), trend-chart list gains Time to first token, plus a short paragraph on what the card/chart measure, the streaming-only behavior and empty state, and the matching Time to first token (streaming) section in the built-in LLM Tokens and Costs dashboards.
  • Agent run detail page: the Time to first token (median/p90) chart added to the charts list.
  • What drives each column table: new row for gen_ai.client.operation.time_to_first_chunk (seconds) with the gen_ai.server.time_to_first_token fallback — matches the model_calls materialized view's coalesce exactly.
  • Troubleshooting table: new row for an empty Time to first token chart (streaming-only; requires an instrumentation that records the attribute).

Merge timing

Should merge around when platform#23856 ships, since it describes the six-card layout that PR introduces.

https://claude.ai/code/session_01NbiZJHnx65j4ibTDe6xr1T

Review in cubic

Covers the First token headline card, the Time to first token charts on the
LLM and agent detail pages, the driving attributes
(gen_ai.client.operation.time_to_first_chunk with
gen_ai.server.time_to_first_token fallback), and a troubleshooting row for
empty charts. Ships with pydantic/platform#23856.

Claude-Session: https://claude.ai/code/session_01NbiZJHnx65j4ibTDe6xr1T
@dmontagu dmontagu self-assigned this Jul 3, 2026
dmontagu added 2 commits July 2, 2026 21:44
Gives the TTFT paragraph its own heading (a stable #time-to-first-token
anchor the product links to) and adds a warning admonition explaining the
underlying OTel GenAI convention is at Development stability.

Claude-Session: https://claude.ai/code/session_01NbiZJHnx65j4ibTDe6xr1T
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant