Background
When working with LLMs, context length is a critical resource. Users currently have no visibility into how much context has been consumed or when the next context compression will be triggered. Surfacing this information in the Web UI helps users better understand the agent's memory state and anticipate compression events before they occur.
Changes
- Display the current context length (e.g., token count) in the Web UI
- Display the threshold at which context compression will be triggered
- Update the display in real time as the conversation progresses
Background
When working with LLMs, context length is a critical resource. Users currently have no visibility into how much context has been consumed or when the next context compression will be triggered. Surfacing this information in the Web UI helps users better understand the agent's memory state and anticipate compression events before they occur.
Changes