
feat: add Bedrock prompt cache TTL config #12875

Open
SharpLu wants to merge 1 commit into danny-avila:main from SharpLu:feat/bedrock-cache-ttl

Conversation

Contributor

SharpLu commented Apr 29, 2026

Summary

Bedrock prompt caching now supports both 5-minute and 1-hour cache checkpoint TTLs. This PR adds an optional endpoints.bedrock.promptCacheTtl YAML config so LibreChat admins can choose 5m or 1h; when the setting is omitted, LibreChat does not send a TTL and Bedrock keeps the existing 5-minute default behavior.

This is useful for longer Bedrock/Claude sessions where users reuse large system prompts, tools, or reference context across turns that may be more than five minutes apart. The 1-hour TTL lets supported Bedrock Claude 4.5 models retain those cache checkpoints longer when the admin explicitly opts in.

Changes

  • Add Bedrock endpoint config validation for promptCacheTtl: "5m" | "1h"
  • Pass promptCacheTtl from librechat.yaml into Bedrock llmConfig
  • Preserve 1h only for Claude 4.5 Bedrock model IDs, and strip stale/unsupported TTL values so unsupported models fall back to Bedrock's default 5-minute behavior
  • Preserve explicit 5m for prompt-cache-supported Claude/Nova models
  • Strip TTL when prompt caching is disabled or when the selected model does not support Bedrock prompt caching
  • Add conversation/preset schema support for the new value
  • Document the YAML option in librechat.example.yaml and the Helm configYamlContent example
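The TTL normalization rules above can be sketched roughly as follows. This is a minimal illustration, not the PR's actual code: the function and helper names, and the model-ID regexes, are assumptions standing in for the real matchers in `packages/data-provider/src/bedrock.ts`.

```typescript
type PromptCacheTtl = '5m' | '1h';

// Illustrative model-ID checks (assumed, not the PR's exact matchers):
// only Claude 4.5 Bedrock model IDs support the 1h checkpoint TTL,
// while the Claude and Nova families support prompt caching in general.
const supportsOneHourTtl = (modelId: string): boolean =>
  /claude-(?:sonnet|opus|haiku)-4-5/.test(modelId);

const supportsPromptCache = (modelId: string): boolean =>
  /claude|nova/.test(modelId);

function normalizePromptCacheTtl(
  modelId: string,
  promptCache: boolean,
  ttl?: PromptCacheTtl,
): PromptCacheTtl | undefined {
  // Strip the TTL when caching is off, unset, or the model lacks prompt-cache
  // support, so Bedrock falls back to its default 5-minute behavior.
  if (!promptCache || ttl === undefined || !supportsPromptCache(modelId)) {
    return undefined;
  }
  // Preserve 1h only for Claude 4.5 model IDs; strip it elsewhere.
  if (ttl === '1h' && !supportsOneHourTtl(modelId)) {
    return undefined;
  }
  // An explicit 5m is preserved for any prompt-cache-supported model.
  return ttl;
}
```

Returning `undefined` rather than coercing to `'5m'` matters here: omitting the TTL leaves Bedrock on its default behavior, which is what the PR promises when the setting is absent.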

Deployment configuration

Docker/Compose users can set this in the mounted librechat.yaml file. Helm users can set the same YAML under librechat.configYamlContent, or provide an existing ConfigMap through librechat.existingConfigYaml with a librechat.yaml key.

endpoints:
  bedrock:
    models:
      - "anthropic.claude-sonnet-4-5-20250929-v1:0"
    promptCacheTtl: "1h"
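For Helm, the same block can be embedded via the chart value mentioned above. A sketch of a values file, assuming the chart's `librechat.configYamlContent` takes the raw `librechat.yaml` content as a multiline string:

```yaml
# values.yaml (sketch): embed librechat.yaml content through the chart value
librechat:
  configYamlContent: |
    endpoints:
      bedrock:
        models:
          - "anthropic.claude-sonnet-4-5-20250929-v1:0"
        promptCacheTtl: "1h"
```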

Testing

  • cd packages/data-provider && npx jest specs/bedrock.spec.ts src/config.spec.ts --runInBand --coverage=false
  • npm run build:data-provider
  • cd packages/api && npx jest src/endpoints/bedrock/initialize.spec.ts --runInBand --coverage=false
  • npm run build:data-schemas
  • helm lint helm/librechat
  • helm template bedrock-test helm/librechat -f <values-with-bedrock-promptCacheTtl.yaml>
  • docker compose -f docker-compose.yml -f docker-compose.override.yml config
  • docker compose -f deploy-compose.yml config
  • git diff --check

Copilot AI review requested due to automatic review settings (April 29, 2026 10:17)
@SharpLu force-pushed the feat/bedrock-cache-ttl branch from 44d18ec to 96286dd on April 29, 2026 10:19

Copilot AI left a comment


Pull request overview

Adds an optional Bedrock prompt-cache checkpoint TTL configuration (5m or 1h) that can be set via endpoints.bedrock.promptCacheTtl, validates it, threads it through Bedrock initialization, and ensures stale TTL values are stripped when prompt caching is disabled/unsupported.

Changes:

  • Extend conversation/preset types + schemas to allow promptCacheTtl: '5m' | '1h'.
  • Add Bedrock endpoint config validation for promptCacheTtl and pass it into Bedrock llmConfig.
  • Normalize Bedrock prompt-cache options to preserve TTL only for Claude/Nova models and strip TTL when promptCache is false/unsupported; add tests + document YAML option.
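The endpoint-config validation in that list amounts to accepting only the two supported TTL strings. A hedged sketch in plain TypeScript (the project actually uses a zod enum in `packages/data-provider/src/config.ts`; the function name here is illustrative):

```typescript
type PromptCacheTtl = '5m' | '1h';

// Illustrative validator: accepts only the two supported TTL strings.
// The real project expresses this as z.enum(['5m', '1h']).optional().
function parsePromptCacheTtl(value: unknown): PromptCacheTtl | undefined {
  if (value === undefined) {
    // Omitted -> no TTL sent, Bedrock keeps its default 5-minute behavior.
    return undefined;
  }
  if (value === '5m' || value === '1h') {
    return value;
  }
  throw new Error(
    `Invalid promptCacheTtl: ${JSON.stringify(value)}; expected "5m" or "1h"`,
  );
}
```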

Reviewed changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 3 comments.

File-by-file summary:

  • packages/data-schemas/src/types/convo.ts: Adds promptCacheTtl to conversation TS typing.
  • packages/data-schemas/src/schema/preset.ts: Adds promptCacheTtl to preset TS typing.
  • packages/data-schemas/src/schema/defaults.ts: Adds Mongoose schema enum for promptCacheTtl.
  • packages/data-provider/src/types.ts: Allows promptCacheTtl in endpoint option typing surface.
  • packages/data-provider/src/schemas.ts: Accepts promptCacheTtl in conversation/query validation schemas.
  • packages/data-provider/src/config.ts: Validates Bedrock endpoint config promptCacheTtl (5m/1h).
  • packages/data-provider/src/config.spec.ts: Tests Bedrock endpoint schema accepts/rejects TTL values.
  • packages/data-provider/src/bedrock.ts: Normalizes prompt cache + TTL behavior for supported models and strips stale TTL.
  • packages/data-provider/specs/bedrock.spec.ts: Adds tests for preserving/stripping promptCacheTtl behavior.
  • packages/api/src/types/bedrock.ts: Adds BedrockPromptCacheTtl type + config fields.
  • packages/api/src/endpoints/bedrock/initialize.ts: Threads endpoint-config TTL into Bedrock request/llmConfig.
  • packages/api/src/endpoints/bedrock/initialize.spec.ts: Tests promptCacheTtl is present only when configured.
  • librechat.example.yaml: Documents the new endpoints.bedrock.promptCacheTtl option.


Comment on lines 775 to +776
promptCache: z.boolean().optional(),
promptCacheTtl: z.enum(['5m', '1h']).optional(),

Copilot AI Apr 29, 2026


promptCacheTtl is documented as part of the /* Anthropic */ block, but this TTL is Bedrock-specific (per PR description and usage in bedrockInputParser). This comment/section placement is misleading; please either move promptCacheTtl to the AWS Bedrock section of the schema or update the comment to reflect Bedrock usage.

Suggested change:
  - promptCache: z.boolean().optional(),
  - promptCacheTtl: z.enum(['5m', '1h']).optional(),
  + promptCache: z.boolean().optional(),
  + /* AWS Bedrock */
  + promptCacheTtl: z.enum(['5m', '1h']).optional(),
  + /* Anthropic */

Comment on lines 51 to +53
// Anthropic-specific
| 'promptCache'
| 'promptCacheTtl'

Copilot AI Apr 29, 2026


promptCacheTtl is listed under the // Anthropic-specific grouping, but it's Bedrock-only (whereas promptCache applies to both Anthropic and Bedrock). Please adjust the grouping/comment so future readers don’t assume this option is supported by the Anthropic endpoint.

Suggested change:
  - // Anthropic-specific
  - | 'promptCache'
  - | 'promptCacheTtl'
  + // Anthropic/Bedrock
  + | 'promptCache'
  + // Bedrock-specific
  + | 'promptCacheTtl'
  + // Anthropic-specific
Comment on lines 76 to +83
/* Anthropic only */
promptCache: {
type: Boolean,
},
promptCacheTtl: {
type: String,
enum: ['5m', '1h'],
},

Copilot AI Apr 29, 2026


The new promptCacheTtl field is placed under the /* Anthropic only */ section, but the TTL is Bedrock-specific. This comment/section placement is misleading; please move promptCacheTtl to a Bedrock section (or rename the comment to reflect shared/Bedrock usage).

@SharpLu force-pushed the feat/bedrock-cache-ttl branch from 96286dd to 9835757 on April 29, 2026 10:36
@SharpLu force-pushed the feat/bedrock-cache-ttl branch from 9835757 to e5ff82f on May 8, 2026 08:32
Contributor Author

SharpLu commented May 8, 2026

Hey @danny-avila, friendly bump on this one — just rebased onto main, CI is green.

The PR adds an optional endpoints.bedrock.promptCacheTtl: "5m" | "1h" in librechat.yaml so admins can opt into the new 1-hour Bedrock prompt cache TTL for Claude 4.5 models. When unset, behavior is unchanged (Bedrock defaults to 5 minutes). Useful for long agent sessions where reusing large system prompts/tools across turns more than 5 minutes apart would otherwise let the cache checkpoints expire.

Depends on danny-avila/agents#123 (also rebased now) for the runtime side.

Happy to address any feedback whenever you have a moment. Thanks!
