
Re-implement token caching for Vercel AI SDK usage #60

Merged — 9 commits — Mar 4, 2025

Conversation

bhouston (Member)

@bhouston commented Mar 3, 2025

This PR implements token caching for the Vercel AI SDK with the Anthropic provider. It adds the appropriate `providerOptions.anthropic.cacheControl: 'ephemeral'` property to the last two messages in the conversation, which allows the conversation up to that point to be cached (with a ~5 minute window), reducing token consumption during repeated API calls.

Fixes #58
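The approach described above can be sketched as a small helper that annotates the last two messages before they are passed to the SDK. This is an illustrative sketch, not the PR's actual code: the `Message` type and `addCacheControl` function are hypothetical, and the exact shape of the Anthropic provider options (`{ type: 'ephemeral' }`) is an assumption based on the comment's description.

```typescript
// Hypothetical helper illustrating the technique from this PR:
// mark the last two messages with Anthropic ephemeral cache control
// so the conversation prefix up to that point can be cached.

type Message = {
  role: 'system' | 'user' | 'assistant';
  content: string;
  providerOptions?: {
    anthropic?: { cacheControl: { type: 'ephemeral' } };
  };
};

function addCacheControl(messages: Message[]): Message[] {
  // Copy the array so the caller's messages are untouched;
  // annotate only the final two entries.
  return messages.map((message, index) => {
    if (index < messages.length - 2) return message;
    return {
      ...message,
      providerOptions: {
        ...message.providerOptions,
        anthropic: { cacheControl: { type: 'ephemeral' } },
      },
    };
  });
}
```

Annotating only the trailing messages lets the provider treat everything before the cache markers as a stable, reusable prefix across repeated calls in the same conversation.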

@bhouston bhouston force-pushed the feature/issue-58-token-caching branch from 15ac90d to 870cbee Compare March 3, 2025 21:18
@bhouston bhouston merged commit 73604de into main Mar 4, 2025
1 check failed