Anthropic prompt caching support #25625
mrdrprofuroboros
started this conversation in
Ideas
Replies: 2 comments
-
implemented in #27087
-
Feature request
Anthropic recently added prompt caching, with its own special pricing and separate cache write/hit token counts:
https://www.anthropic.com/news/prompt-caching
This feature will likely start appearing in other providers as well. I'm spinning up this thread to ask whether there are any plans to integrate it into the LangChain standard API.
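For context, here is a rough sketch of how the caching marker looks in a raw Messages API request, based on the linked announcement. The model id, minimum prefix length, and the `cache_control` / `ephemeral` field names are assumptions drawn from the beta docs and may drift as the beta evolves; this only builds the payload, it does not call the API.

```python
# Sketch (assumption-based): mark the system prompt as the end of a cacheable
# prefix by attaching a `cache_control` block, per the prompt-caching beta.

# The cached prefix must exceed a minimum token length, so real system
# prompts for caching tend to be long documents.
LONG_SYSTEM_PROMPT = "You are a helpful assistant. " * 200

def build_cached_request(user_text: str) -> dict:
    """Build a Messages API payload with the system prompt flagged for caching."""
    return {
        "model": "claude-3-5-sonnet-20240620",  # example model id, an assumption
        "max_tokens": 1024,
        "system": [
            {
                "type": "text",
                "text": LONG_SYSTEM_PROMPT,
                # Everything up to and including this block becomes the cached prefix
                "cache_control": {"type": "ephemeral"},
            }
        ],
        "messages": [{"role": "user", "content": user_text}],
    }

payload = build_cached_request("Summarize our refund policy.")
```

Subsequent requests that share the same prefix would then hit the cache instead of re-billing the full prompt at the normal input rate.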
Motivation
I'm using langfuse for token/cost monitoring, and right now I have to resort to quite ugly workarounds to use Claude's prompt cache: https://github.com/orgs/langfuse/discussions/2987
I think there are a lot of devs who would want to use Anthropic prompt caching within the LangChain infra.
Proposal (If applicable)
Right now Anthropic requires marking one of the messages as the end of a cacheable prefix; the tools and system messages up to that flag are cached. I'm not sure whether OpenAI has any work in that direction. The whole feature is exposed through the Anthropic beta API (https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching). A simple starting step might be inheriting from
AnthropicBetaClient
and substituting the underlying Anthropic calls, plus exposing cache writes/hits on the response object. Would love to hear your thoughts on that.
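To make the "expose cache writes/hits on the response object" idea concrete, here is a minimal sketch of the normalization step such a wrapper could perform so that a monitoring tool like langfuse sees consistent token counts. The usage field names (`cache_creation_input_tokens`, `cache_read_input_tokens`) are assumptions taken from the beta announcement, and the pricing multipliers in the comments are approximate.

```python
# Hypothetical sketch: fold Anthropic's beta cache-usage fields into one
# dict a cost-monitoring integration could consume. Field names are
# assumptions from the prompt-caching beta docs.

def normalize_usage(usage: dict) -> dict:
    cache_write = usage.get("cache_creation_input_tokens", 0) or 0
    cache_read = usage.get("cache_read_input_tokens", 0) or 0
    uncached = usage.get("input_tokens", 0) or 0
    return {
        "input_tokens": uncached + cache_write + cache_read,  # total prompt tokens
        "cache_write_tokens": cache_write,  # billed at a premium (~1.25x, per announcement)
        "cache_read_tokens": cache_read,    # billed at a steep discount (~0.1x)
        "output_tokens": usage.get("output_tokens", 0) or 0,
    }

# Example: a cache hit on a 2000-token prefix plus 50 fresh input tokens
stats = normalize_usage({
    "input_tokens": 50,
    "cache_read_input_tokens": 2000,
    "output_tokens": 120,
})
```

A subclassed client could attach a dict like this to the response object, which would remove the need for per-project workarounds in downstream monitoring.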