Prompt Caching (opens in new tab)
--- Prompt caching optimizes your API usage by allowing resuming from specific prefixes in your prompts. This significantly reduces processing time and costs for repetitive tasks or prompts with consistent elements. This feature is eligible for Zero Data Retention (ZDR). When your organization has a ZDR arrangement, data sent through this feature is not stored after the API response is returned. There are two ways to enable prompt caching: - **Automatic caching**: Add a single `cache_co...
Read the original article