koboldcpp/scripts
Ryan Goulden 26c9ce1288
server: Add cached_tokens info to oaicompat responses (#19361)
* tests : fix fetch_server_test_models.py

* server: to_json_oaicompat cached_tokens

Adds OpenAI and Anthropic compatible information about the
number of cached prompt tokens used in a response.
2026-03-19 19:09:33 +01:00
..
snapdragon/windows Merge commit '2cd20b72ed' into concedo_experimental 2026-03-10 22:11:08 +08:00