Georgi Gerganov
|
d2fcd91cf9
|
server : disable context shift by default (#15416)
* server : disable context shift by default
ggml-ci
* server : make scopr of test parameters local
|
2025-08-19 16:46:37 +03:00 |
|
Lukas Straub
|
a9f77a8be3
|
server : add openai-style logit_bias support (#14946)
Signed-off-by: Lukas Straub <lukasstraub2@web.de>
|
2025-07-31 14:08:23 +02:00 |
|
Olivier Chafik
|
f13847cfb5
|
server: fix regression on streamed non-chat completion w/ stops (#13785)
* more forgiving message diffs: partial stop words aren't erased, full stops are
* Add (slow) server test for completion + stream + stop
|
2025-05-26 14:16:37 +01:00 |
|
Xuan-Son Nguyen
|
360a9c98e1
|
server : fix cache_tokens bug with no cache_prompt (#13533)
|
2025-05-14 13:35:07 +02:00 |
|
Diego Devesa
|
1d36b3670b
|
llama : move end-user examples to tools directory (#13249)
* llama : move end-user examples to tools directory
---------
Co-authored-by: Xuan Son Nguyen <son@huggingface.co>
|
2025-05-02 20:27:13 +02:00 |
|