Mirror of https://github.com/LostRuins/koboldcpp.git, synced 2026-05-09 02:50:39 +00:00
* common : add default embeddings presets
This commit adds default embeddings presets for the following models:
- bge-small-en-v1.5
- e5-small-v2
- gte-small
These can be used with llama-embedding and llama-server.
For example, with llama-embedding:
```console
./build/bin/llama-embedding --embd-gte-small-default -p "Hello, how are you?"
```
And with llama-server:
```console
./build/bin/llama-server --embd-gte-small-default
```
The server's embeddings endpoint can then be called with a POST request:
```console
curl --request POST \
--url http://localhost:8080/embeddings \
--header "Content-Type: application/json" \
--data '{"input": "Hello, how are you?"}'
```
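Once the endpoint returns an embedding vector, a typical downstream use is comparing embeddings by cosine similarity. Below is a minimal Python sketch of that comparison; the short vectors are hypothetical placeholders standing in for real server output (the small models listed above produce 384-dimensional vectors), and the function names are illustrative, not part of llama.cpp.

```python
import math

def cosine_similarity(a, b):
    # Cosine similarity: dot(a, b) / (|a| * |b|)
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

# Hypothetical 4-dimensional vectors standing in for real embeddings.
v1 = [0.1, 0.2, 0.3, 0.4]
v2 = [0.4, 0.3, 0.2, 0.1]
print(cosine_similarity(v1, v1))  # identical vectors: similarity of ~1.0
print(cosine_similarity(v1, v2))  # different vectors: lower similarity
```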
I'm not sure whether these are the most common embedding models, but hopefully this is a good starting point for discussion and further improvements.
Refs: https://github.com/ggerganov/llama.cpp/issues/10932