koboldcpp

mirror of https://github.com/LostRuins/koboldcpp.git synced 2026-05-24 05:33:38 +00:00

Georgi Gerganov d2e179a477 llama-eval : add per-task summary stats (#23151)	2026-05-19 09:46:05 +03:00

* llama-eval : add per-problem summary table to HTML reports

- Add chunk_idx and problem_idx to TaskState and saved case dicts
- Group completed cases by problem_idx in dump_html()
- Render per-problem summary table before individual task table
- Columns: Problem (zero-padded), Runs, Correct (n/r), Tokens (min/avg/max), T/s (min/avg/max), Gen s (min/avg/max)
- Sorted by problem index, monospace font, right-aligned numbers
- Colspan headers for grouped stats, auto width
- Simulator: add /v1/models endpoint, timings in response, template-aware question matching, --dataset arg (aime/aime2025)

Assisted-by: llama.cpp:local pi

* llama-eval : add tabs for Detailed and Summary tables, apply monospace font globally

- Wrap Detailed and Summary tables in switchable tabs (Detailed active by default)
- Remove summary-section wrapper, use tab labels instead
- Apply monospace font to all tables and the top bar

Assisted-by: llama.cpp:local pi

* llama-eval : redesign top bar as CSS grid label/value pairs

- Replace flat span list with 4-column grid layout (2 pairs per row)
- Labels in muted color (#888), values in dark (#222)
- Bold dataset name and model name
- Removed media query, always uses 4 columns

Assisted-by: llama.cpp:local pi

* llama-eval : use realistic token counts and throughput in simulator

- comp_tokens: [30, 80] → [10000, 60000]
- tps_gen: derived → uniform [90.0, 110.0]
- t_gen_ms: now computed from tokens/tps

Assisted-by: llama.cpp:local pi

* llama-eval : color Answer column green/red based on correctness

Use the same .correct/.incorrect CSS classes on the Answer column to make correct answers green and incorrect answers red.

Assisted-by: llama.cpp:local pi

* llama-eval : fix pyright errors from max(..., key=len) type inference

Use key=lambda x: len(x) instead of key=len so the type checker infers the return type as str instead of Sized, fixing:

- unresolved-attribute: Object of type Sized has no attribute lower
- not-subscriptable: Cannot subscript object of type Sized

Assisted-by: llama.cpp:local pi
batched	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
batched.swift
convert-llama2c-to-ggml	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
debug	common: fix missing exports in llama-common (#22340)	2026-04-27 08:06:39 +03:00
deprecation-warning	Fix locale-dependent float printing in GGUF metadata (#17331)	2026-03-04 09:30:40 +01:00
diffusion	examples: refactor diffusion generation (#22590)	2026-05-04 20:19:30 +08:00
embedding	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
eval-callback	common: fix missing exports in llama-common (#22340)	2026-04-27 08:06:39 +03:00
gen-docs	spec : refactor params (#22397)	2026-04-28 09:07:33 +03:00
gguf	Fix locale-dependent float printing in GGUF metadata (#17331)	2026-03-04 09:30:40 +01:00
gguf-hash	Fix locale-dependent float printing in GGUF metadata (#17331)	2026-03-04 09:30:40 +01:00
idle	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
llama-eval	llama-eval : add per-task summary stats (#23151)	2026-05-19 09:46:05 +03:00
llama.android	android : libcommon -> libllama-common (#22076)	2026-04-18 11:19:40 +02:00
llama.swiftui
lookahead	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
lookup	spec : refactor params (#22397)	2026-04-28 09:07:33 +03:00
model-conversion	model-conversion : add causal-convert-mmproj target [no ci] (#22969)	2026-05-12 15:15:40 +02:00
parallel	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
passkey	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
retrieval	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
save-load-state	common : only load backends when required (#22290)	2026-05-05 09:23:50 +02:00
simple	Fix locale-dependent float printing in GGUF metadata (#17331)	2026-03-04 09:30:40 +01:00
simple-chat	Fix locale-dependent float printing in GGUF metadata (#17331)	2026-03-04 09:30:40 +01:00
simple-cmake-pkg
speculative	spec : fix vocab compat checks in spec example (#22426)	2026-04-30 08:18:25 +03:00
speculative-simple	spec : parallel drafting support (#22838)	2026-05-11 19:09:43 +03:00
sycl	sycl : fix error when use -mg 1 error (#23140)	2026-05-18 08:11:19 +03:00
training	libs : rename libcommon -> libllama-common (#21936)	2026-04-17 11:11:46 +03:00
CMakeLists.txt	examples : add debug utility/example (#18464)	2026-01-07 10:42:19 +01:00
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py	ci : switch from pyright to ty (#20826)	2026-03-21 08:54:34 +01:00
llama.vim	chore : correct typos [no ci] (#20041)	2026-03-05 08:50:21 +01:00
pydantic_models_to_grammar.py	ci : switch from pyright to ty (#20826)	2026-03-21 08:54:34 +01:00
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh
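
The per-problem summary described in the llama-eval commit message (#23151) groups completed cases by problem_idx and reports runs, correct counts and min/avg/max statistics. A minimal sketch of that grouping is shown here. It is not the code from the commit: only the problem_idx key comes from the commit message, while the function name summarize_by_problem, the "correct" and "tokens" keys, the three-digit zero padding and the sample data are hypothetical.

```python
from collections import defaultdict
from statistics import mean


def summarize_by_problem(cases):
    """Group case dicts by problem_idx and compute per-problem stats.

    Assumes each case has the hypothetical keys "correct" (bool) and
    "tokens" (int) in addition to "problem_idx".
    """
    groups = defaultdict(list)
    for case in cases:
        groups[case["problem_idx"]].append(case)

    rows = []
    for problem_idx in sorted(groups):  # sorted by problem index
        runs = groups[problem_idx]
        tokens = [c["tokens"] for c in runs]
        n_correct = sum(1 for c in runs if c["correct"])
        rows.append({
            "problem": f"{problem_idx:03d}",  # zero-padded; width is an assumption
            "runs": len(runs),
            "correct": f"{n_correct}/{len(runs)}",
            "tokens": (min(tokens), mean(tokens), max(tokens)),
        })
    return rows


# Made-up sample data for illustration only.
cases = [
    {"problem_idx": 1, "correct": True, "tokens": 12000},
    {"problem_idx": 0, "correct": False, "tokens": 30000},
    {"problem_idx": 1, "correct": False, "tokens": 18000},
]
rows = summarize_by_problem(cases)
for row in rows:
    print(row)
```

The same min/avg/max tuple could be computed for throughput and generation time if those values are stored per case.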