koboldcpp/examples
Georgi Gerganov d2e179a477
llama-eval : add per-task summary stats (#23151)
* llama-eval : add per-problem summary table to HTML reports

- Add chunk_idx and problem_idx to TaskState and saved case dicts
- Group completed cases by problem_idx in dump_html()
- Render per-problem summary table before individual task table
  - Columns: Problem (zero-padded), Runs, Correct (n/r),
    Tokens (min/avg/max), T/s (min/avg/max), Gen s (min/avg/max)
  - Sorted by problem index, monospace font, right-aligned numbers
  - Colspan headers for grouped stats, auto width
- Simulator: add /v1/models endpoint, timings in response,
  template-aware question matching, --dataset arg (aime/aime2025)

Assisted-by: llama.cpp:local pi

* llama-eval : add tabs for Detailed and Summary tables, apply monospace font globally

- Wrap Detailed and Summary tables in switchable tabs (Detailed active by default)
- Remove summary-section wrapper, use tab labels instead
- Apply monospace font to all tables and the top bar

Assisted-by: llama.cpp:local pi

* llama-eval : redesign top bar as CSS grid label/value pairs

- Replace flat span list with 4-column grid layout (2 pairs per row)
- Labels in muted color (#888), values in dark (#222)
- Bold dataset name and model name
- Removed media query, always uses 4 columns

Assisted-by: llama.cpp:local pi

* llama-eval : use realistic token counts and throughput in simulator

- comp_tokens: [30, 80] → [10000, 60000]
- tps_gen: derived → uniform [90.0, 110.0]
- t_gen_ms: now computed from tokens/tps

Assisted-by: llama.cpp:local pi

* llama-eval : color Answer column green/red based on correctness

Use the same .correct/.incorrect CSS classes on the Answer column
to make correct answers green and incorrect answers red.

Assisted-by: llama.cpp:local pi

* llama-eval : fix pyright errors from max(..., key=len) type inference

Use key=lambda x: len(x) instead of key=len so the type checker
infers the return type as str instead of Sized, fixing:
  - unresolved-attribute: Object of type Sized has no attribute lower
  - not-subscriptable: Cannot subscript object of type Sized

Assisted-by: llama.cpp:local pi
2026-05-19 09:46:05 +03:00
..
batched libs : rename libcommon -> libllama-common (#21936) 2026-04-17 11:11:46 +03:00
batched.swift
convert-llama2c-to-ggml libs : rename libcommon -> libllama-common (#21936) 2026-04-17 11:11:46 +03:00
debug common: fix missing exports in llama-common (#22340) 2026-04-27 08:06:39 +03:00
deprecation-warning Fix locale-dependent float printing in GGUF metadata (#17331) 2026-03-04 09:30:40 +01:00
diffusion examples: refactor diffusion generation (#22590) 2026-05-04 20:19:30 +08:00
embedding libs : rename libcommon -> libllama-common (#21936) 2026-04-17 11:11:46 +03:00
eval-callback common: fix missing exports in llama-common (#22340) 2026-04-27 08:06:39 +03:00
gen-docs spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
gguf Fix locale-dependent float printing in GGUF metadata (#17331) 2026-03-04 09:30:40 +01:00
gguf-hash Fix locale-dependent float printing in GGUF metadata (#17331) 2026-03-04 09:30:40 +01:00
idle libs : rename libcommon -> libllama-common (#21936) 2026-04-17 11:11:46 +03:00
llama-eval llama-eval : add per-task summary stats (#23151) 2026-05-19 09:46:05 +03:00
llama.android android : libcommon -> libllama-common (#22076) 2026-04-18 11:19:40 +02:00
llama.swiftui
lookahead libs : rename libcommon -> libllama-common (#21936) 2026-04-17 11:11:46 +03:00
lookup spec : refactor params (#22397) 2026-04-28 09:07:33 +03:00
model-conversion model-conversion : add causal-convert-mmproj target [no ci] (#22969) 2026-05-12 15:15:40 +02:00
parallel libs : rename libcommon -> libllama-common (#21936) 2026-04-17 11:11:46 +03:00
passkey libs : rename libcommon -> libllama-common (#21936) 2026-04-17 11:11:46 +03:00
retrieval libs : rename libcommon -> libllama-common (#21936) 2026-04-17 11:11:46 +03:00
save-load-state common : only load backends when required (#22290) 2026-05-05 09:23:50 +02:00
simple Fix locale-dependent float printing in GGUF metadata (#17331) 2026-03-04 09:30:40 +01:00
simple-chat Fix locale-dependent float printing in GGUF metadata (#17331) 2026-03-04 09:30:40 +01:00
simple-cmake-pkg
speculative spec : fix vocab compat checks in spec example (#22426) 2026-04-30 08:18:25 +03:00
speculative-simple spec : parallel drafting support (#22838) 2026-05-11 19:09:43 +03:00
sycl sycl : fix error when use -mg 1 error (#23140) 2026-05-18 08:11:19 +03:00
training libs : rename libcommon -> libllama-common (#21936) 2026-04-17 11:11:46 +03:00
CMakeLists.txt examples : add debug utility/example (#18464) 2026-01-07 10:42:19 +01:00
convert_legacy_llama.py
json_schema_pydantic_example.py
json_schema_to_grammar.py ci : switch from pyright to ty (#20826) 2026-03-21 08:54:34 +01:00
llama.vim chore : correct typos [no ci] (#20041) 2026-03-05 08:50:21 +01:00
pydantic_models_to_grammar.py ci : switch from pyright to ty (#20826) 2026-03-21 08:54:34 +01:00
pydantic_models_to_grammar_examples.py
reason-act.sh
regex_to_grammar.py
server-llama2-13B.sh
server_embd.py
ts-type-to-grammar.sh