CUDA: reduce MMQ stream-k overhead (#22298)
update-ops-docs.yml #310 -Commit 9725a313be pushed by vrr
upstream
2026-04-25 18:21:18 +00:00
0s
parser: fix structured output bug (#22302)
python-type-check.yml #309 -Commit 0adede866d pushed by vrr
upstream
2026-04-25 10:11:31 +00:00
0s
parser: fix structured output bug (#22302)
python-check-requirements.yml #308 -Commit 0adede866d pushed by vrr
upstream
2026-04-25 10:11:31 +00:00
0s
parser: fix structured output bug (#22302)
pre-tokenizer-hashes.yml #307 -Commit 0adede866d pushed by vrr
upstream
2026-04-25 10:11:31 +00:00
0s
upstream
2026-04-21 17:40:13 +00:00
0s
fix: GLM-DSA crash in llama-tokenize when using vocab_only (#22102)
python-check-requirements.yml #302 -Commit 81df3f7cfa pushed by vrr
upstream
2026-04-21 17:40:13 +00:00
0s
upstream
2026-04-21 17:40:13 +00:00
0s
android : libcommon -> libllama-common (#22076)
python-type-check.yml #300 -Commit 23b8cc4991 pushed by vrr
upstream
2026-04-19 23:40:14 +00:00
0s
android : libcommon -> libllama-common (#22076)
python-check-requirements.yml #299 -Commit 23b8cc4991 pushed by vrr
upstream
2026-04-19 23:40:14 +00:00
0s
android : libcommon -> libllama-common (#22076)
pre-tokenizer-hashes.yml #298 -Commit 23b8cc4991 pushed by vrr
upstream
2026-04-19 23:40:13 +00:00
0s
ci : add android arm64 build and release (#21647)
update-ops-docs.yml #297 -Commit a279d0f0f4 pushed by vrr
upstream
2026-04-18 17:40:14 +00:00
0s
ci : add android arm64 build and release (#21647)
python-type-check.yml #296 -Commit a279d0f0f4 pushed by vrr
upstream
2026-04-18 17:40:14 +00:00
0s
ci : add android arm64 build and release (#21647)
python-check-requirements.yml #295 -Commit a279d0f0f4 pushed by vrr
upstream
2026-04-18 17:40:14 +00:00
0s
ci : add android arm64 build and release (#21647)
pre-tokenizer-hashes.yml #294 -Commit a279d0f0f4 pushed by vrr
upstream
2026-04-18 17:40:14 +00:00
0s
upstream
2026-04-15 01:21:16 +00:00
0s
vulkan: Flash Attention DP4A shader for quantized KV cache (#20797)
python-check-requirements.yml #292 -Commit 75f3bc94e6 pushed by vrr
upstream
2026-04-15 01:21:16 +00:00
0s
upstream
2026-04-15 01:21:16 +00:00
0s
upstream
2026-04-13 01:21:16 +00:00
0s
CUDA: skip compilation of superfluous FA kernels (#21768)
python-check-requirements.yml #289 -Commit ff5ef82786 pushed by vrr
upstream
2026-04-13 01:21:16 +00:00
0s
CUDA: skip compilation of superfluous FA kernels (#21768)
pre-tokenizer-hashes.yml #288 -Commit ff5ef82786 pushed by vrr
upstream
2026-04-13 01:21:16 +00:00
0s
upstream
2026-04-11 19:42:12 +00:00
0s
upstream
2026-04-11 19:42:12 +00:00
0s
upstream
2026-04-11 19:42:12 +00:00
0s
upstream
2026-04-10 13:21:16 +00:00
0s
CUDA: also store `node->src->data` ptrs for equality check (#21635)
python-check-requirements.yml #283 -Commit d12cc3d1ca pushed by vrr
upstream
2026-04-10 13:21:16 +00:00
0s
upstream
2026-04-10 13:21:16 +00:00
0s
upstream
2026-04-08 13:21:17 +00:00
0s