ruvector/crates/ruvector-attention-node/npm/linux-x64-gnu/package.json
rUv 9d79eedec9 perf(sparse-inference): 6x speedup with W2 transpose and SIMD activations
Key optimizations in v0.1.31:
- W2 matrix stored transposed for contiguous row access during sparse accumulation
- SIMD GELU/SiLU using AVX2+FMA polynomial approximations
- Cached SIMD feature detection with OnceLock (eliminates runtime CPUID calls)
- SIMD axpy for vectorized weight accumulation

Benchmark results (512 input, 2048 hidden):
- 10% active: 130µs (83% reduction, 52× vs dense)
- 30% active: 383µs (83% reduction, 18× vs dense)
- 50% active: 651µs (83% reduction, 10× vs dense)
- 70% active: 912µs (83% reduction, 7× vs dense)

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-05 05:07:42 +00:00

22 lines
No EOL
376 B
JSON

{
"name": "@ruvector/attention-linux-x64-gnu",
"version": "0.1.4",
"os": [
"linux"
],
"cpu": [
"x64"
],
"main": "attention.linux-x64-gnu.node",
"files": [
"attention.linux-x64-gnu.node"
],
"license": "MIT OR Apache-2.0",
"engines": {
"node": ">= 10"
},
"libc": [
"glibc"
],
"repository": "https://github.com/ruvnet/ruvector"
}