ruvector/data/training/corpus_stats.json
rUv 850ff6be9a data: add merged training corpus (230 records, 530K tokens)
98 brain memories + 131 ADRs + 1 routing reference.
Governance: SHA-256 dedup, quality >= 0.5, schema validated.

Co-Authored-By: claude-flow <ruv@ruv.net>
2026-03-28 12:03:23 +00:00

18 lines
No EOL
337 B
JSON

{
"total_records": 132,
"total_estimated_tokens": 672420,
"per_source": {
"adr": {
"count": 131,
"estimated_tokens": 672357
},
"claude-routing": {
"count": 1,
"estimated_tokens": 63
}
},
"quality_histogram": {
"0.9-1.0": 132
},
"exported_at": "2026-03-28T11:58:50.319983+00:00"
}