Commit graph

7 commits

Author SHA1 Message Date
Claude
81ab90f6b0 feat(rvdna): add health biomarker analysis engine with streaming simulation
Implement ADR-014 Health Biomarker Analysis Architecture:
- biomarker.rs: Composite risk scoring engine with 17-SNP weight matrix,
  gene-gene interaction modifiers (COMT×OPRM1, MTHFR compound, BRCA1×TP53),
  64-dim HNSW-aligned profile vectors, clinical reference ranges for 12
  biomarkers, and deterministic synthetic population generation
- biomarker_stream.rs: Streaming biomarker simulator with generic RingBuffer,
  configurable noise/drift/anomaly injection, z-score anomaly detection,
  linear regression trend analysis, and exponential moving averages
- 35 unit tests + 15 integration tests (168 total, 0 failures)
- Criterion benchmark suite targeting ADR-014 performance budgets

https://claude.ai/code/session_014FpaYVohmyLH5dcBZTgmSY
2026-02-22 05:19:23 +00:00
rUv
161f890ddb fix: apply cargo fmt across workspace and fix CI issues
- Run cargo fmt --all to fix formatting in 362 files across the entire workspace
- Add PGDG repository for PostgreSQL 17 in CI test-all-features and benchmark jobs
- Add missing rvf dependency crates to standalone Dockerfile for domain-expansion
- Add sona-learning and domain-expansion features to standalone Dockerfile build
- Create npu.rs stub for ruvector-sparse-inference (fixes rustfmt resolution error)

Co-Authored-By: claude-flow <ruv@ruv.net>
2026-02-21 20:56:38 +00:00
Claude
1fc198da66 feat: integrate ruvector-solver into DNA and quantum components
DNA crate (rvdna):
- Add ruvector-solver dependency with forward-push feature
- New kmer_pagerank module: KmerGraphRanker uses Forward Push PPR to
  rank sequences by structural centrality in k-mer overlap graphs
- New solver_bench benchmark suite with 3 groups:
  A) Localized relevance via Forward Push PPR (20-200x speedup)
  B) Laplacian solve for denoising via Neumann/CG (10-80x speedup)
  C) Cohort-scale label propagation via CG solver
- README: add DNA Solver Benchmarks section with dataset citations
  (GIAB, NA12878, 1000 Genomes), graph construction docs, benchmark
  tables, and reproducibility instructions

Quantum crate (prime-radiant-category):
- Add ruvector-solver dependency with neumann/cg features
- SparseMatrix: replace O(nnz) COO Vec with O(1) HashMap entries,
  add to_csr_f64() and spmv_f64() using solver CsrMatrix
- ComplexMatrix: add Jacobi eigenvalue algorithm for real-symmetric
  matrices (much more stable than power iteration + deflation),
  add to_csr_real() and is_real_valued() helper methods
- DensityMatrix: add SpectralDecomposition cache, purity_fast() via
  Frobenius norm O(n²) vs O(n³), static eigenvalue helpers
- SimplicialComplex: add graph_laplacian_csr() for spectral analysis
- SolverBackedOperator: sparse quantum operator using CsrMatrix SpMV
  for 40-60 effective qubit scaling (vs ~33 with dense matrices)
- New quantum_solver_bench: SpMV scaling, eigenvalue convergence,
  memory scaling benchmarks from 10 to 30 qubits

All 362 tests pass (81 quantum + 102 DNA + 179 solver).

https://claude.ai/code/session_01TiqLbr2DaNAntQHaVeLfiR
2026-02-20 13:37:24 +00:00
rUv
829ca9f014 feat(rvdna): rename package to rvdna, publish to crates.io and npm
Rename dna-analyzer-example to rvdna across all source files, tests,
and benchmarks. Add crates.io metadata (repository, docs, keywords).
Publish rvdna v0.1.0 to crates.io and @ruvector/rvdna v0.1.0 to npm
with NAPI-RS platform loader, JS fallbacks, and TypeScript definitions.

Also publishes workspace deps at v2.0.2: ruvector-math, ruvector-core,
ruvector-filter, ruvector-collections, ruvector-graph, ruvector-gnn.

Co-Authored-By: claude-flow <ruv@ruv.net>
2026-02-12 15:47:00 +00:00
rUv
244bbffe54 feat: add package.json for rvdna example with WASM bindings and build scripts 2026-02-12 15:32:55 +00:00
Claude
cfee05c384 feat(dna): implement missing capabilities + optimize hot paths
- Affine gap scoring: 3-matrix Smith-Waterman (H/E/F) with flat 1D
  arrays for cache-friendly access, direct slice indexing
- Indel detection: call_indel() for insertion/deletion from pileup data
- VCF output: VCFv4.3 format with proper CHROM/POS/REF/ALT/QUAL columns
- CYP2C19 pharmacogenomics: star allele calling (*1/*2/*3/*17),
  phenotype prediction, drug recommendations (clopidogrel, voriconazole)
- Cancer signal detection: methylation entropy + extreme ratio scoring,
  CancerSignalDetector with configurable risk threshold
- Molecular weight: monoisotopic Da for all 20 amino acids
- Isoelectric point: Henderson-Hasselbalch bisection with sidechain pKa
- K-mer encoding: zero-allocation canonical hashing (hash both strands,
  take min) eliminates O(n) Vec allocs per sliding window
- CRC32: lookup table replaces bit-by-bit (~8x faster header checksums)
- Benchmarks: added RVDNA, epigenomics, protein analysis groups

95 tests pass (54 lib + 12 kmer + 17 pipeline + 12 security)

https://claude.ai/code/session_013B6stXbYwAkWHbE16sjUrq
2026-02-11 05:31:16 +00:00
Claude
541fc85e4b feat(dna): complete SOTA genomic analysis pipeline with full test suite
Implements a comprehensive DNA analyzer demonstrating RuVector's vector
computing capabilities for bioinformatics:

Modules (9):
- types: Core domain types (DnaSequence, Nucleotide, ProteinSequence, etc.)
- kmer: HNSW k-mer indexing with FNV-1a hashing and MinHash sketching
- alignment: Smith-Waterman local alignment with CIGAR generation
- variant: SNP calling from pileup data with genotype classification
- protein: DNA-to-protein translation with contact graph prediction
- epigenomics: Horvath clock biological age prediction from CpG methylation
- pharma: CYP2D6 star allele calling and metabolizer phenotype prediction
- pipeline: DAG-based genomic analysis orchestration
- error: Typed error handling across all modules

Testing (41 tests, 0 mocks):
- 12 k-mer integration tests (encoding, HNSW search, MinHash Jaccard)
- 17 pipeline e2e tests (alignment, variant calling, pharmacogenomics)
- 12 security tests (buffer overflow, path traversal, concurrency, bounds)

Benchmarks: Criterion suite for kmer, alignment, variant, protein, pipeline

Binary: 7-stage demo (sequence gen, k-mer search, alignment, variant
calling, protein analysis, epigenomics, pharmacogenomics)

https://claude.ai/code/session_013B6stXbYwAkWHbE16sjUrq
2026-02-11 04:29:28 +00:00