ruvector/examples/onnx-embeddings/examples/batch.rs
ruvnet 758fce1a22 chore(workspace): cargo fmt nested workspaces — rvf/, examples/*
Root-level `cargo fmt --all` doesn't recurse into nested workspaces
(crates/rvf/, examples/onnx-embeddings/, examples/data/, …), but
CI's `cargo fmt --all -- --check` was failing on files inside them
(e.g. crates/rvf/rvf-wire/src/hash.rs).

Ran `cargo fmt --all` inside each nested workspace. Mechanical-only
whitespace, no semantic change.

Touched nested workspaces:
  crates/rvf/*
  examples/onnx-embeddings/*
  examples/data/*
  examples/mincut/*
  examples/exo-ai-2025/*
  examples/prime-radiant/*
  examples/rvf/*
  examples/ultra-low-latency-sim/*
  examples/edge/*
  examples/vibecast-7sense/*
  examples/onnx-embeddings-wasm/*

Combined with previous commit (96d8fdc17), the full workspace tree
should now pass `cargo fmt --all -- --check` in CI.

Co-Authored-By: claude-flow <ruv@ruv.net>
2026-04-24 10:51:14 -04:00

56 lines
1.5 KiB
Rust

//! Batch embedding example with parallel processing
use anyhow::Result;
use ruvector_onnx_embeddings::{EmbedderBuilder, PoolingStrategy, PretrainedModel};
use std::time::Instant;
#[tokio::main]
async fn main() -> Result<()> {
// Create embedder with custom settings
let mut embedder = EmbedderBuilder::new()
.pretrained(PretrainedModel::AllMiniLmL6V2)
.pooling(PoolingStrategy::Mean)
.normalize(true)
.batch_size(32)
.max_length(256)
.build()
.await?;
// Generate test data
let texts: Vec<String> = (0..100)
.map(|i| format!("This is test sentence number {} for batch embedding.", i))
.collect();
println!("Embedding {} texts...", texts.len());
// Sequential embedding
let start = Instant::now();
let output = embedder.embed(&texts)?;
let seq_time = start.elapsed();
println!(
"Sequential: {:?} ({:.2} texts/sec)",
seq_time,
texts.len() as f64 / seq_time.as_secs_f64()
);
// Parallel embedding
let start = Instant::now();
let output_parallel = embedder.embed_parallel(&texts)?;
let par_time = start.elapsed();
println!(
"Parallel: {:?} ({:.2} texts/sec)",
par_time,
texts.len() as f64 / par_time.as_secs_f64()
);
println!(
"\nSpeedup: {:.2}x",
seq_time.as_secs_f64() / par_time.as_secs_f64()
);
println!("Total embeddings: {}", output.len());
println!("Dimension: {}", output.dimension);
Ok(())
}