mirror of
https://github.com/ruvnet/RuVector.git
synced 2026-05-24 22:15:18 +00:00
docs(readme): add downloads badge for rvLite, add Agentic-Synth section
rvLite: - Add downloads badge linking to npm package Agentic-Synth - AI Synthetic Data Generation: - Problem/Solution comparison table - Key features: multi-model, caching, routing, DSPy.ts - Data generation types: time-series, events, structured, embeddings - Quick start with npx commands - Basic usage examples (structured, time-series, streaming) - Self-learning with DSPy optimizer example - Performance metrics (98.2% faster with caching) - Ecosystem integration table Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
parent
4c44894c8a
commit
cdca236b2f
1 changed files with 145 additions and 0 deletions
145
README.md
145
README.md
|
|
@ -1141,6 +1141,7 @@ npm install @ruvector/rudag-wasm
|
|||
|
||||
[](https://crates.io/crates/rvlite)
|
||||
[](https://www.npmjs.com/package/@ruvector/rvlite)
|
||||
[](https://www.npmjs.com/package/@ruvector/rvlite)
|
||||
|
||||
**A complete vector database that runs anywhere JavaScript runs** — browsers, Node.js, Deno, Bun, Cloudflare Workers, Vercel Edge Functions.
|
||||
|
||||
|
|
@ -1445,6 +1446,150 @@ npm install @ruvector/edge-net
|
|||
|
||||
</details>
|
||||
|
||||
<details>
|
||||
<summary><strong>🎲 Agentic-Synth - AI Synthetic Data Generation</strong></summary>
|
||||
|
||||
[](https://www.npmjs.com/package/@ruvector/agentic-synth)
|
||||
[](https://www.npmjs.com/package/@ruvector/agentic-synth)
|
||||
|
||||
**AI-Powered Synthetic Data Generation at Scale** — Generate unlimited, high-quality synthetic data for training AI models, testing systems, and building robust agentic applications.
|
||||
|
||||
### Why Agentic-Synth?
|
||||
|
||||
| Problem | Solution |
|
||||
|---------|----------|
|
||||
| Real data is **expensive** to collect | Generate **unlimited** synthetic data |
|
||||
| **Privacy-sensitive** with compliance risks | **Fully synthetic**, no PII concerns |
|
||||
| **Slow** to generate at scale | **10-100x faster** than manual creation |
|
||||
| **Insufficient** for edge cases | **Customizable** schemas for any scenario |
|
||||
| **Hard to reproduce** across environments | **Reproducible** with seed values |
|
||||
|
||||
### Key Features
|
||||
|
||||
| Feature | Description |
|
||||
|---------|-------------|
|
||||
| **Multi-Model Support** | Gemini, OpenRouter, GPT, Claude, and 50+ models via DSPy.ts |
|
||||
| **Context Caching** | 95%+ performance improvement with intelligent LRU cache |
|
||||
| **Smart Model Routing** | Automatic load balancing, failover, and cost optimization |
|
||||
| **DSPy.ts Integration** | Self-learning optimization with 20-25% quality improvement |
|
||||
| **Streaming** | AsyncGenerator for real-time data flow |
|
||||
| **Memory Efficient** | <50MB for datasets up to 10K records |
|
||||
|
||||
### Data Generation Types
|
||||
|
||||
| Type | Use Cases |
|
||||
|------|-----------|
|
||||
| **Time-Series** | Financial data, IoT sensors, metrics |
|
||||
| **Events** | Logs, user actions, system events |
|
||||
| **Structured** | JSON, CSV, databases, APIs |
|
||||
| **Embeddings** | Vector data for RAG systems |
|
||||
|
||||
### Quick Start
|
||||
|
||||
```bash
|
||||
# Install
|
||||
npm install @ruvector/agentic-synth
|
||||
|
||||
# Or run instantly with npx
|
||||
npx @ruvector/agentic-synth generate --count 100
|
||||
|
||||
# Interactive mode
|
||||
npx @ruvector/agentic-synth interactive
|
||||
```
|
||||
|
||||
### Basic Usage
|
||||
|
||||
```typescript
|
||||
import { AgenticSynth } from '@ruvector/agentic-synth';
|
||||
|
||||
// Initialize with your preferred model
|
||||
const synth = new AgenticSynth({
|
||||
model: 'gemini-pro',
|
||||
apiKey: process.env.GEMINI_API_KEY
|
||||
});
|
||||
|
||||
// Generate structured data
|
||||
const users = await synth.generate({
|
||||
schema: {
|
||||
name: 'string',
|
||||
email: 'email',
|
||||
age: 'number:18-65',
|
||||
role: ['admin', 'user', 'guest']
|
||||
},
|
||||
count: 1000
|
||||
});
|
||||
|
||||
// Generate time-series data
|
||||
const stockData = await synth.timeSeries({
|
||||
fields: ['open', 'high', 'low', 'close', 'volume'],
|
||||
interval: '1h',
|
||||
count: 500,
|
||||
volatility: 0.02
|
||||
});
|
||||
|
||||
// Stream large datasets
|
||||
for await (const batch of synth.stream({ count: 100000, batchSize: 1000 })) {
|
||||
await processData(batch);
|
||||
}
|
||||
```
|
||||
|
||||
### Self-Learning with DSPy
|
||||
|
||||
```typescript
|
||||
import { AgenticSynth, DSPyOptimizer } from '@ruvector/agentic-synth';
|
||||
|
||||
// Enable self-learning optimization
|
||||
const synth = new AgenticSynth({
|
||||
model: 'gemini-pro',
|
||||
optimizer: new DSPyOptimizer({
|
||||
learningRate: 0.1,
|
||||
qualityThreshold: 0.85
|
||||
})
|
||||
});
|
||||
|
||||
// Quality improves automatically over time
|
||||
const data = await synth.generate({
|
||||
schema: { ... },
|
||||
count: 1000,
|
||||
optimize: true // Enable learning
|
||||
});
|
||||
|
||||
console.log(`Quality score: ${data.metrics.quality}`);
|
||||
// First run: 0.72
|
||||
// After 100 runs: 0.94 (+25% improvement)
|
||||
```
|
||||
|
||||
### Performance
|
||||
|
||||
| Metric | Value |
|
||||
|--------|-------|
|
||||
| **With caching** | 98.2% faster |
|
||||
| **P99 latency** | 2500ms → 45ms |
|
||||
| **Memory** | <50MB for 10K records |
|
||||
| **Throughput** | 1000+ records/sec |
|
||||
|
||||
### Ecosystem Integration
|
||||
|
||||
| Package | Purpose |
|
||||
|---------|---------|
|
||||
| **RuVector** | Native vector database for RAG |
|
||||
| **DSPy.ts** | Prompt optimization |
|
||||
| **Agentic-Jujutsu** | Version-controlled generation |
|
||||
|
||||
### Installation
|
||||
|
||||
```bash
|
||||
# npm
|
||||
npm install @ruvector/agentic-synth
|
||||
|
||||
# Examples package (50+ production examples)
|
||||
npm install @ruvector/agentic-synth-examples
|
||||
```
|
||||
|
||||
> **Full Documentation**: [agentic-synth README](./npm/packages/agentic-synth/README.md)
|
||||
|
||||
</details>
|
||||
|
||||
<details>
|
||||
<summary><strong>🐘 PostgreSQL Extension</strong></summary>
|
||||
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue