open-notebook/docs/3-USER-GUIDE/search.md
LUIS NOVO e13e4a2d8b docs: restructure documentation with new organized layout
- Replace old docs structure with new comprehensive documentation
- Organize into 8 major sections (0-START-HERE through 7-DEVELOPMENT)
- Convert CONFIGURATION.md, CONTRIBUTING.md, MAINTAINER_GUIDE.md to redirects
- Remove outdated MIGRATION.md and DESIGN_PRINCIPLES.md
- Fix all internal documentation links and cross-references
- Add progressive disclosure paths for different user types
- Include 44 focused guides covering all features
- Update README.md to remove v1.0 breaking changes notice
2026-01-03 20:10:24 -03:00

12 KiB

Search Effectively - Finding What You Need

Search is your gateway into your research. This guide covers two search modes and when to use each.


Quick-Start: Find Something

1. Go to your notebook
2. Type in search box
3. See results (both sources and notes)
4. Click result to view source/note
5. Done!

That works for basic searches.
But you can do much better...

Two Search Modes Explained

Open Notebook has two fundamentally different search approaches.

Search Type 1: TEXT SEARCH (Keyword Matching)

How it works:

  • You search for words: "transformer"
  • System finds chunks containing "transformer"
  • Ranked by relevance: frequency, position, context

Speed: Very fast (instant)

When to use:

  • You remember exact words or phrases
  • You're looking for specific terms
  • You want precise keyword matches
  • You need exact quotes

Example:

Search: "attention mechanism"
Results:
  1. "The attention mechanism allows..." (perfect match)
  2. "Attention and other mechanisms..." (partial match)
  3. "How mechanisms work in attention..." (includes words separately)

All contain "attention" AND "mechanism"
Ranked by how close together they are

What it finds:

  • Exact phrases: "transformer model"
  • Individual words: transformer OR model (too broad)
  • Names: "Vaswani et al."
  • Numbers: "1994", "GPT-4"
  • Technical terms: "LSTM", "convolution"

What it doesn't find:

  • Similar words: searching "attention" won't find "focus"
  • Synonyms: searching "large" won't find "big"
  • Concepts: searching "similarity" won't find "likeness"

Search Type 2: VECTOR SEARCH (Semantic/Concept Matching)

How it works:

  • Your search converted to embedding (vector)
  • All chunks converted to embeddings
  • System finds most similar embeddings
  • Ranked by semantic similarity

Speed: A bit slower (1-2 seconds)

When to use:

  • You're exploring a concept
  • You don't know exact words
  • You want semantically similar content
  • You're discovering, not searching

Example:

Search: "What's the mechanism for understanding in models?"
(Notice: No chunk likely says exactly that)

Results:
  1. "Mechanistic interpretability allows understanding..." (semantic match)
  2. "Feature attribution reveals how models work..." (conceptually similar)
  3. "Attention visualization shows model decisions..." (same topic)

None contain your exact words
But all are semantically related

What it finds:

  • Similar concepts: "understanding" + "interpretation" + "explainability" (all related)
  • Paraphrases: "big" and "large" (same meaning)
  • Related ideas: "safety" relates to "alignment" (connected concepts)
  • Analogies: content about biological learning when searching "learning"

What it doesn't find:

  • Exact keywords: if you search a rare word, vector search might miss it
  • Specific numbers: "1994" vs "1993" are semantically different
  • Technical jargon: "LSTM" and "RNN" are different even if related

Question: "Do I remember the exact words?"

→ YES: Use TEXT SEARCH
   Example: "I remember the paper said 'attention is all you need'"

→ NO: Use VECTOR SEARCH
   Example: "I'm looking for content about how models process information"

→ UNSURE: Try TEXT SEARCH first (faster)
         If no results, try VECTOR SEARCH

Text search: "I know what I'm looking for"
Vector search: "I'm exploring an idea"

1. Go to search box
2. Type your keywords: "transformer", "attention", "2017"
3. Press Enter
4. Results appear (usually instant)
5. Click result to see context

Results show:
  - Which source contains it
  - How many times it appears
  - Relevance score
  - Preview of surrounding text
1. Go to search box
2. Type your concept: "How do models understand language?"
3. Choose "Vector Search" from dropdown
4. Press Enter
5. Results appear (1-2 seconds)
6. Click result to see context

Results show:
  - Semantically related chunks
  - Similarity score (higher = more related)
  - Preview of surrounding text
  - Different sources mixed together

Ask is different from simple search. It automatically searches, synthesizes, and answers.

How Ask Works

Stage 1: QUESTION UNDERSTANDING
  "Compare the approaches in my papers"
  → System: "This asks for comparison"

Stage 2: SEARCH STRATEGY
  → System: "I should search for each approach separately"

Stage 3: PARALLEL SEARCHES
  → Search 1: "Approach in paper A"
  → Search 2: "Approach in paper B"
  (Multiple searches happen at once)

Stage 4: ANALYSIS & SYNTHESIS
  → Per-result analysis: "Based on paper A, the approach is..."
  → Per-result analysis: "Based on paper B, the approach is..."
  → Final synthesis: "Comparing A and B: A differs from B in..."

Result: Comprehensive answer, not just search results
Task Use Why
"Find the quote about X" TEXT SEARCH Need exact words
"What does source A say about X?" TEXT SEARCH Direct, fast answer
"Find content about X" VECTOR SEARCH Semantic discovery
"Compare A and B" ASK Comprehensive synthesis
"What's the big picture?" ASK Full analysis needed
"How do these sources relate?" ASK Cross-source synthesis
"I remember something about X" TEXT SEARCH Recall memory
"I'm exploring the topic of X" VECTOR SEARCH Discovery mode

Advanced Search Strategies

Strategy 1: Simple Search with Follow-Up

1. Text search: "attention mechanism"
   Results: 50 matches

2. Too many. Follow up with vector search:
   "Why is attention useful?" (concept search)
   Results: Most relevant papers/notes

3. Better results with less noise

Strategy 2: Ask for Comprehensive, Then Search for Details

1. Ask: "What are the main approaches to X?"
   Result: Comprehensive answer about A, B, C

2. Use that to identify specific sources

3. Text search in those specific sources:
   "Why did they choose method X?"
   Result: Detailed information

Strategy 3: Vector Search for Discovery, Text for Verification

1. Vector search: "How do transformers generalize?"
   Results: Related conceptual papers

2. Skim to understand landscape

3. Text search in promising sources:
   "generalization", "extrapolation", "transfer"
   Results: Specific passages to read carefully

Strategy 4: Combine Search with Chat

1. Vector search: "What's new in AI 2026?"
   Results: Latest papers

2. Go to Chat
3. Add those papers to context
4. Ask detailed follow-up questions
5. Get deep analysis of results

Search Quality Issues & Fixes

Getting No Results

Problem Cause Solution
Text search: no results Word doesn't appear Try vector search instead
Vector search: no results Concept not in content Try broader search term
Both empty Content not in notebook Add sources to notebook
Sources not processed Wait for processing to complete

Getting Too Many Results

Problem Cause Solution
1000+ results Search too broad Be more specific
All sources Filter by source
Keyword matches rare words Use vector search instead

Getting Wrong Results

Problem Cause Solution
Results irrelevant Search term has multiple meanings Provide more context
Using text search for concepts Try vector search
Different meaning Homonym (word means multiple things) Add context (e.g., "attention mechanism")

Getting Low Quality Results

Problem Cause Solution
Results don't match intent Vague search term Be specific ("Who invented X?" vs "X")
Concept not well-represented Add more sources on that topic
Vector embedding not trained on domain Use text search as fallback

Tips for Better Searches

  1. Be specific — "attention mechanism" not just "attention"
  2. Use exact phrases — Put quotes around: "attention is all you need"
  3. Include context — "LSTM vs attention" not just "attention"
  4. Use technical terms — These are usually more precise
  5. Try synonyms — If first search fails, try related terms
  1. Ask a question — "What's the best way to X?" is better than "best way"
  2. Use natural language — Explain what you're looking for
  3. Be specific about intent — "Compare X and Y" not "X and Y"
  4. Include context — "In machine learning, how..." vs just "how..."
  5. Think conceptually — What idea are you exploring?

General Tips

  1. Start broad, then narrow — "AI papers" → "transformers" → "attention mechanism"
  2. Try both search types — Each finds different things
  3. Use Ask for complex questions — Don't just search
  4. Save good results as notes — Create knowledge base
  5. Filter by source if needed — "Search in Paper A only"

Search Examples

Example 1: Finding a Specific Fact

Goal: "Find the date the transformer was introduced"

Step 1: Text search
  "transformer 2017" (or year you remember)

If that works: Done!

If no results: Try
  "attention is all you need" (famous paper title)

Check result for exact date

Example 2: Exploring a Concept

Goal: "Find content about alignment interpretability"

Step 1: Vector search
  "How do we make AI interpretable?"

Results: Papers on interpretability, transparency, alignment

Step 2: Review results
  See which papers are most relevant

Step 3: Deep dive
  Go to Chat, add top 2-3 papers
  Ask detailed questions about alignment

Example 3: Comprehensive Answer

Goal: "How do different approaches to AI safety compare?"

Step 1: Ask
  "Compare the main approaches to AI safety in my sources"

Result: Comprehensive analysis comparing approaches

Step 2: Identify sources
  From answer, see which papers were most relevant

Step 3: Deep dive
  Text search in those papers:
  "limitations", "critiques", "open problems"

Step 4: Save as notes
  Create comparison note from Ask result

Example 4: Finding Pattern

Goal: "Find all papers mentioning transformers"

Step 1: Text search
  "transformer"

Results: All papers mentioning "transformer"

Step 2: Vector search
  "neural network architecture for sequence processing"

Results: Papers that don't say "transformer" but discuss similar concept

Step 3: Combine
  Union of text + vector results shows full landscape

Step 4: Analyze
  Go to Chat with all results
  Ask: "What's common across all these?"

Search in the Workflow

How search fits with other features:

SOURCES
  ↓
SEARCH (find what matters)
  ├─ Text search (precise)
  ├─ Vector search (exploration)
  └─ Ask (comprehensive)
  ↓
CHAT (explore with follow-ups)
  ↓
TRANSFORMATIONS (batch extract)
  ↓
NOTES (save insights)

Workflow Example

1. Add 10 papers to notebook

2. Search: "What's the state of the art?"
   (Vector search explores landscape)

3. Ask: "Compare these 3 approaches"
   (Comprehensive synthesis)

4. Chat: Deep questions about winner
   (Follow-up exploration)

5. Save best insights as notes
   (Knowledge capture)

6. Transform remaining papers
   (Batch extraction for later)

7. Create podcast from notes + sources
   (Share findings)

TEXT SEARCH — "I know what I'm looking for"

  • Fast, precise, keyword-based
  • Use when you remember exact words/phrases
  • Best for: Finding specific facts, quotes, technical terms
  • Speed: Instant

VECTOR SEARCH — "I'm exploring an idea"

  • Slow-ish, concept-based, semantic
  • Use when you're discovering connections
  • Best for: Concept exploration, related ideas, synonyms
  • Speed: 1-2 seconds

ASK — "I want a comprehensive answer"

  • Auto-searches, auto-analyzes, synthesizes
  • Use for complex questions needing multiple sources
  • Best for: Comparisons, big-picture questions, synthesis
  • Speed: 10-30 seconds

Pick the right tool for your search goal, and you'll find what you need faster.