Files

defiQUG 903c03c65b Add full monorepo: virtual-banker, backend, frontend, docs, scripts, deployment

Co-authored-by: Cursor <cursoragent@cursor.com>

2026-02-10 11:32:49 -08:00

3.8 KiB

Raw Blame History

Unified Search Architecture

Overview

This document specifies the unified search architecture that enables searching across multiple chains and entity types with relevance ranking.

Architecture

flowchart TB
    Query[Search Query]
    Router[Query Router]
    
    subgraph Search[Search Services]
        ES1[Elasticsearch<br/>Chain 138]
        ES2[Elasticsearch<br/>Chain 1]
        ES3[Elasticsearch<br/>Chain 137]
    end
    
    Agg[Aggregator]
    Rank[Relevance Ranking]
    Results[Unified Results]
    
    Query --> Router
    Router --> ES1
    Router --> ES2
    Router --> ES3
    ES1 --> Agg
    ES2 --> Agg
    ES3 --> Agg
    Agg --> Rank
    Rank --> Results

Search Algorithm

Query Processing

Steps:

Parse query (extract terms, filters)
Determine chain scope (all chains or specific chain)
Route to appropriate search indices
Execute searches in parallel
Aggregate results
Rank by relevance
Return unified results

Query Types

Exact Match (Hash, Address):

Direct lookup in specific chain
Return single result if found

Full-Text Search (Name, Symbol, Label):

Search across all chains
Rank by relevance
Return top N results per chain

Fuzzy Search (Typos, Partial matches):

Use fuzzy matching
Rank by similarity
Include suggestions

Ranking and Relevance Scoring

Relevance Factors

1. Exact Match Score:

Exact match: 100%
Prefix match: 80%
Fuzzy match: 60%

2. Chain Relevance:

User's preferred chain: +20%
Popular chains: +10%

3. Entity Type Relevance:

Addresses: Highest (most specific)
Transactions: High
Blocks: Medium
Tokens: Medium
Contracts: Lower (unless verified)

4. Popularity Score:

Transaction count
Token holder count
Contract usage

Scoring Formula

score = (exact_match_score * 0.5) + 
        (chain_relevance * 0.2) + 
        (entity_type_relevance * 0.2) + 
        (popularity_score * 0.1)

Result Aggregation

Aggregation Strategy

Per-Chain Results:

Limit results per chain (e.g., top 10)
Combine across chains
Remove duplicates (same address on multiple chains)

Result Format

{
  "query": "0x123...",
  "total_results": 5,
  "results": [
    {
      "type": "address",
      "chain_id": 138,
      "address": "0x123...",
      "label": "My Wallet",
      "score": 0.95
    },
    {
      "type": "transaction",
      "chain_id": 138,
      "hash": "0x123...",
      "score": 0.80
    }
  ],
  "chains_searched": [138, 1, 137]
}

Performance Optimization

Caching

Cache Strategy:

Cache popular queries (top 1000)
Cache duration: 1 minute
Invalidate on data updates

Parallel Search

Strategy: Execute searches across chains in parallel

Benefits:

Faster response time
Better resource utilization

Result Limiting

Per-Chain Limit: Top 10-20 results per chain Total Limit: Top 50-100 results total

Search Indexes

Per-Chain Indices

Index Names: {entity_type}-{chain_id} (e.g., addresses-138)