feat(embeddings): added model2vec by vrn21 · Pull Request #778 · HelixDB/helix-db

vrn21 · 2025-12-19T11:57:03Z

Description

Add model2vec-rs as 4th embedding provider

Closes #721

Summary

Adds model2vec-rs as a new embedding provider for free, local, offline embedding generation without API keys or external servers.

Changes

Dependencies

Added model2vec-rs = { version = "0.1", optional = true } to Cargo.toml
Added model2vec = ["model2vec-rs"] feature flag

Implementation (115 lines across 3 files)

helix-db/src/helix_gateway/embedding_providers/mod.rs:

Added Model2Vec { model_name: String } variant to EmbeddingProvider enum
Added model2vec: Option<StaticModel> field to EmbeddingModelImpl (feature-gated)
Implemented model loading in constructor via StaticModel::from_pretrained()
Implemented fetch_embedding_async() using tokio::task::spawn_blocking() for sync→async conversion
Added parser for "model2vec:{model}" prefix (default: minishlab/potion-base-32M)
Added comprehensive inline documentation (68 lines module docs + provider-specific comments)
f32→f64 conversion for HelixDB compatibility

helix-db/src/helix_gateway/tests/embedding_providers.rs:

Added test_parse_model2vec_provider - validates parsing with explicit model
Added test_parse_model2vec_default - validates default model fallback
Added test_model2vec_embedding (#[ignore]) - integration test requiring model download

Technical Details

Model Loading:

Models downloaded from HuggingFace Hub on first use
Cached in ~/.cache/huggingface/
Loaded once in constructor, reused for all embeddings
StaticModel is Clone (Arc-based, cheap)

Async Handling:

encode_single() is sync/CPU-bound
Wrapped in tokio::task::spawn_blocking() to avoid blocking async runtime
Returns Vec<f64> like other providers

Available Models:

minishlab/potion-base-2M (2MB, 256 dims)
minishlab/potion-base-8M (8MB, 256 dims)
minishlab/potion-base-32M (32MB, 768 dims) [default]
minishlab/potion-retrieval-32M (32MB, 768 dims)

Testing

# Unit tests
cargo test --lib --features model2vec embedding_providers
# Result: 17 passed, 0 failed, 5 ignored

# Build verification
cargo build --features server,model2vec
# Result: Success, no warnings

## Usage
# Feature flag:

```bash
cargo build --features server,model2vec

Configuration (config.hx.json):

{
  "embedding_model": "model2vec:minishlab/potion-base-32M"
}

HelixQL:

QUERY search(query: String) =>
    results <- SearchV<Document>(Embed(query), 10)
    RETURN results

Breaking Changes

None. All changes are additive:

New feature flag (opt-in)
New enum variant (non-breaking)
New optional field (feature-gated)
Existing providers unchanged

Checklist when merging to main

No compiler warnings (if applicable)
Code is formatted with rustfmt
No useless or dead code (if applicable)
Code is easy to understand
Doc comments are used for all functions, enums, structs, and fields (where appropriate)
All tests pass
Performance has not regressed (assuming change was not to fix a bug)
Version number has been updated in helix-cli/Cargo.toml and helixdb/Cargo.toml

Additional Notes

Greptile Summary

This PR successfully adds model2vec-rs as a fourth embedding provider, enabling free, local, offline embedding generation without API keys. The implementation follows existing patterns for other providers with proper feature gating, comprehensive documentation, and async handling.

Key Changes:

Added optional model2vec-rs dependency with feature flag in helix-db/Cargo.toml
Implemented Model2Vec variant in EmbeddingProvider enum with model loading via StaticModel::from_pretrained()
Used tokio::task::spawn_blocking() to handle synchronous encode_single() without blocking async runtime
Added f32→f64 conversion for HelixDB compatibility
Created parsing logic for "model2vec:{model}" format with default minishlab/potion-base-32M
Added 68 lines of comprehensive module-level documentation explaining all four providers
Included three unit tests (two for parsing, one integration test marked #[ignore])

Minor Issues Found:

One test (test_parse_model2vec_default) doesn't fully assert the returned model value, though this is a minor style issue

Important Files Changed

Filename	Overview
helix-db/Cargo.toml	added optional `model2vec-rs` dependency and `model2vec` feature flag
helix-db/src/helix_gateway/embedding_providers/mod.rs	implemented Model2Vec provider with comprehensive documentation, async handling, and proper feature gating
helix-db/src/helix_gateway/tests/embedding_providers.rs	added three tests for Model2Vec provider parsing and embedding generation, but found one issue with test assertion

Sequence Diagram

sequenceDiagram
    participant User
    participant Config
    participant EmbeddingModelImpl
    participant StaticModel
    participant TokenioRuntime
    participant ThreadPool

    User->>Config: configure model2vec provider
    User->>EmbeddingModelImpl: new(api_key, model, url)
    EmbeddingModelImpl->>EmbeddingModelImpl: parse_provider_and_model()
    EmbeddingModelImpl->>EmbeddingModelImpl: extract model name
    EmbeddingModelImpl->>StaticModel: from_pretrained(model_name)
    Note over StaticModel: Downloads from HuggingFace<br/>Cached locally
    StaticModel-->>EmbeddingModelImpl: StaticModel instance
    EmbeddingModelImpl-->>User: EmbeddingModelImpl ready

    User->>EmbeddingModelImpl: fetch_embedding_async(text)
    EmbeddingModelImpl->>EmbeddingModelImpl: match Model2Vec provider
    EmbeddingModelImpl->>EmbeddingModelImpl: clone text and model
    EmbeddingModelImpl->>TokenioRuntime: spawn_blocking(encode_single)
    TokenioRuntime->>ThreadPool: schedule blocking task
    ThreadPool->>StaticModel: encode_single(text)
    StaticModel-->>ThreadPool: Vec f32 embedding
    ThreadPool->>ThreadPool: convert f32 to f64
    ThreadPool-->>TokenioRuntime: Vec f64 embedding
    TokenioRuntime-->>EmbeddingModelImpl: Result with embedding
    EmbeddingModelImpl-->>User: Vec f64 embedding

greptile-apps

Additional Comments (1)

helix-db/src/helix_gateway/tests/embedding_providers.rs, line 150 (link)

style: the returned model value is not being asserted. The test should verify the model string matches the default

then add after line 156:

_{3 files reviewed, 1 comment}

_{Edit Code Review Agent Settings | Greptile}

xav-db · 2025-12-22T14:45:36Z

@vrn21 resolve conflicts please

vrn21 · 2025-12-25T17:55:55Z

Sorry for the delay, have resolved the conflicts!

xav-db

LGTM

xav-db · 2026-01-09T09:55:57Z

please fix clippy check @vrn21

vrn21 · 2026-01-09T19:59:59Z

Clippy fixed @xav-db

vrn21 added 2 commits December 19, 2025 17:09

feat: model2vec usable as an embedding option

1349c7f

gating model2vec

dcacc04

greptile-apps bot reviewed Dec 19, 2025

View reviewed changes

fix for greptile

122ae66

xav-db changed the base branch from main to dev December 22, 2025 14:42

Merge branch 'dev' into rust2vec

fff4f8f

Merge branch 'dev' into rust2vec

e394f81

xav-db approved these changes Jan 8, 2026

View reviewed changes

update for clipyy

b8e1225

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(embeddings): added model2vec #778

feat(embeddings): added model2vec #778
vrn21 wants to merge 6 commits intoHelixDB:devfrom
vrn21:rust2vec

vrn21 commented Dec 19, 2025 •

edited by greptile-apps bot

Loading

Uh oh!

greptile-apps bot left a comment •

edited

Loading

Uh oh!

xav-db commented Dec 22, 2025

Uh oh!

vrn21 commented Dec 25, 2025

Uh oh!

xav-db left a comment

Uh oh!

xav-db commented Jan 9, 2026

Uh oh!

vrn21 commented Jan 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

vrn21 commented Dec 19, 2025 • edited by greptile-apps bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Add model2vec-rs as 4th embedding provider

Summary

Changes

Dependencies

Implementation (115 lines across 3 files)

Technical Details

Testing

HelixQL:

Breaking Changes

Checklist when merging to main

Additional Notes

Greptile Summary

Important Files Changed

Sequence Diagram

Uh oh!

greptile-apps bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Additional Comments (1)

Uh oh!

xav-db commented Dec 22, 2025

Uh oh!

vrn21 commented Dec 25, 2025

Uh oh!

xav-db left a comment

Choose a reason for hiding this comment

Uh oh!

xav-db commented Jan 9, 2026

Uh oh!

vrn21 commented Jan 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vrn21 commented Dec 19, 2025 •

edited by greptile-apps bot

Loading

greptile-apps bot left a comment •

edited

Loading