Skip to content

chore(weave): add dataset_sources table migration#7198

Draft
jwlee64 wants to merge 1 commit into
masterfrom
josiah/dataset-sources-migration
Draft

chore(weave): add dataset_sources table migration#7198
jwlee64 wants to merge 1 commit into
masterfrom
josiah/dataset-sources-migration

Conversation

@jwlee64

@jwlee64 jwlee64 commented Jun 11, 2026

Copy link
Copy Markdown
Contributor

Description

  • Fixes WB-NNNNN
  • Fixes #NNNN

What does the PR do? Include a concise description of the PR contents.

Testing

How was this PR tested?

New ClickHouse table linking dataset rows to their provenance sources
(calls / agent spans). ReplacingMergeTree(updated_at) keyed on
(project_id, dataset_object_id, row_digest, source_kind, source_id);
bloom filters on source_id and source_trace_id power reverse lookups.

Insert-only versioning: reads collapse via GROUP BY + argMax, never
FINAL (benchmarked: FINAL is ~100x slower on the reverse lookup at 100M
rows). Shared membership-pattern invariants documented in
weave/trace_server/docs/membership_pattern.md, referenced from the
annotation_queues query builder (the first pattern instance).

jwlee64 commented Jun 11, 2026

Copy link
Copy Markdown
Contributor Author

@codecov

codecov Bot commented Jun 11, 2026

Copy link
Copy Markdown

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant