Dataframe v2: support for `filtered_index_values` #7589

teh-cmc · 2024-10-04T08:32:55Z

Title.

DNM: requires Dataframe v2: extensive test suite and associated bug fixes for all existing features #7587

Checklist

I have read and agree to the Contributor Guide and the Code of Conduct
I've included a screenshot or gif (if applicable)
I have tested the web demo (if applicable):
- Using examples from latest main build: rerun.io/viewer
- Using full set of examples from nightly build: rerun.io/viewer
The PR title and labels are set such as to maximize their usefulness for the next release's CHANGELOG
If applicable, add a new check to the release checklist!
I have noted any breaking changes to the log API in CHANGELOG.md and the migration guide

To run all checks from main, comment on the PR with @rerun-bot full-check.

jleibs · 2024-10-04T21:50:30Z

crates/store/re_dataframe2/src/query.rs

@@ -527,6 +527,13 @@ impl QueryHandle<'_> {
            .min_by_key(|streaming_state| streaming_state.index_value)
            .map(|streaming_state| streaming_state.index_value)?;

+        if let Some(filtered_index_values) = self.query.filtered_index_values.as_ref() {
+            if !filtered_index_values.contains(&cur_index_value) {
+                self.increment_cursors_at_index_value(cur_index_value);


I know we're not optimizing yet... but it seems like `increment_cursors_at_index_value()` could take an optional `next_index_value`, which we can easily look up from `filtered_index_values`. Then the inner increment could keep advancing for as long as the cursor is below `next_index_value`.

Additionally, if we made `increment_cursors_at_index_value()` return the minimum value across all the chunks, we could loop here and basically guarantee that the call to `next_row()` ends up with a value included in `filtered_index_values`.

teh-cmc · 2024-10-07

Index sampling and clear/tombstone support come first. I don't want to optimize myself into a corner because of weird interactions with other features...
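
For reference, the skip-ahead idea from this thread can be sketched in isolation. This is a minimal sketch under stated assumptions, not the code in `re_dataframe2`: `Cursor`, `increment_cursors`, and `next_index_value` are hypothetical stand-ins, index values are plain `i64`, each cursor's values are assumed strictly increasing, and the filter is assumed to be an ordered set (`BTreeSet`) so that the next wanted value can be looked up with `range`.

```rust
use std::collections::BTreeSet;

/// Hypothetical stand-in for one chunk's streaming state:
/// strictly increasing index values plus a read position.
struct Cursor {
    index_values: Vec<i64>,
    pos: usize,
}

impl Cursor {
    /// Index value under the cursor, or `None` once exhausted.
    fn current(&self) -> Option<i64> {
        self.index_values.get(self.pos).copied()
    }
}

/// Moves every cursor sitting at `cur` forward by one. If `next_wanted` is
/// given, also keeps advancing each cursor while it is below that value.
/// Returns the new minimum index value across all cursors.
fn increment_cursors(cursors: &mut [Cursor], cur: i64, next_wanted: Option<i64>) -> Option<i64> {
    for cursor in cursors.iter_mut() {
        if cursor.current() == Some(cur) {
            cursor.pos += 1;
        }
        if let Some(target) = next_wanted {
            while matches!(cursor.current(), Some(v) if v < target) {
                cursor.pos += 1;
            }
        }
    }
    cursors.iter().filter_map(Cursor::current).min()
}

/// Returns the next index value to emit, skipping anything not in `filter`.
fn next_index_value(cursors: &mut [Cursor], filter: Option<&BTreeSet<i64>>) -> Option<i64> {
    let mut cur = cursors.iter().filter_map(Cursor::current).min()?;

    if let Some(filter) = filter {
        while !filter.contains(&cur) {
            // Smallest filtered value above `cur`; if there is none,
            // nothing else can match and iteration is over.
            let next_wanted = *filter.range(cur..).next()?;
            cur = increment_cursors(cursors, cur, Some(next_wanted))?;
        }
    }

    // Consume the value we are about to emit.
    increment_cursors(cursors, cur, None);
    Some(cur)
}
```

With cursors over `[1, 2, 5, 7]` and `[2, 3, 7, 9]` and a filter of `{2, 7, 8}`, repeated calls to `next_index_value` yield `2`, then `7`, then `None`. Whether this interacts cleanly with index sampling and clears is exactly the open question raised in the reply above.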

teh-cmc added the enhancement, 🔍 re_query, do-not-merge, and include in changelog labels Oct 4, 2024

Base automatically changed from cmc/dataframev2_tests to main October 4, 2024 10:08

teh-cmc added 3 commits October 4, 2024 12:09

reusable cursor increment logic

c2c94d9

add test for filtered_index_values

7918560

implement support for filtered_index_values

12852da

teh-cmc force-pushed the cmc/dataframev2_filtered_index_values branch from be7eaaf to 12852da October 4, 2024 10:09

teh-cmc removed the do-not-merge label Oct 4, 2024

teh-cmc mentioned this pull request Oct 4, 2024

Dataframe v2: support for filtered_point_of_view #7593

Merged

6 tasks

jleibs approved these changes Oct 4, 2024

teh-cmc merged commit bed792e into main Oct 7, 2024
33 of 34 checks passed

teh-cmc deleted the cmc/dataframev2_filtered_index_values branch October 7, 2024 08:11
