Spine benchmark columnar performance improvements#732
Merged
frankmcsherry merged 5 commits intoTimelyDataflow:master-nextfrom Apr 29, 2026
Merged
Spine benchmark columnar performance improvements#732frankmcsherry merged 5 commits intoTimelyDataflow:master-nextfrom
frankmcsherry merged 5 commits intoTimelyDataflow:master-nextfrom
Conversation
Brings back the spines arrangement bake-off (deleted in TimelyDataflow#724 Spring cleaning, then RHH-dependent) with three modes: `key` (OrdKeySpine), `val` (OrdValSpine with Val=()), and `col` (columnar ValSpine via the columnar module added in TimelyDataflow#730). All three feed the same Vec-shaped input collections through one driver loop; `col` repacks via a small in-dataflow `unary` (`ToRecorded`) that builds `RecordedUpdates` containers before `arrange_core`. Bisecting against the example exposed a regression introduced in TimelyDataflow#725: EditList::load now delegates to populate_key, which seek_keys + checks + rewinds vals on every call. In the merge-join inner loop (join.rs Ordering::Equal arm), the cursor is already positioned by the upstream `match trace_key.cmp(&batch_key)` work, so the seek is redundant. Repeated 1M times in the spines query phase, this added ~3s (+40% queries time vs pre-TimelyDataflow#725 baseline). Restoring EditList::load to its pre-TimelyDataflow#725 division of labor — assume the cursor is positioned, walk vals inline — recovers performance. populate_key and replay_key keep the seek for callers that legitimately need it (reduce, ValueHistory). The Option-based meet API from TimelyDataflow#725 stays. Measurements (1M keys, 1000 size, key mode): - v0.23.0 baseline: 6.56s queries - pre-TimelyDataflow#725 (f4e7550): 7.16s queries - master HEAD before this commit: 10.12s queries - this commit: 7.00s queries Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This PR re-adds
examples/spines.rs(removed in spring cleaning) to compare the in-tree columnar representations with existingval/keyidioms. Several scaling glitches were observed, many of them improved, although surely several more remain.