Skip to content

Merge branch 'main' into perf/smarter-codecs

d5c712c
Select commit
Loading
Failed to load commit list.
Draft

sketch out improved performance by refactoring codec pipeline logic #3719

Merge branch 'main' into perf/smarter-codecs
d5c712c
Select commit
Loading
Failed to load commit list.
CodSpeed HQ / CodSpeed Performance Analysis succeeded Mar 13, 2026 in 0s

Performance Gate Passed

⚡ 59 improved benchmarks
✅ 7 untouched benchmarks
⏩ 6 skipped benchmarks1

Performance Changes

Mode Benchmark BASE HEAD Efficiency
WallTime test_write_array[memory-Layout(shape=(1000000,), chunks=(1000,), shards=(1000,))-gzip] 2,196.4 ms 704.2 ms ×3.1
WallTime test_write_array[memory-Layout(shape=(1000000,), chunks=(1000,), shards=None)-None] 556.8 ms 193.8 ms ×2.9
WallTime test_slice_indexing[(50, 50, 50)-(0, 0, 0)-memory] 1,742.5 µs 549.1 µs ×3.2
WallTime test_slice_indexing[None-(slice(None, None, None), slice(0, 3, 2), slice(0, 10, None))-memory] 3.8 ms 1.1 ms ×3.5
WallTime test_write_array[memory-Layout(shape=(1000000,), chunks=(1000,), shards=None)-gzip] 1,031.3 ms 283.1 ms ×3.6
WallTime test_slice_indexing[(50, 50, 50)-(slice(None, None, None), slice(None, None, None), slice(None, None, None))-memory_get_latency] 436.2 ms 117.8 ms ×3.7
WallTime test_slice_indexing[(50, 50, 50)-(slice(None, None, None), slice(None, None, None), slice(None, None, None))-memory] 419.6 ms 83.5 ms ×5
WallTime test_sharded_morton_indexing_large[(33, 33, 33)-memory] 10.3 s 1.8 s ×5.8
WallTime test_slice_indexing[None-(slice(None, None, None), slice(0, 3, 2), slice(0, 10, None))-memory_get_latency] 4.2 ms 2.2 ms +84.66%
WallTime test_sharded_morton_single_chunk[(32, 32, 32)-memory] 2,018.4 µs 681 µs ×3
WallTime test_slice_indexing[(50, 50, 50)-(0, 0, 0)-memory_get_latency] 4.2 ms 3.2 ms +28.54%
WallTime test_write_array[memory-Layout(shape=(1000000,), chunks=(1000,), shards=(1000,))-None] 1,622.2 ms 590.1 ms ×2.7
WallTime test_write_array[memory-Layout(shape=(1000000,), chunks=(100,), shards=(1000000,))-None] 5,367.9 ms 252.9 ms ×21
WallTime test_sharded_morton_single_chunk[(33, 33, 33)-memory] 1,978.3 µs 703.4 µs ×2.8
WallTime test_slice_indexing[(50, 50, 50)-(slice(None, 10, None), slice(None, 10, None), slice(None, 10, None))-memory_get_latency] 4.2 ms 3.3 ms +27.81%
WallTime test_write_array[memory-Layout(shape=(1000000,), chunks=(100,), shards=(1000000,))-gzip] 9.5 s 1.2 s ×7.7
WallTime test_slice_indexing[(50, 50, 50)-(slice(0, None, 4), slice(0, None, 4), slice(0, None, 4))-memory] 414.6 ms 78.9 ms ×5.3
WallTime test_sharded_morton_single_chunk[(30, 30, 30)-memory] 1,958 µs 659.6 µs ×3
WallTime test_slice_indexing[(50, 50, 50)-(slice(0, None, 4), slice(0, None, 4), slice(0, None, 4))-memory_get_latency] 431.3 ms 113.2 ms ×3.8
WallTime test_slice_indexing[(50, 50, 50)-(slice(10, -10, 4), slice(10, -10, 4), slice(10, -10, 4))-memory_get_latency] 243.5 ms 70.6 ms ×3.4
... ... ... ... ... ...

ℹ️ Only the first 20 benchmarks are displayed. Go to the app to view all benchmarks.


Comparing d-v-b:perf/smarter-codecs (d5c712c) with main (a02d996)

Open in CodSpeed

Footnotes

  1. 6 benchmarks were skipped, so the baseline results were used instead. If they were deleted from the codebase, click here and archive them to remove them from the performance reports.