feat(uploads): direct-to-storage presigned uploads (single + multipart) by zfarrell · Pull Request #74 · hotdata-dev/sdk-rust

zfarrell · 2026-06-26T04:53:51Z

Adds ergonomic direct-to-storage uploads. Client::upload_file opens an upload session, PUTs bytes straight to object storage via server-minted presigned URLs (a single PUT for small files, bounded-concurrency multipart for large ones), then finalizes the session. There are zero S3/AWS dependencies and the legacy /v1/files path is not used. Includes configurable max_concurrency (default 10) with a memory-budget-bounded in-flight cap, automatic part-size scaling, header-isolated storage PUTs, and integrity verification at finalize. Validated end-to-end against prod for both single-PUT and multipart (148 tests).
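For readers who want a feel for how part-size scaling and a memory-bounded in-flight cap can interact, here is a minimal sketch. It is not the SDK's implementation: MAX_PARTS, MIN_PART_SIZE, and both function names are assumptions made up for illustration, and the real limits would come from the server and the SDK's own configuration.

```rust
/// Hypothetical limits, chosen only for this sketch.
const MAX_PARTS: u64 = 10_000;
const MIN_PART_SIZE: u64 = 8 * 1024 * 1024; // 8 MiB

/// Smallest part size (never below MIN_PART_SIZE) that keeps the
/// number of parts within MAX_PARTS for a file of `total_bytes`.
fn scaled_part_size(total_bytes: u64) -> u64 {
    // Ceiling division: bytes each part must carry to fit in MAX_PARTS parts.
    let needed = total_bytes.div_ceil(MAX_PARTS);
    needed.max(MIN_PART_SIZE)
}

/// In-flight cap: the configured concurrency, reduced so that the
/// buffered parts fit in `memory_budget`, but never below one part.
fn in_flight_cap(max_concurrency: usize, part_size: u64, memory_budget: u64) -> usize {
    let by_memory = (memory_budget / part_size.max(1)).max(1);
    let by_memory = usize::try_from(by_memory).unwrap_or(usize::MAX);
    max_concurrency.max(1).min(by_memory)
}
```

Under these assumed limits, a 32 MiB budget with 8 MiB parts would allow 4 parts in flight even when max_concurrency is 10, and a part larger than the whole budget would still be uploaded one at a time.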

claude · 2026-06-26T04:55:48Z

+    fn respond(&self, _req: &Request) -> ResponseTemplate {
+        let now = self.active.fetch_add(1, Ordering::SeqCst) + 1;
+        self.peak.fetch_max(now, Ordering::SeqCst);
+        // We cannot decrement after the delay from here, so model "in-flight"
+        // as entries during the delay window: the delay keeps responses pending
+        // long enough that all admitted tasks overlap, and admission is bounded
+        // by the cap. Decrement immediately is fine because peak is a max.
+        self.active.fetch_sub(1, Ordering::SeqCst);
+        ResponseTemplate::new(200)
+            .insert_header("ETag", "\"c\"")
+            .set_delay(self.delay)
+    }


nit: (not blocking) This concurrency tracker can't actually detect an unbounded run. active is incremented and decremented synchronously inside respond, before set_delay takes effect — so the increment never persists across the 50ms delay window. Two respond() calls would have to execute their three atomics in literally overlapping nanoseconds to record peak >= 2. In practice peak reads ~1 regardless of the real in-flight count, so the peak <= 2 assertion would pass even if the JoinSet bound were removed and all 6 parts ran at once. The test gives false confidence in the very property it's named for.

To genuinely measure overlap, keep active incremented for the duration of the delay (e.g. have the responder block/sleep itself while holding the count, or model in-flight with an async responder that decrements after the delay) rather than decrementing immediately.
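As a rough illustration of the suggestion above, the sketch below keeps the counter incremented for the whole hold window by tying the decrement to a guard's Drop. It uses plain threads from the standard library rather than the test's mock server, and InFlight, Guard, and observed_peak are hypothetical names, so treat it as one possible shape rather than the fix that was applied.

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;
use std::thread;
use std::time::Duration;

/// Tracks how many simulated requests are in flight and the highest value seen.
#[derive(Default)]
struct InFlight {
    active: AtomicUsize,
    peak: AtomicUsize,
}

/// Decrements `active` only when dropped, i.e. after the hold window ends.
struct Guard(Arc<InFlight>);

impl InFlight {
    fn enter(this: &Arc<Self>) -> Guard {
        let now = this.active.fetch_add(1, Ordering::SeqCst) + 1;
        this.peak.fetch_max(now, Ordering::SeqCst);
        Guard(Arc::clone(this))
    }
}

impl Drop for Guard {
    fn drop(&mut self) {
        self.0.active.fetch_sub(1, Ordering::SeqCst);
    }
}

/// Runs `tasks` simulated requests in waves of at most `cap` and
/// returns the peak overlap the tracker observed.
fn observed_peak(tasks: usize, cap: usize, hold: Duration) -> usize {
    let tracker = Arc::new(InFlight::default());
    let cap = cap.max(1); // avoid an empty wave
    let mut remaining = tasks;
    while remaining > 0 {
        let wave = remaining.min(cap);
        let handles: Vec<_> = (0..wave)
            .map(|_| {
                let t = Arc::clone(&tracker);
                thread::spawn(move || {
                    let _guard = InFlight::enter(&t); // count held across the sleep
                    thread::sleep(hold);
                })
            })
            .collect();
        for h in handles {
            h.join().unwrap();
        }
        remaining -= wave;
    }
    tracker.peak.load(Ordering::SeqCst)
}
```

Because the count is held across the sleep, removing the bound (for example, a cap equal to the task count) shows up as a higher peak, which is the property the original tracker could not detect.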

claude · 2026-06-26T04:55:50Z

+    let create = models::CreateUploadRequest {
+        declared_size_bytes,
+        content_type: opts.content_type.clone().map(Some),
+        content_encoding: opts.content_encoding.clone().map(Some),
+        filename: filename.map(Some),
+        part_size: Some(Some(part_size_hint_i64)),
+        ..models::CreateUploadRequest::new(declared_size_bytes)
+    };


super nit: (not blocking) declared_size_bytes is set explicitly and also supplied via ..CreateUploadRequest::new(declared_size_bytes), so the field is written twice. This is harmless: the explicit field takes precedence, and the struct-update base only fills in defaults for the remaining fields. It would be slightly clearer to either drop the explicit declared_size_bytes line or document why both are present.
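To make the struct-update point concrete, here is a small sketch of the simpler form, where the value is passed only through new(..). The struct below is a hypothetical, trimmed-down stand-in for models::CreateUploadRequest (its real fields and types may differ), and build_request is an illustrative helper, not a function from the SDK.

```rust
/// Hypothetical stand-in for the generated request model.
#[derive(Debug, Default, PartialEq)]
struct CreateUploadRequest {
    declared_size_bytes: i64, // type assumed for this sketch
    content_type: Option<Option<String>>,
    part_size: Option<Option<i64>>,
}

impl CreateUploadRequest {
    /// Sets the one required field and defaults the rest.
    fn new(declared_size_bytes: i64) -> Self {
        Self {
            declared_size_bytes,
            ..Default::default()
        }
    }
}

/// Builds a request where declared_size_bytes is written exactly once,
/// via the struct-update base.
fn build_request(declared_size_bytes: i64, part_size_hint: i64) -> CreateUploadRequest {
    CreateUploadRequest {
        part_size: Some(Some(part_size_hint)),
        ..CreateUploadRequest::new(declared_size_bytes)
    }
}
```

Only the fields named explicitly override the base, so declared_size_bytes still arrives from new(..) and every unnamed field keeps its default.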

claude

Reviewed the presigned-upload module, wiring, and test suite. Orchestration, storage-PUT header isolation, finalize-exactly-once, retryable part PUTs, memory-budgeted concurrency, and size-overflow handling all look correct and are well documented. Two non-blocking notes left inline. LGTM.

claude

Both prior nits are resolved in the code:

The concurrency test now uses a raw-TCP storage server that genuinely holds in-flight PUTs across a hold duration and asserts peak reaches and is bounded by the cap — it measures real overlap rather than synchronously-decremented counters.
The duplicate declared_size_bytes write is gone; the value now comes solely from CreateUploadRequest::new(..).

The new module is well-structured and thoroughly tested: bounded-concurrency multipart with positioned per-part reads, retry-safe buffered part PUTs, single-shot finalize, strict storage-PUT header isolation, and complete error/edge-case coverage (overflow, malformed sessions, part-URL count mismatch, missing ETag, storage non-2xx). No blocking issues.
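The positioned per-part reads and the part-URL count check mentioned above can be pictured with a short sketch. It assumes each part is described by an (offset, length) pair and that the session hands back one presigned URL per part; PlanError, part_ranges, and plan_parts are hypothetical names, not the module's actual types.

```rust
/// Hypothetical error type for this sketch.
#[derive(Debug, PartialEq)]
enum PlanError {
    ZeroPartSize,
    PartUrlCountMismatch { expected: usize, got: usize },
}

/// Splits `total_bytes` into (offset, length) ranges of at most `part_size`.
/// Each range can then be read independently at its own offset.
fn part_ranges(total_bytes: u64, part_size: u64) -> Result<Vec<(u64, u64)>, PlanError> {
    if part_size == 0 {
        return Err(PlanError::ZeroPartSize);
    }
    let mut ranges = Vec::new();
    let mut offset = 0u64;
    while offset < total_bytes {
        // The last part may be shorter than part_size.
        let len = part_size.min(total_bytes - offset);
        ranges.push((offset, len));
        offset += len;
    }
    Ok(ranges)
}

/// Rejects a session whose number of part URLs does not match the plan.
fn plan_parts(
    total_bytes: u64,
    part_size: u64,
    url_count: usize,
) -> Result<Vec<(u64, u64)>, PlanError> {
    let ranges = part_ranges(total_bytes, part_size)?;
    if ranges.len() != url_count {
        return Err(PlanError::PartUrlCountMismatch {
            expected: ranges.len(),
            got: url_count,
        });
    }
    Ok(ranges)
}
```

Computing the ranges up front is what lets parts be uploaded out of order: each task only needs its own offset and length, not a shared read cursor.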

zfarrell added 5 commits June 25, 2026 19:30

feat(uploads): add presigned direct-to-storage upload

1ea578e

feat(uploads): byte-granular progress for single-PUT

18279a6

fix(uploads): send empty object for single-PUT finalize

c04d3e2

feat(uploads): configurable concurrency and auto part-size scaling

215b640

fix(uploads): address upload-path review findings

6f28ab8

claude Bot reviewed Jun 26, 2026

claude Bot previously approved these changes Jun 26, 2026

test(uploads): faithfully measure upload concurrency; drop redundant field

b79fcc8

zfarrell dismissed claude[bot]’s stale review via b79fcc8 June 26, 2026 05:03

claude Bot approved these changes Jun 26, 2026

zfarrell merged commit f8e6bf9 into main Jun 26, 2026
5 of 6 checks passed

zfarrell merged 6 commits into main from feat/presigned-upload
