* feat(s3): add concurrent chunk prefetch for large file downloads
Add a pipe-based prefetch pipeline that overlaps chunk fetching with
response writing during S3 GetObject, SSE downloads, and filer proxy.
While chunk N streams to the HTTP response, fetch goroutines for the
next K chunks establish HTTP connections to volume servers ahead of
time, eliminating the RTT gap between sequential chunk fetches.
Uses io.Pipe for minimal memory overhead (~1MB per download regardless
of chunk size, vs buffering entire chunks). Also increases the
streaming read buffer from 64KB to 256KB to reduce syscall overhead.
Benchmark results (64KB chunks, prefetch=4):
- 0ms latency: 1058 → 2362 MB/s (2.2× faster)
- 5ms latency: 11.0 → 41.7 MB/s (3.8× faster)
- 10ms latency: 5.9 → 23.3 MB/s (4.0× faster)
- 20ms latency: 3.1 → 12.1 MB/s (3.9× faster)
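The overlap described above can be sketched with `io.Pipe` and a bounded channel. This is a hypothetical, minimal sketch of the pattern, not the SeaweedFS code: the channel's capacity plays the role of the prefetch depth K, and `fetchChunk` stands in for an HTTP GET to a volume server.

```go
package main

import (
	"fmt"
	"io"
)

// fetchChunk is a stand-in for an HTTP GET of one chunk from a volume server.
func fetchChunk(id int) []byte { return []byte(fmt.Sprintf("chunk-%d;", id)) }

// prefetchAll streams numChunks chunks in order while up to `prefetch`
// fetches run ahead of the consumer.
func prefetchAll(numChunks, prefetch int) string {
	readers := make(chan *io.PipeReader, prefetch) // channel capacity bounds the lookahead

	// Producer: one fetch goroutine per chunk; once `prefetch` results are
	// queued, the loop blocks until the consumer catches up.
	go func() {
		defer close(readers)
		for i := 0; i < numChunks; i++ {
			pr, pw := io.Pipe()
			go func(id int, pw *io.PipeWriter) {
				_, err := pw.Write(fetchChunk(id)) // overlaps with the copy below
				pw.CloseWithError(err)            // nil err => reader sees io.EOF
			}(i, pw)
			readers <- pr
		}
	}()

	// Consumer: copy chunks to the "response" strictly in order.
	var out []byte
	for pr := range readers {
		b, _ := io.ReadAll(pr)
		out = append(out, b...)
	}
	return string(out)
}

func main() {
	fmt.Println(prefetchAll(8, 4))
}
```

Because `io.Pipe` is unbuffered, each in-flight fetch holds only its copy buffer rather than a whole chunk, which is where the ~1MB-per-download figure comes from.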
* fix: address review feedback for prefetch pipeline
- Fix data race: use *chunkPipeResult (pointer) on channel to avoid
copying struct while fetch goroutines write to it. Confirmed clean
with -race detector.
- Remove concurrent map write: retryWithCacheInvalidation no longer
updates fileId2Url map. Producer only reads it; consumer never writes.
- Use mem.Allocate/mem.Free for copy buffer to reduce GC pressure.
- Add local cancellable context so consumer errors (client disconnect)
immediately stop the producer and all in-flight fetch goroutines.
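The pointer-on-channel fix can be illustrated as follows. The struct name comes from the commit; everything else is a hypothetical sketch. The fetch goroutine writes the fields and then closes `done`; the consumer receives a pointer and reads the fields only after `<-done`, so no struct copy can race with the writer.

```go
package main

import "fmt"

// chunkPipeResult: fetch goroutines fill in the fields, then close done.
// Consumers receive a *pointer* off the channel, never a struct copy.
type chunkPipeResult struct {
	written  int64
	fetchErr error
	done     chan struct{}
}

func sumWritten(numChunks int) int64 {
	results := make(chan *chunkPipeResult, numChunks)
	for i := 0; i < numChunks; i++ {
		res := &chunkPipeResult{done: make(chan struct{})}
		go func(id int, r *chunkPipeResult) { // fetch goroutine
			r.written = int64(100 * (id + 1)) // all field writes happen before close(done)
			close(r.done)
		}(i, res)
		results <- res // the pointer is sent; the struct is never copied mid-write
	}
	close(results)

	var total int64
	for r := range results {
		<-r.done // happens-before edge: the goroutine's field writes are visible now
		total += r.written
	}
	return total
}

func main() {
	fmt.Println(sumWritten(3))
}
```

The close/receive pair on `done` is what gives the consumer a happens-before edge under the Go memory model; the cancellable-context part of the fix is omitted here for brevity.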
* fix(test): remove dead code and add Range header support in test server
- Remove unused allData variable in makeChunksAndServer
- Add Range header handling to createTestServer for partial chunk
read coverage (206 Partial Content, 416 Range Not Satisfiable)
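A test server with that 206/416 behavior can be sketched like this. The helper names are hypothetical stand-ins for the commit's `createTestServer`, and the parsing assumes a single full `bytes=start-end` range; real Range parsing handles more forms.

```go
package main

import (
	"fmt"
	"io"
	"net/http"
	"net/http/httptest"
	"strconv"
	"strings"
)

// newChunkServer serves data and honors a single "bytes=start-end" Range
// header, answering 206 Partial Content or 416 Range Not Satisfiable.
func newChunkServer(data []byte) *httptest.Server {
	return httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		rng := r.Header.Get("Range")
		if rng == "" {
			w.Write(data)
			return
		}
		parts := strings.SplitN(strings.TrimPrefix(rng, "bytes="), "-", 2)
		start, _ := strconv.Atoi(parts[0])
		end, _ := strconv.Atoi(parts[1])
		if start >= len(data) || start > end {
			w.WriteHeader(http.StatusRequestedRangeNotSatisfiable) // 416
			return
		}
		if end >= len(data) {
			end = len(data) - 1
		}
		w.WriteHeader(http.StatusPartialContent) // 206
		w.Write(data[start : end+1])
	}))
}

// get issues a GET with an optional Range header and returns status + body.
func get(url, rng string) (int, string) {
	req, _ := http.NewRequest(http.MethodGet, url, nil)
	if rng != "" {
		req.Header.Set("Range", rng)
	}
	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		return 0, err.Error()
	}
	defer resp.Body.Close()
	b, _ := io.ReadAll(resp.Body)
	return resp.StatusCode, string(b)
}

func main() {
	srv := newChunkServer([]byte("0123456789"))
	defer srv.Close()
	code, body := get(srv.URL, "bytes=2-4")
	fmt.Println(code, body) // 206 234
	code, _ = get(srv.URL, "bytes=42-50")
	fmt.Println(code) // 416
}
```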
* fix: correct retry condition and goroutine leak in prefetch pipeline
- Fix retry condition: decide the cache-invalidation retry from
  result.fetchErr/result.written instead of the consumer-side copied
  count. The old condition wrongly triggered a retry when the fetch
  succeeded but the response writer failed on the first write
  (copied==0 even though the fetcher had data).
  Now matches the sequential path (stream.go:197), which checks whether
  the fetcher itself wrote zero bytes.
- Fix goroutine leak: when the producer's send to the results channel
  was interrupted by context cancellation, the fetch goroutine had
  already been launched but its result never reached the channel, so
  the drain loop could never receive it. The producer now waits on
  result.done before returning, so every fetch goroutine is properly
  awaited.
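The race between the send and cancellation can be sketched as below. Names and shapes are hypothetical, not the SeaweedFS code; the point is the `<-res.done` in the cancellation branch.

```go
package main

import (
	"context"
	"fmt"
)

type fetchResult struct {
	done chan struct{} // closed by the fetch goroutine when it finishes
}

// produce launches one fetch, then races the hand-off to the consumer
// against cancellation; on cancellation it still waits on done, so the
// already-launched fetch goroutine is always awaited instead of leaked.
func produce(ctx context.Context, results chan<- *fetchResult) string {
	res := &fetchResult{done: make(chan struct{})}
	go func() { close(res.done) }() // stand-in fetch goroutine
	select {
	case results <- res:
		return "sent"
	case <-ctx.Done():
		<-res.done // the fix: await the in-flight fetch before returning
		return "cancelled, fetch awaited"
	}
}

// demoCancelled simulates a client disconnect: the context is already
// cancelled and nobody receives from the unbuffered results channel.
func demoCancelled() string {
	ctx, cancel := context.WithCancel(context.Background())
	cancel()
	return produce(ctx, make(chan *fetchResult))
}

func main() {
	fmt.Println(demoCancelled())
}
```

Without the `<-res.done`, the fetch goroutine would block forever on its pipe write with no reader, which is exactly the leak the commit describes.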
see https://blog.aqwari.net/xml-schema-go/
1. go get aqwari.net/xml/cmd/xsdgen
2. Add EncodingType element for ListBucketResult in AmazonS3.xsd
3. xsdgen -o s3api_xsd_generated.go -pkg s3api AmazonS3.xsd
4. Remove empty Grantee struct in s3api_xsd_generated.go
5. Remove xmlns: sed 's/http:\/\/s3.amazonaws.com\/doc\/2006-03-01\/ //' s3api_xsd_generated.go
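After steps 2 and 5, the generated struct should marshal the plain element name with no namespace prefix in its tags. A trimmed, hypothetical slice of the result (the real generated struct has many more fields):

```go
package main

import (
	"encoding/xml"
	"fmt"
)

// ListBucketResult, reduced to two fields for illustration: step 2 adds
// the EncodingType element, and after step 5 the xml tags carry no
// "http://s3.amazonaws.com/doc/2006-03-01/ " namespace prefix.
type ListBucketResult struct {
	XMLName      xml.Name `xml:"ListBucketResult"`
	Name         string   `xml:"Name"`
	EncodingType string   `xml:"EncodingType,omitempty"`
}

func render(r ListBucketResult) string {
	b, _ := xml.Marshal(r)
	return string(b)
}

func main() {
	fmt.Println(render(ListBucketResult{Name: "bucket", EncodingType: "url"}))
	// <ListBucketResult><Name>bucket</Name><EncodingType>url</EncodingType></ListBucketResult>
}
```

With `omitempty`, responses that don't set `encoding-type` simply omit the element, matching S3's optional-field behavior.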