* filer: improve FoundationDB performance by disabling batch by default This PR addresses a performance issue where FoundationDB filer was achieving only ~757 ops/sec with 12 concurrent S3 clients, despite FDB being capable of 17,000+ ops/sec. Root cause: The write batcher was waiting up to 5ms for each operation to batch, even though S3 semantics require waiting for durability confirmation. This added artificial latency that defeated the purpose of batching. Changes: - Disable write batching by default (batch_enabled = false) - Each write now commits immediately in its own transaction - Reduce batch interval from 5ms to 1ms when batching is enabled - Add batch_enabled config option to toggle behavior - Improve batcher to collect available ops without blocking - Add benchmarks comparing batch vs no-batch performance Benchmark results (16 concurrent goroutines): - With batch: 2,924 ops/sec (342,032 ns/op) - Without batch: 4,625 ops/sec (216,219 ns/op) - Improvement: +58% faster Configuration: - Default: batch_enabled = false (optimal for S3 PUT latency) - For bulk ingestion: set batch_enabled = true Also fixes ARM64 Docker test setup (shell compatibility, fdbserver path). * fix: address review comments - use atomic counter and remove duplicate batcher - Use sync/atomic.Uint64 for unique filenames in concurrent benchmarks - Remove duplicate batcher creation in createBenchmarkStoreWithBatching (initialize() already creates batcher when batchEnabled=true) * fix: add realistic default values to benchmark store helper Set directoryPrefix, timeout, and maxRetryDelay to reasonable defaults for more realistic benchmark conditions.
10 KiB
FoundationDB Filer Store Configuration Reference
This document provides comprehensive configuration options for the FoundationDB filer store.
Configuration Methods
1. Configuration File (filer.toml)
[foundationdb]
enabled = true
cluster_file = "/etc/foundationdb/fdb.cluster"
api_version = 740
timeout = "5s"
max_retry_delay = "1s"
directory_prefix = "seaweedfs"
2. Environment Variables
All configuration options can be set via environment variables with the WEED_FOUNDATIONDB_ prefix:
export WEED_FOUNDATIONDB_ENABLED=true
export WEED_FOUNDATIONDB_CLUSTER_FILE=/etc/foundationdb/fdb.cluster
export WEED_FOUNDATIONDB_API_VERSION=740
export WEED_FOUNDATIONDB_TIMEOUT=5s
export WEED_FOUNDATIONDB_MAX_RETRY_DELAY=1s
export WEED_FOUNDATIONDB_DIRECTORY_PREFIX=seaweedfs
# Write batching (disabled by default)
export WEED_FOUNDATIONDB_BATCH_ENABLED=false
export WEED_FOUNDATIONDB_BATCH_SIZE=100
export WEED_FOUNDATIONDB_BATCH_INTERVAL=1ms
3. Command Line Arguments
While not directly supported, configuration can be specified via config files passed to the weed command.
Configuration Options
Basic Options
| Option | Type | Default | Description |
|---|---|---|---|
enabled |
boolean | false |
Enable the FoundationDB filer store |
cluster_file |
string | /etc/foundationdb/fdb.cluster |
Path to FoundationDB cluster file |
api_version |
integer | 740 |
FoundationDB API version to use |
Connection Options
| Option | Type | Default | Description |
|---|---|---|---|
timeout |
duration | 5s |
Transaction timeout duration |
max_retry_delay |
duration | 1s |
Maximum delay between retries |
Storage Options
| Option | Type | Default | Description |
|---|---|---|---|
directory_prefix |
string | seaweedfs |
Directory prefix for key organization |
Write Batching Options
| Option | Type | Default | Description |
|---|---|---|---|
batch_enabled |
boolean | false |
Enable write batching. Disabled by default for optimal S3 PUT latency. |
batch_size |
integer | 100 |
Maximum number of operations per batch (when batching is enabled) |
batch_interval |
duration | 1ms |
Maximum time to wait before flushing a batch (when batching is enabled) |
Note: Write batching is disabled by default. Each write commits immediately in its own transaction, providing optimal latency for S3 PUT operations. Enable batching only for high-throughput bulk ingestion workloads where you can tolerate slightly higher per-operation latency.
Configuration Examples
Development Environment
[foundationdb]
enabled = true
cluster_file = "/var/fdb/config/fdb.cluster"
api_version = 740
timeout = "10s"
max_retry_delay = "2s"
directory_prefix = "seaweedfs_dev"
Production Environment
[foundationdb]
enabled = true
cluster_file = "/etc/foundationdb/fdb.cluster"
api_version = 740
timeout = "30s"
max_retry_delay = "5s"
directory_prefix = "seaweedfs_prod"
High-Performance Setup (Low Latency)
For S3 workloads requiring low latency per operation:
[foundationdb]
enabled = true
cluster_file = "/etc/foundationdb/fdb.cluster"
api_version = 740
timeout = "5s"
max_retry_delay = "1s"
directory_prefix = "sw" # Shorter prefix for efficiency
# Batching disabled (default) for optimal per-operation latency
batch_enabled = false
High-Throughput Bulk Ingestion
For bulk data loading where throughput matters more than per-operation latency:
[foundationdb]
enabled = true
cluster_file = "/etc/foundationdb/fdb.cluster"
api_version = 740
timeout = "60s"
max_retry_delay = "10s"
directory_prefix = "sw"
# Enable batching for higher throughput
batch_enabled = true
batch_size = 100
batch_interval = "1ms"
Path-Specific Configuration
Configure different FoundationDB settings for different paths:
# Default configuration
[foundationdb]
enabled = true
cluster_file = "/etc/foundationdb/fdb.cluster"
directory_prefix = "seaweedfs_main"
# Backup path with different prefix
[foundationdb.backup]
enabled = true
cluster_file = "/etc/foundationdb/fdb.cluster"
directory_prefix = "seaweedfs_backup"
location = "/backup"
timeout = "120s"
# Archive path with extended timeouts
[foundationdb.archive]
enabled = true
cluster_file = "/etc/foundationdb/fdb.cluster"
directory_prefix = "seaweedfs_archive"
location = "/archive"
timeout = "300s"
max_retry_delay = "30s"
Configuration Validation
Required Settings
The following settings are required for FoundationDB to function:
enabled = truecluster_filemust point to a valid FoundationDB cluster fileapi_versionmust match your FoundationDB installation
Validation Rules
api_versionmust be between 600 and 740timeoutmust be a valid duration string (e.g., "5s", "30s", "2m")max_retry_delaymust be a valid duration stringcluster_filemust exist and be readabledirectory_prefixmust not be empty
Error Handling
Invalid configurations will result in startup errors:
FATAL: Failed to initialize store for foundationdb: invalid timeout duration
FATAL: Failed to initialize store for foundationdb: failed to open FoundationDB database
FATAL: Failed to initialize store for foundationdb: cluster file not found
Performance Tuning
Timeout Configuration
| Use Case | Timeout | Max Retry Delay | Notes |
|---|---|---|---|
| Interactive workloads | 5s | 1s | Fast response times |
| Batch processing | 60s | 10s | Handle large operations |
| Archive operations | 300s | 30s | Very large data sets |
Connection Pool Settings
FoundationDB automatically manages connection pooling. No additional configuration needed.
Directory Organization
Use meaningful directory prefixes to organize data:
# Separate environments
directory_prefix = "prod_seaweedfs" # Production
directory_prefix = "staging_seaweedfs" # Staging
directory_prefix = "dev_seaweedfs" # Development
# Separate applications
directory_prefix = "app1_seaweedfs" # Application 1
directory_prefix = "app2_seaweedfs" # Application 2
Security Configuration
Cluster File Security
Protect the FoundationDB cluster file:
# Set proper permissions
sudo chown root:seaweedfs /etc/foundationdb/fdb.cluster
sudo chmod 640 /etc/foundationdb/fdb.cluster
Network Security
FoundationDB supports TLS encryption. Configure in the cluster file:
description:cluster_id@tls(server1:4500,server2:4500,server3:4500)
Access Control
Use FoundationDB's built-in access control mechanisms when available.
Monitoring Configuration
Health Check Settings
Configure health check timeouts appropriately:
[foundationdb]
enabled = true
timeout = "10s" # Reasonable timeout for health checks
Logging Configuration
Enable verbose logging for troubleshooting:
# Start SeaweedFS with debug logs
WEED_FOUNDATIONDB_ENABLED=true weed -v=2 server -filer
Migration Configuration
From Other Filer Stores
When migrating from other filer stores:
- Configure both stores temporarily
- Use path-specific configuration for gradual migration
- Migrate data using SeaweedFS tools
# During migration - keep old store for reads
[leveldb2]
enabled = true
dir = "/old/filer/data"
# New writes go to FoundationDB
[foundationdb.migration]
enabled = true
location = "/new"
cluster_file = "/etc/foundationdb/fdb.cluster"
Backup Configuration
Metadata Backup Strategy
# Main storage
[foundationdb]
enabled = true
directory_prefix = "seaweedfs_main"
# Backup storage (different cluster recommended)
[foundationdb.backup]
enabled = true
cluster_file = "/etc/foundationdb/backup_fdb.cluster"
directory_prefix = "seaweedfs_backup"
location = "/backup"
Container Configuration
Docker Environment Variables
# Docker environment
WEED_FOUNDATIONDB_ENABLED=true
WEED_FOUNDATIONDB_CLUSTER_FILE=/var/fdb/config/fdb.cluster
WEED_FOUNDATIONDB_API_VERSION=740
Kubernetes ConfigMap
apiVersion: v1
kind: ConfigMap
metadata:
name: seaweedfs-config
data:
filer.toml: |
[foundationdb]
enabled = true
cluster_file = "/var/fdb/config/cluster_file"
api_version = 740
timeout = "30s"
max_retry_delay = "5s"
directory_prefix = "k8s_seaweedfs"
Troubleshooting Configuration
Debug Configuration
[foundationdb]
enabled = true
cluster_file = "/etc/foundationdb/fdb.cluster"
timeout = "60s" # Longer timeouts for debugging
max_retry_delay = "10s"
directory_prefix = "debug_seaweedfs"
Test Configuration
[foundationdb]
enabled = true
cluster_file = "/tmp/fdb.cluster" # Test cluster
timeout = "5s"
directory_prefix = "test_seaweedfs"
Configuration Best Practices
1. Environment Separation
Use different directory prefixes for different environments:
- Production:
prod_seaweedfs - Staging:
staging_seaweedfs - Development:
dev_seaweedfs
2. Timeout Settings
- Interactive: 5-10 seconds
- Batch: 30-60 seconds
- Archive: 120-300 seconds
3. Cluster File Management
- Use absolute paths for cluster files
- Ensure proper file permissions
- Keep backup copies of cluster files
4. Directory Naming
- Use descriptive prefixes
- Include environment/application identifiers
- Keep prefixes reasonably short for efficiency
5. Error Handling
- Configure appropriate timeouts
- Monitor retry patterns
- Set up alerting for configuration errors
Configuration Testing
Validation Script
#!/bin/bash
# Test FoundationDB configuration
# Check cluster file
if [ ! -f "$WEED_FOUNDATIONDB_CLUSTER_FILE" ]; then
echo "ERROR: Cluster file not found: $WEED_FOUNDATIONDB_CLUSTER_FILE"
exit 1
fi
# Test connection
fdbcli -C "$WEED_FOUNDATIONDB_CLUSTER_FILE" --exec 'status' > /dev/null
if [ $? -ne 0 ]; then
echo "ERROR: Cannot connect to FoundationDB cluster"
exit 1
fi
echo "Configuration validation passed"
Integration Testing
# Test configuration with SeaweedFS
cd test/foundationdb
make check-env
make test-unit