fix: EC rebalance fails with replica placement 000 (#7812)
* fix: EC rebalance fails with replica placement 000

  This PR fixes several issues with EC shard distribution:

  1. Pre-flight check before EC encoding
     - Verify the target disk type has capacity before encoding starts
     - Prevents encoding shards only to fail during rebalance
     - Shows a helpful error when the wrong diskType is specified (e.g., ssd when volumes are on hdd)

  2. Fix EC rebalance with replica placement 000
     - When DiffRackCount=0, shards should be distributed freely across racks
     - The "000" placement means "no volume replication needed" because EC provides redundancy
     - Previously all racks were skipped with the error "shards X > replica placement limit (0)"

  3. Add unit tests for EC rebalance slot calculation
     - TestECRebalanceWithLimitedSlots: documents the limited-slots scenario
     - TestECRebalanceZeroFreeSlots: reproduces the 0 free slots error

  4. Add Makefile for manual EC testing
     - make setup: start cluster and populate data
     - make shell: open weed shell for EC commands
     - make clean: stop cluster and clean up

* fix: default -rebalance to true for ec.encode

  The -rebalance flag was defaulting to false, which meant ec.encode would only print shard moves but not actually execute them. This is a poor default, since the whole point of EC encoding is to distribute shards across servers for fault tolerance. Now -rebalance defaults to true, so shards are actually distributed after encoding. Users can pass -rebalance=false if they only want to see what would happen without making changes.

* test/erasure_coding: improve Makefile safety and docs

  - Narrow the pkill pattern for volume servers to use TEST_DIR instead of a port pattern, avoiding accidental kills of unrelated SeaweedFS processes
  - Document external dependencies (curl, jq) in header comments

* shell: refactor buildRackWithEcShards to reuse buildEcShards

  Extract the common shard-bit construction logic to avoid duplication between the buildEcShards and buildRackWithEcShards helper functions.

* shell: update test for EC replication 000 behavior

  When DiffRackCount=0 (replication "000"), EC shards should be distributed freely across racks, since erasure coding provides its own redundancy. Update the test expectation to reflect this behavior.

* erasure_coding: add distribution package for proportional EC shard placement

  Add a new reusable package for EC shard distribution that:
  - Supports configurable EC ratios (not hard-coded 10+4)
  - Distributes shards proportionally based on the replication policy
  - Provides fault-tolerance analysis
  - Prefers moving parity shards, to keep data shards spread out

  Key components:
  - ECConfig: configurable data/parity shard counts
  - ReplicationConfig: parsed XYZ replication policy
  - ECDistribution: target shard counts per DC/rack/node
  - Rebalancer: plans shard moves with a parity-first strategy

  This enables seaweed-enterprise custom EC ratios and weed worker integration while maintaining a clean, testable architecture.

* shell: integrate distribution package for EC rebalancing

  Add shell wrappers around the distribution package:
  - ProportionalECRebalancer: plans moves using distribution.Rebalancer
  - NewProportionalECRebalancerWithConfig: supports custom EC configs
  - GetDistributionSummary/GetFaultToleranceAnalysis: helper functions

  The shell layer converts between EcNode types and the generic TopologyNode types used by the distribution package.

* test setup

* ec: improve data and parity shard distribution across racks

  - Add a shardsByTypePerRack helper to track data vs parity shards
  - Rewrite doBalanceEcShardsAcrossRacks for two-pass balancing:
    1. Balance data shards (0-9) evenly, max ceil(10/6)=2 per rack
    2. Balance parity shards (10-13) evenly, max ceil(4/6)=1 per rack
  - Add balanceShardTypeAcrossRacks for generic shard-type balancing
  - Add pickRackForShardType to select a destination with room for the type
  - Add unit tests verifying even data/parity distribution

  This ensures an even read load during normal operation by spreading both data and parity shards across all available racks.

* ec: make data/parity shard counts configurable in ecBalancer

  - Add dataShardCount and parityShardCount fields to the ecBalancer struct
  - Add getDataShardCount() and getParityShardCount() methods with defaults
  - Replace direct constant usage with the configurable methods
  - Fix an unused-variable warning for parityPerRack

  This allows seaweed-enterprise to use custom EC ratios while defaulting to the standard 10+4 scheme.

* Address PR 7812 review comments

  Makefile improvements:
  - Save PIDs for each volume server for precise termination
  - Use PID-based killing in the stop target, with a pkill fallback
  - Use more specific pkill patterns with TEST_DIR paths

  Documentation:
  - Document the jq dependency in README.md

  Rebalancer fix:
  - Fix duplicate shard count updates in applyMovesToAnalysis
  - All planners (DC/rack/node) update counts inline during planning
  - Remove the duplicate updates from applyMovesToAnalysis to avoid double-counting

* test/erasure_coding: use mktemp for test file template

  Use mktemp instead of the hardcoded /tmp/testfile_template.bin path to provide better isolation for concurrent test runs.
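The per-rack caps in the two-pass balancing above come from ceiling division. A minimal sketch of that arithmetic, using a `ceilDivide` helper with the same semantics as the one in the balancer code:

```go
package main

import "fmt"

// ceilDivide returns ceil(a/b) in integer arithmetic.
func ceilDivide(a, b int) int {
	return (a + b - 1) / b
}

func main() {
	dataShards, parityShards, racks := 10, 4, 6
	// Max data shards any rack should hold: ceil(10/6) = 2
	fmt.Println(ceilDivide(dataShards, racks))
	// Max parity shards any rack should hold: ceil(4/6) = 1
	fmt.Println(ceilDivide(parityShards, racks))
}
```

With the standard 10+4 scheme on 6 racks, this yields the caps quoted in the commit message: at most 2 data shards and 1 parity shard per rack.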
.gitignore (vendored, +1)
@@ -130,3 +130,4 @@ coverage.out
 /test/s3/remote_cache/test-remote-data
 test/s3/remote_cache/remote-server.pid
 test/s3/remote_cache/primary-server.pid
+/test/erasure_coding/filerldb2
test/erasure_coding/Makefile (new file, +187)
@@ -0,0 +1,187 @@
+# Makefile for EC integration testing
+# Usage:
+#   make start    - Start the test cluster (master + 6 volume servers + filer)
+#   make stop     - Stop the test cluster
+#   make populate - Populate test data (~300MB across 7 volumes)
+#   make shell    - Open weed shell connected to the test cluster
+#   make clean    - Stop cluster and remove all test data
+#   make setup    - Start cluster and populate data (one command)
+#
+# Requirements: curl, jq
+
+WEED_BINARY := $(shell pwd)/../../weed/weed
+TEST_DIR := /tmp/ec_manual_test
+# Use non-standard ports to avoid conflicts with existing SeaweedFS servers
+MASTER_PORT := 29333
+FILER_PORT := 28888
+VOLUME_BASE_PORT := 28080
+NUM_VOLUME_SERVERS := 6
+VOLUME_SIZE_LIMIT_MB := 30
+MAX_VOLUMES_PER_SERVER := 10
+
+# Build weed binary if it doesn't exist
+$(WEED_BINARY):
+	cd ../../weed && go build -o weed .
+
+.PHONY: build
+build: $(WEED_BINARY)
+
+.PHONY: start
+start: build
+	@echo "=== Starting SeaweedFS test cluster ==="
+	@mkdir -p $(TEST_DIR)/master $(TEST_DIR)/filer
+	@for i in $$(seq 0 $$(($(NUM_VOLUME_SERVERS)-1))); do mkdir -p $(TEST_DIR)/volume$$i; done
+	@# Create security.toml with JWT disabled
+	@echo "# Disable JWT for testing" > $(TEST_DIR)/security.toml
+	@echo '[jwt.signing]' >> $(TEST_DIR)/security.toml
+	@echo 'key = ""' >> $(TEST_DIR)/security.toml
+	@echo 'expires_after_seconds = 0' >> $(TEST_DIR)/security.toml
+	@echo '' >> $(TEST_DIR)/security.toml
+	@echo '[jwt.signing.read]' >> $(TEST_DIR)/security.toml
+	@echo 'key = ""' >> $(TEST_DIR)/security.toml
+	@echo 'expires_after_seconds = 0' >> $(TEST_DIR)/security.toml
+	@# Create filer.toml with leveldb2
+	@echo '[leveldb2]' > $(TEST_DIR)/filer.toml
+	@echo 'enabled = true' >> $(TEST_DIR)/filer.toml
+	@echo 'dir = "$(TEST_DIR)/filer/filerldb2"' >> $(TEST_DIR)/filer.toml
+	@# Start master
+	@echo "Starting master on port $(MASTER_PORT)..."
+	@cd $(TEST_DIR) && $(WEED_BINARY) master \
+		-port=$(MASTER_PORT) \
+		-mdir=$(TEST_DIR)/master \
+		-volumeSizeLimitMB=$(VOLUME_SIZE_LIMIT_MB) \
+		-ip=127.0.0.1 \
+		> $(TEST_DIR)/master/master.log 2>&1 & echo $$! > $(TEST_DIR)/master.pid
+	@sleep 3
+	@# Start volume servers (run from TEST_DIR to find security.toml)
+	@for i in $$(seq 0 $$(($(NUM_VOLUME_SERVERS)-1))); do \
+		port=$$(($(VOLUME_BASE_PORT) + $$i)); \
+		echo "Starting volume server $$i on port $$port (rack$$i)..."; \
+		cd $(TEST_DIR) && $(WEED_BINARY) volume \
+			-port=$$port \
+			-dir=$(TEST_DIR)/volume$$i \
+			-max=$(MAX_VOLUMES_PER_SERVER) \
+			-master=127.0.0.1:$(MASTER_PORT) \
+			-ip=127.0.0.1 \
+			-dataCenter=dc1 \
+			-rack=rack$$i \
+			> $(TEST_DIR)/volume$$i/volume.log 2>&1 & echo $$! > $(TEST_DIR)/volume$$i.pid; \
+	done
+	@sleep 3
+	@# Start filer (run from TEST_DIR to find security.toml)
+	@echo "Starting filer on port $(FILER_PORT)..."
+	@cd $(TEST_DIR) && $(WEED_BINARY) filer \
+		-port=$(FILER_PORT) \
+		-master=127.0.0.1:$(MASTER_PORT) \
+		-ip=127.0.0.1 \
+		> $(TEST_DIR)/filer/filer.log 2>&1 & echo $$! > $(TEST_DIR)/filer.pid
+	@sleep 3
+	@echo ""
+	@echo "=== Cluster started ==="
+	@echo "Master:  http://127.0.0.1:$(MASTER_PORT)"
+	@echo "Filer:   http://127.0.0.1:$(FILER_PORT)"
+	@echo "Volume servers: http://127.0.0.1:$(VOLUME_BASE_PORT) - http://127.0.0.1:$$(($(VOLUME_BASE_PORT) + $(NUM_VOLUME_SERVERS) - 1))"
+	@echo ""
+	@echo "Run 'make shell' to open weed shell"
+	@echo "Run 'make populate' to add test data"
+
+.PHONY: stop
+stop:
+	@echo "=== Stopping SeaweedFS test cluster ==="
+	@# Stop filer by PID
+	@-[ -f $(TEST_DIR)/filer.pid ] && kill $$(cat $(TEST_DIR)/filer.pid) 2>/dev/null && rm -f $(TEST_DIR)/filer.pid || true
+	@# Stop volume servers by PID
+	@for i in $$(seq 0 $$(($(NUM_VOLUME_SERVERS)-1))); do \
+		[ -f $(TEST_DIR)/volume$$i.pid ] && kill $$(cat $(TEST_DIR)/volume$$i.pid) 2>/dev/null && rm -f $(TEST_DIR)/volume$$i.pid || true; \
+	done
+	@# Stop master by PID
+	@-[ -f $(TEST_DIR)/master.pid ] && kill $$(cat $(TEST_DIR)/master.pid) 2>/dev/null && rm -f $(TEST_DIR)/master.pid || true
+	@# Fallback: use pkill with specific patterns to ensure cleanup
+	@-pkill -f "weed filer.*-master=127.0.0.1:$(MASTER_PORT)" 2>/dev/null || true
+	@-pkill -f "weed volume.*-dir=$(TEST_DIR)/volume" 2>/dev/null || true
+	@-pkill -f "weed master.*-mdir=$(TEST_DIR)/master" 2>/dev/null || true
+	@echo "Cluster stopped."
+
+.PHONY: clean
+clean: stop
+	@echo "Removing test data..."
+	@rm -rf $(TEST_DIR)
+	@echo "Clean complete."
+
+.PHONY: populate
+populate:
+	@echo "=== Populating test data (~300MB) ==="
+	@# Create a 500KB test file template using mktemp for isolation
+	@tmpfile=$$(mktemp) && \
+	dd if=/dev/urandom bs=1024 count=500 of=$$tmpfile 2>/dev/null && \
+	uploaded=0; \
+	for i in $$(seq 1 600); do \
+		response=$$(curl -s "http://127.0.0.1:$(MASTER_PORT)/dir/assign?collection=ectest&replication=000"); \
+		fid=$$(echo $$response | jq -r '.fid'); \
+		url=$$(echo $$response | jq -r '.url'); \
+		if [ "$$fid" != "null" ] && [ -n "$$fid" ]; then \
+			curl -s -F "file=@$$tmpfile;filename=file_$$i.bin" "http://$$url/$$fid" > /dev/null; \
+			uploaded=$$((uploaded + 1)); \
+		fi; \
+		if [ $$((i % 100)) -eq 0 ]; then \
+			echo "Uploaded $$uploaded files..."; \
+		fi; \
+	done; \
+	rm -f $$tmpfile; \
+	echo ""; \
+	echo "=== Data population complete ==="; \
+	echo "Uploaded $$uploaded files (~$$((uploaded * 500 / 1024))MB)"
+	@echo ""
+	@echo "Volume status:"
+	@curl -s "http://127.0.0.1:$(MASTER_PORT)/vol/status" | jq -r \
+		'.Volumes.DataCenters.dc1 | to_entries[] | .key as $$rack | .value | to_entries[] | select(.value != null) | .key as $$server | .value[] | select(.Collection == "ectest") | "  Volume \(.Id): \(.FileCount) files, \((.Size/1048576*10|floor)/10)MB - \($$rack)"' 2>/dev/null || true
+
+.PHONY: shell
+shell: build
+	@echo "Opening weed shell..."
+	@echo "Commands to try:"
+	@echo "  lock"
+	@echo "  volume.list"
+	@echo "  ec.encode -collection=ectest -quietFor=1s -force"
+	@echo "  ec.balance -collection=ectest"
+	@echo "  unlock"
+	@echo ""
+	@$(WEED_BINARY) shell -master=127.0.0.1:$(MASTER_PORT) -filer=127.0.0.1:$(FILER_PORT)
+
+.PHONY: setup
+setup: clean start
+	@sleep 2
+	@$(MAKE) populate
+
+.PHONY: status
+status:
+	@echo "=== Cluster Status ==="
+	@curl -s "http://127.0.0.1:$(MASTER_PORT)/vol/status" | jq -r \
+		'.Volumes.DataCenters.dc1 | to_entries[] | .key as $$rack | .value | to_entries[] | select(.value != null) | .key as $$server | .value[] | select(.Collection == "ectest") | "Volume \(.Id): \(.FileCount) files, \((.Size/1048576*10|floor)/10)MB - \($$rack) (\($$server))"' 2>/dev/null | sort -t: -k1 -n || echo "Cluster not running"
	@echo ""
+	@echo "=== EC Shards ==="
+	@for i in $$(seq 0 $$(($(NUM_VOLUME_SERVERS)-1))); do \
+		count=$$(ls $(TEST_DIR)/volume$$i/*.ec[0-9]* 2>/dev/null | wc -l | tr -d ' '); \
+		if [ "$$count" != "0" ]; then \
+			echo "  volume$$i (port $$(($(VOLUME_BASE_PORT) + $$i))): $$count EC shard files"; \
+		fi; \
+	done
+
+.PHONY: help
+help:
+	@echo "EC Integration Test Makefile"
+	@echo ""
+	@echo "Targets:"
+	@echo "  make start    - Start test cluster (master + 6 volume servers + filer)"
+	@echo "  make stop     - Stop test cluster"
+	@echo "  make populate - Populate ~300MB of test data"
+	@echo "  make shell    - Open weed shell"
+	@echo "  make setup    - Clean, start, and populate (all-in-one)"
+	@echo "  make status   - Show cluster and EC shard status"
+	@echo "  make clean    - Stop cluster and remove all test data"
+	@echo "  make help     - Show this help"
+	@echo ""
+	@echo "Quick start:"
+	@echo "  make setup  # Start cluster and populate data"
+	@echo "  make shell  # Open shell to run EC commands"
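The populate target above leans on `jq` to pull the `fid` and `url` fields out of the master's `/dir/assign` response before uploading. The parsing step can be sanity-checked offline without a running cluster; the sample JSON below is only illustrative of the response shape, not captured from a real server:

```shell
# Sample response shaped like the master's /dir/assign output (illustrative values)
response='{"fid":"3,01637037d6","url":"127.0.0.1:28080","publicUrl":"127.0.0.1:28080","count":1}'

# Same extraction the populate recipe performs
fid=$(echo "$response" | jq -r '.fid')
url=$(echo "$response" | jq -r '.url')

# The upload then targets http://$url/$fid with a multipart POST
echo "would upload to http://$url/$fid"
```

The `-r` flag makes jq emit raw strings instead of JSON-quoted ones, which is what the shell interpolation in the recipe requires.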
@@ -78,6 +78,43 @@ go test -v -run TestECEncodingMasterTimingRaceCondition
 go test -v -short
 ```
 
+## Manual Testing with Makefile
+
+A Makefile is provided for manual EC testing.
+
+**Requirements:** `curl`, `jq` (command-line JSON processor)
+
+```bash
+# Quick start: start cluster and populate data
+make setup
+
+# Open weed shell to run EC commands
+make shell
+
+# Individual targets
+make start     # Start test cluster (master + 6 volume servers + filer)
+make stop      # Stop test cluster
+make populate  # Populate ~300MB of test data
+make status    # Show cluster and EC shard status
+make clean     # Stop cluster and remove all test data
+make help      # Show all targets
+```
+
+### EC Rebalance Limited Slots (Unit Test)
+
+The "no free ec shard slots" issue is tested with a **unit test** that works directly on
+topology data structures without requiring a running cluster.
+
+**Location**: `weed/shell/ec_rebalance_slots_test.go`
+
+Tests included:
+- `TestECRebalanceWithLimitedSlots`: Tests a topology with 6 servers, 7 EC volumes (98 shards)
+- `TestECRebalanceZeroFreeSlots`: Reproduces the exact 0 free slots scenario
+
+**Known Issue**: When volume servers are at capacity (`volumeCount == maxVolumeCount`),
+the rebalance step fails with "no free ec shard slots" instead of recognizing that
+moving shards frees slots on source servers.
+
 ## Test Results
 
 **With the fix**: Shows "Collecting volume locations for N volumes before EC encoding..." message
@@ -679,6 +679,25 @@ type ecBalancer struct {
 	applyBalancing     bool
 	maxParallelization int
 	diskType           types.DiskType // target disk type for EC shards (default: HardDriveType)
+
+	// EC configuration for shard distribution (defaults to 10+4)
+	dataShardCount   int
+	parityShardCount int
 }
+
+// getDataShardCount returns the configured data shard count, defaulting to standard 10
+func (ecb *ecBalancer) getDataShardCount() int {
+	if ecb.dataShardCount > 0 {
+		return ecb.dataShardCount
+	}
+	return erasure_coding.DataShardsCount
+}
+
+// getParityShardCount returns the configured parity shard count, defaulting to standard 4
+func (ecb *ecBalancer) getParityShardCount() int {
+	if ecb.parityShardCount > 0 {
+		return ecb.parityShardCount
+	}
+	return erasure_coding.ParityShardsCount
+}
 
 func (ecb *ecBalancer) errorWaitGroup() *ErrorWaitGroup {
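The getters above use the Go zero value to mean "use the default", so an `ecBalancer` constructed without explicit shard counts keeps the standard 10+4 scheme. A self-contained sketch of the same pattern, with local constants standing in for `erasure_coding.DataShardsCount` and `erasure_coding.ParityShardsCount`:

```go
package main

import "fmt"

// Stand-ins for erasure_coding.DataShardsCount / ParityShardsCount.
const (
	defaultDataShards   = 10
	defaultParityShards = 4
)

type ecBalancer struct {
	dataShardCount   int // 0 means "use the default"
	parityShardCount int // 0 means "use the default"
}

func (ecb *ecBalancer) getDataShardCount() int {
	if ecb.dataShardCount > 0 {
		return ecb.dataShardCount
	}
	return defaultDataShards
}

func (ecb *ecBalancer) getParityShardCount() int {
	if ecb.parityShardCount > 0 {
		return ecb.parityShardCount
	}
	return defaultParityShards
}

func main() {
	standard := &ecBalancer{}                                  // unset fields fall back to 10+4
	custom := &ecBalancer{dataShardCount: 6, parityShardCount: 3} // a custom 6+3 scheme
	fmt.Println(standard.getDataShardCount(), standard.getParityShardCount())
	fmt.Println(custom.getDataShardCount(), custom.getParityShardCount())
}
```

Because zero is the sentinel, callers that never heard of the new fields get the old behavior unchanged.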
@@ -785,59 +804,176 @@ func countShardsByRack(vid needle.VolumeId, locations []*EcNode, diskType types.
 	})
 }
 
+// shardsByTypePerRack counts data shards (< dataShards) and parity shards (>= dataShards) per rack
+func shardsByTypePerRack(vid needle.VolumeId, locations []*EcNode, diskType types.DiskType, dataShards int) (dataPerRack, parityPerRack map[string][]erasure_coding.ShardId) {
+	dataPerRack = make(map[string][]erasure_coding.ShardId)
+	parityPerRack = make(map[string][]erasure_coding.ShardId)
+	for _, ecNode := range locations {
+		shardBits := findEcVolumeShards(ecNode, vid, diskType)
+		rackId := string(ecNode.rack)
+		for _, shardId := range shardBits.ShardIds() {
+			if int(shardId) < dataShards {
+				dataPerRack[rackId] = append(dataPerRack[rackId], shardId)
+			} else {
+				parityPerRack[rackId] = append(parityPerRack[rackId], shardId)
+			}
+		}
+	}
+	return
+}
+
 func (ecb *ecBalancer) doBalanceEcShardsAcrossRacks(collection string, vid needle.VolumeId, locations []*EcNode) error {
 	racks := ecb.racks()
+	numRacks := len(racks)
 
-	// see the volume's shards are in how many racks, and how many in each rack
-	rackToShardCount := countShardsByRack(vid, locations, ecb.diskType)
-	// Calculate actual total shards for this volume (not hardcoded default)
-	var totalShardsForVolume int
-	for _, count := range rackToShardCount {
-		totalShardsForVolume += count
-	}
-	// calculate average number of shards an ec rack should have for one volume
-	averageShardsPerEcRack := ceilDivide(totalShardsForVolume, len(racks))
+	// Use configured EC scheme for shard type classification (defaults to 10+4)
+	dataShardCount := ecb.getDataShardCount()
+	parityShardCount := ecb.getParityShardCount()
+
+	// Get current distribution of data shards per rack (parity computed after data balancing)
+	dataPerRack, _ := shardsByTypePerRack(vid, locations, ecb.diskType, dataShardCount)
+
+	// Calculate max shards per rack for each type to ensure even spread
+	// Data: 10 shards / 6 racks = max 2 per rack
+	// Parity: 4 shards / 6 racks = max 1 per rack (with 2 racks having 0)
+	maxDataPerRack := ceilDivide(dataShardCount, numRacks)
+	maxParityPerRack := ceilDivide(parityShardCount, numRacks)
+
 	rackEcNodesWithVid := groupBy(locations, func(ecNode *EcNode) string {
 		return string(ecNode.rack)
 	})
 
-	// ecShardsToMove = select overflown ec shards from racks with ec shard counts > averageShardsPerEcRack
-	ecShardsToMove := make(map[erasure_coding.ShardId]*EcNode)
-	for rackId, count := range rackToShardCount {
-		if count <= averageShardsPerEcRack {
-			continue
-		}
-		possibleEcNodes := rackEcNodesWithVid[rackId]
-		for shardId, ecNode := range pickNEcShardsToMoveFrom(possibleEcNodes, vid, count-averageShardsPerEcRack, ecb.diskType) {
-			ecShardsToMove[shardId] = ecNode
-		}
-	}
-
-	for shardId, ecNode := range ecShardsToMove {
-		rackId, err := ecb.pickRackToBalanceShardsInto(racks, rackToShardCount)
-		if err != nil {
-			fmt.Printf("ec shard %d.%d at %s can not find a destination rack:\n%s\n", vid, shardId, ecNode.info.Id, err.Error())
-			continue
-		}
-		var possibleDestinationEcNodes []*EcNode
-		for _, n := range racks[rackId].ecNodes {
-			possibleDestinationEcNodes = append(possibleDestinationEcNodes, n)
-		}
-		err = ecb.pickOneEcNodeAndMoveOneShard(ecNode, collection, vid, shardId, possibleDestinationEcNodes)
-		if err != nil {
-			return err
-		}
-		rackToShardCount[string(rackId)] += 1
-		rackToShardCount[string(ecNode.rack)] -= 1
-		racks[rackId].freeEcSlot -= 1
-		racks[ecNode.rack].freeEcSlot += 1
-	}
-
-	return nil
-}
+	// Track total shard count per rack for slot management
+	rackToShardCount := countShardsByRack(vid, locations, ecb.diskType)
+
+	// First pass: Balance data shards across racks
+	if err := ecb.balanceShardTypeAcrossRacks(collection, vid, racks, rackEcNodesWithVid, dataPerRack, rackToShardCount, maxDataPerRack, "data"); err != nil {
+		return err
+	}
+
+	// Refresh locations after data shard moves and get parity distribution
+	locations = ecb.collectVolumeIdToEcNodes(collection)[vid]
+	_, parityPerRack := shardsByTypePerRack(vid, locations, ecb.diskType, dataShardCount)
+	rackEcNodesWithVid = groupBy(locations, func(ecNode *EcNode) string {
+		return string(ecNode.rack)
+	})
+	rackToShardCount = countShardsByRack(vid, locations, ecb.diskType)
+
+	// Second pass: Balance parity shards across racks
+	if err := ecb.balanceShardTypeAcrossRacks(collection, vid, racks, rackEcNodesWithVid, parityPerRack, rackToShardCount, maxParityPerRack, "parity"); err != nil {
+		return err
+	}
+
+	return nil
+}
+
+// balanceShardTypeAcrossRacks spreads shards of a specific type (data or parity) evenly across racks
+func (ecb *ecBalancer) balanceShardTypeAcrossRacks(
+	collection string,
+	vid needle.VolumeId,
+	racks map[RackId]*EcRack,
+	rackEcNodesWithVid map[string][]*EcNode,
+	shardsPerRack map[string][]erasure_coding.ShardId,
+	rackToShardCount map[string]int,
+	maxPerRack int,
+	shardType string,
+) error {
+	// Find racks with too many shards of this type
+	shardsToMove := make(map[erasure_coding.ShardId]*EcNode)
+	for rackId, shards := range shardsPerRack {
+		if len(shards) <= maxPerRack {
+			continue
+		}
+		// Pick excess shards to move
+		excess := len(shards) - maxPerRack
+		ecNodes := rackEcNodesWithVid[rackId]
+		for i := 0; i < excess && i < len(shards); i++ {
+			shardId := shards[i]
+			// Find which node has this shard
+			for _, ecNode := range ecNodes {
+				shardBits := findEcVolumeShards(ecNode, vid, ecb.diskType)
+				if shardBits.HasShardId(shardId) {
+					shardsToMove[shardId] = ecNode
+					break
+				}
+			}
+		}
+	}
+
+	// Move shards to racks that have fewer than maxPerRack of this type
+	for shardId, ecNode := range shardsToMove {
+		// Find destination rack with room for this shard type
+		destRackId, err := ecb.pickRackForShardType(racks, shardsPerRack, maxPerRack, rackToShardCount)
+		if err != nil {
+			fmt.Printf("ec %s shard %d.%d at %s can not find a destination rack:\n%s\n", shardType, vid, shardId, ecNode.info.Id, err.Error())
+			continue
+		}
+
+		var possibleDestinationEcNodes []*EcNode
+		for _, n := range racks[destRackId].ecNodes {
+			possibleDestinationEcNodes = append(possibleDestinationEcNodes, n)
+		}
+		err = ecb.pickOneEcNodeAndMoveOneShard(ecNode, collection, vid, shardId, possibleDestinationEcNodes)
+		if err != nil {
+			return err
+		}
+
+		// Update tracking
+		shardsPerRack[string(destRackId)] = append(shardsPerRack[string(destRackId)], shardId)
+		// Remove from source rack
+		srcRack := string(ecNode.rack)
+		for i, s := range shardsPerRack[srcRack] {
+			if s == shardId {
+				shardsPerRack[srcRack] = append(shardsPerRack[srcRack][:i], shardsPerRack[srcRack][i+1:]...)
+				break
+			}
+		}
+		rackToShardCount[string(destRackId)] += 1
+		rackToShardCount[srcRack] -= 1
+		racks[destRackId].freeEcSlot -= 1
+		racks[RackId(srcRack)].freeEcSlot += 1
+	}
+
+	return nil
+}
+
+// pickRackForShardType selects a rack that has room for more shards of a specific type
+func (ecb *ecBalancer) pickRackForShardType(
+	rackToEcNodes map[RackId]*EcRack,
+	shardsPerRack map[string][]erasure_coding.ShardId,
+	maxPerRack int,
+	rackToShardCount map[string]int,
+) (RackId, error) {
+	var candidates []RackId
+	minShards := maxPerRack + 1
+
+	for rackId, rack := range rackToEcNodes {
+		if rack.freeEcSlot <= 0 {
+			continue
+		}
+		currentCount := len(shardsPerRack[string(rackId)])
+		if currentCount >= maxPerRack {
+			continue
+		}
+		// For EC shards, replica placement constraint only applies when DiffRackCount > 0.
+		if ecb.replicaPlacement != nil && ecb.replicaPlacement.DiffRackCount > 0 && rackToShardCount[string(rackId)] >= ecb.replicaPlacement.DiffRackCount {
+			continue
+		}
+		if currentCount < minShards {
+			candidates = nil
+			minShards = currentCount
+		}
+		if currentCount == minShards {
+			candidates = append(candidates, rackId)
+		}
+	}
+
+	if len(candidates) == 0 {
+		return "", errors.New("no rack available for shard type balancing")
+	}
+	return candidates[rand.IntN(len(candidates))], nil
+}
 
 func (ecb *ecBalancer) pickRackToBalanceShardsInto(rackToEcNodes map[RackId]*EcRack, rackToShardCount map[string]int) (RackId, error) {
 	targets := []RackId{}
 	targetShards := -1
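The core of `shardsByTypePerRack` above is a simple ID threshold: shards below the data-shard count are data, the rest are parity. A minimal, self-contained sketch of that classification (plain `int` shard IDs stand in for `erasure_coding.ShardId`):

```go
package main

import "fmt"

// classifyShards splits shard IDs into data (< dataShards) and parity (>= dataShards),
// mirroring the threshold used by shardsByTypePerRack.
func classifyShards(shardIds []int, dataShards int) (data, parity []int) {
	for _, id := range shardIds {
		if id < dataShards {
			data = append(data, id)
		} else {
			parity = append(parity, id)
		}
	}
	return
}

func main() {
	// A rack holding shards 0, 9, 10, 13 of a 10+4 volume has two data
	// shards (0, 9) and two parity shards (10, 13).
	data, parity := classifyShards([]int{0, 9, 10, 13}, 10)
	fmt.Println(data, parity)
}
```

Splitting by type is what lets the two-pass balancer cap data and parity shards per rack independently, rather than only capping the combined total.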
@@ -855,7 +991,11 @@ func (ecb *ecBalancer) pickRackToBalanceShardsInto(rackToEcNodes map[RackId]*EcR
 			details += fmt.Sprintf("  Skipped %s because it has no free slots\n", rackId)
 			continue
 		}
-		if ecb.replicaPlacement != nil && shards > ecb.replicaPlacement.DiffRackCount {
+		// For EC shards, replica placement constraint only applies when DiffRackCount > 0.
+		// When DiffRackCount = 0 (e.g., replica placement "000"), EC shards should be
+		// distributed freely across racks for fault tolerance - the "000" means
+		// "no volume replication needed" because erasure coding provides redundancy.
+		if ecb.replicaPlacement != nil && ecb.replicaPlacement.DiffRackCount > 0 && shards > ecb.replicaPlacement.DiffRackCount {
 			details += fmt.Sprintf("  Skipped %s because shards %d > replica placement limit for other racks (%d)\n", rackId, shards, ecb.replicaPlacement.DiffRackCount)
 			continue
 		}
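The fixed guard can be distilled into a standalone predicate. The helper below is hypothetical (not a function in the codebase); it only captures the boolean logic of the changed condition, with the `DiffRackCount > 0` term added by the fix:

```go
package main

import "fmt"

// skipRackForPlacement reports whether replica placement would skip a rack,
// per the fixed guard: the constraint only applies when diffRackCount > 0,
// so replication "000" never rejects a rack for EC shards.
func skipRackForPlacement(diffRackCount, shardsOnRack int) bool {
	return diffRackCount > 0 && shardsOnRack > diffRackCount
}

func main() {
	// Replica placement "000": never skip; erasure coding supplies the redundancy.
	fmt.Println(skipRackForPlacement(0, 3)) // false
	// Replica placement "010": skip once a rack already exceeds the limit of 1.
	fmt.Println(skipRackForPlacement(1, 2)) // true
}
```

Before the fix the first term was missing, so with `DiffRackCount = 0` every rack holding at least one shard satisfied `shards > 0` and was skipped, producing the "shards X > replica placement limit (0)" failure from the PR description.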
@@ -1056,7 +1196,11 @@ func (ecb *ecBalancer) pickEcNodeToBalanceShardsInto(vid needle.VolumeId, existi
 		}
 
 		shards := nodeShards[node]
-		if ecb.replicaPlacement != nil && shards > ecb.replicaPlacement.SameRackCount+1 {
+		// For EC shards, replica placement constraint only applies when SameRackCount > 0.
+		// When SameRackCount = 0 (e.g., replica placement "000"), EC shards should be
+		// distributed freely within racks - the "000" means "no volume replication needed"
+		// because erasure coding provides redundancy.
+		if ecb.replicaPlacement != nil && ecb.replicaPlacement.SameRackCount > 0 && shards > ecb.replicaPlacement.SameRackCount+1 {
 			details += fmt.Sprintf("  Skipped %s because shards %d > replica placement limit for the rack (%d + 1)\n", node.info.Id, shards, ecb.replicaPlacement.SameRackCount)
 			continue
 		}
@@ -133,7 +133,9 @@ func TestPickRackToBalanceShardsInto(t *testing.T) {
 		{testTopologyEc, "6241", "123", []string{"rack1", "rack2", "rack3", "rack4", "rack5", "rack6"}, ""},
 		{testTopologyEc, "6242", "123", []string{"rack1", "rack2", "rack3", "rack4", "rack5", "rack6"}, ""},
 		// EC volumes.
-		{testTopologyEc, "9577", "", nil, "shards 1 > replica placement limit for other racks (0)"},
+		// With replication "000" (DiffRackCount=0), EC shards should be distributed freely
+		// because erasure coding provides its own redundancy. No replica placement error.
+		{testTopologyEc, "9577", "", []string{"rack1", "rack2", "rack3"}, ""},
 		{testTopologyEc, "9577", "111", []string{"rack1", "rack2", "rack3"}, ""},
 		{testTopologyEc, "9577", "222", []string{"rack1", "rack2", "rack3"}, ""},
 		{testTopologyEc, "10457", "222", []string{"rack1"}, ""},
@@ -94,7 +94,7 @@ func (c *commandEcEncode) Do(args []string, commandEnv *CommandEnv, writer io.Wr
 	shardReplicaPlacement := encodeCommand.String("shardReplicaPlacement", "", "replica placement for EC shards, or master default if empty")
 	sourceDiskTypeStr := encodeCommand.String("sourceDiskType", "", "filter source volumes by disk type (hdd, ssd, or empty for all)")
 	diskTypeStr := encodeCommand.String("diskType", "", "target disk type for EC shards (hdd, ssd, or empty for default hdd)")
-	applyBalancing := encodeCommand.Bool("rebalance", false, "re-balance EC shards after creation")
+	applyBalancing := encodeCommand.Bool("rebalance", true, "re-balance EC shards after creation (default: true)")
 	verbose := encodeCommand.Bool("verbose", false, "show detailed reasons why volumes are not selected for encoding")

 	if err = encodeCommand.Parse(args); err != nil {
@@ -164,6 +164,32 @@ func (c *commandEcEncode) Do(args []string, commandEnv *CommandEnv, writer io.Wr
 		return fmt.Errorf("failed to collect volume locations before EC encoding: %w", err)
 	}

+	// Pre-flight check: verify the target disk type has capacity for EC shards.
+	// This prevents encoding shards only to fail during rebalance.
+	_, totalFreeEcSlots, err := collectEcNodesForDC(commandEnv, "", diskType)
+	if err != nil {
+		return fmt.Errorf("failed to check EC shard capacity: %w", err)
+	}
+
+	// Calculate required slots: each volume needs TotalShardsCount (14) shards distributed.
+	requiredSlots := len(volumeIds) * erasure_coding.TotalShardsCount
+	if totalFreeEcSlots < 1 {
+		// No capacity at all on the target disk type
+		if diskType != types.HardDriveType {
+			return fmt.Errorf("no free ec shard slots on disk type '%s'. The target disk type has no capacity.\n"+
+				"Your volumes are likely on a different disk type. Try:\n"+
+				"  ec.encode -collection=%s -diskType=hdd\n"+
+				"Or omit -diskType to use the default (hdd)", diskType, *collection)
+		}
+		return fmt.Errorf("no free ec shard slots. only %d left on disk type '%s'", totalFreeEcSlots, diskType)
+	}
+
+	if totalFreeEcSlots < requiredSlots {
+		fmt.Printf("Warning: limited EC shard capacity. Need %d slots for %d volumes, but only %d slots available on disk type '%s'.\n",
+			requiredSlots, len(volumeIds), totalFreeEcSlots, diskType)
+		fmt.Printf("Rebalancing may not achieve optimal distribution.\n")
+	}
+
 	// encode all requested volumes...
 	if err = doEcEncode(commandEnv, writer, volumeIdToCollection, volumeIds, *maxParallelization); err != nil {
 		return fmt.Errorf("ec encode for volumes %v: %w", volumeIds, err)
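The slot arithmetic in the pre-flight hunk above can be sketched standalone. This is a simplified model, not the command's real API: `preflight` is a hypothetical helper, and `totalShardsCount = 14` stands in for `erasure_coding.TotalShardsCount` in the default 10+4 layout.

```go
package main

import "fmt"

const totalShardsCount = 14 // default 10+4 EC layout

// preflight models the check: every volume expands into 14 EC shards, so
// encoding numVolumes volumes needs numVolumes*14 free slots. Zero free slots
// is a hard error; fewer slots than required only triggers a warning.
func preflight(numVolumes, freeSlots int) (requiredSlots int, ok bool, warn bool) {
	requiredSlots = numVolumes * totalShardsCount
	if freeSlots < 1 {
		return requiredSlots, false, false // hard error: no capacity at all
	}
	return requiredSlots, true, freeSlots < requiredSlots
}

func main() {
	// 7 volumes against 60 free slots: 98 needed, so proceed with a warning.
	req, ok, warn := preflight(7, 60)
	fmt.Println(req, ok, warn) // 98 true true
}
```

This matches the scenario in the regression tests below: 7 encoded volumes produce 7 × 14 = 98 shards, more than a six-server cluster with 10 slots each can hold perfectly.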
@@ -4,6 +4,7 @@ import (
 	"testing"

 	"github.com/seaweedfs/seaweedfs/weed/pb/master_pb"
+	"github.com/seaweedfs/seaweedfs/weed/storage/erasure_coding"
 	"github.com/seaweedfs/seaweedfs/weed/storage/needle"
 	"github.com/seaweedfs/seaweedfs/weed/storage/types"
 )
@@ -136,3 +137,151 @@ func newEcNode(dc string, rack string, dataNodeId string, freeEcSlot int) *EcNod
 func (ecNode *EcNode) addEcVolumeAndShardsForTest(vid uint32, collection string, shardIds []uint32) *EcNode {
 	return ecNode.addEcVolumeShards(needle.VolumeId(vid), collection, shardIds, types.HardDriveType)
 }
+
+// TestCommandEcBalanceEvenDataAndParityDistribution verifies that after balancing:
+// 1. Data shards (0-9) are evenly distributed across racks (max 2 per rack for 6 racks)
+// 2. Parity shards (10-13) are evenly distributed across racks (max 1 per rack for 6 racks)
+func TestCommandEcBalanceEvenDataAndParityDistribution(t *testing.T) {
+	// Setup: All 14 shards start on rack1 (simulating fresh EC encode)
+	ecb := &ecBalancer{
+		ecNodes: []*EcNode{
+			// All shards initially on rack1/dn1
+			newEcNode("dc1", "rack1", "dn1", 100).addEcVolumeAndShardsForTest(1, "c1", []uint32{0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13}),
+			// Empty nodes on other racks
+			newEcNode("dc1", "rack2", "dn2", 100),
+			newEcNode("dc1", "rack3", "dn3", 100),
+			newEcNode("dc1", "rack4", "dn4", 100),
+			newEcNode("dc1", "rack5", "dn5", 100),
+			newEcNode("dc1", "rack6", "dn6", 100),
+		},
+		applyBalancing: false, // Dry-run mode (simulates moves by updating internal state)
+		diskType:       types.HardDriveType,
+	}
+
+	ecb.balanceEcVolumes("c1")
+
+	// After balancing (dry-run), verify the PLANNED distribution by checking what moves were proposed.
+	// The ecb.ecNodes state is updated during dry-run to track planned moves.
+	vid := needle.VolumeId(1)
+	dataShardCount := erasure_coding.DataShardsCount     // 10
+	parityShardCount := erasure_coding.ParityShardsCount // 4
+
+	// Count data and parity shards per rack based on current (updated) state
+	dataPerRack, parityPerRack := countDataAndParityShardsPerRack(ecb.ecNodes, vid, dataShardCount)
+
+	// With 6 racks:
+	// - Data shards (10): max 2 per rack (ceil(10/6) = 2)
+	// - Parity shards (4): max 1 per rack (ceil(4/6) = 1)
+	maxDataPerRack := ceilDivide(dataShardCount, 6)     // 2
+	maxParityPerRack := ceilDivide(parityShardCount, 6) // 1
+
+	// Verify no rack has more than max data shards
+	for rackId, count := range dataPerRack {
+		if count > maxDataPerRack {
+			t.Errorf("rack %s has %d data shards, expected max %d", rackId, count, maxDataPerRack)
+		}
+	}
+
+	// Verify no rack has more than max parity shards
+	for rackId, count := range parityPerRack {
+		if count > maxParityPerRack {
+			t.Errorf("rack %s has %d parity shards, expected max %d", rackId, count, maxParityPerRack)
+		}
+	}
+
+	// Verify all shards are distributed (total counts)
+	totalData := 0
+	totalParity := 0
+	for _, count := range dataPerRack {
+		totalData += count
+	}
+	for _, count := range parityPerRack {
+		totalParity += count
+	}
+	if totalData != dataShardCount {
+		t.Errorf("total data shards = %d, expected %d", totalData, dataShardCount)
+	}
+	if totalParity != parityShardCount {
+		t.Errorf("total parity shards = %d, expected %d", totalParity, parityShardCount)
+	}
+
+	// Verify data shards are spread across at least 5 racks (10 shards / 2 max per rack)
+	racksWithData := len(dataPerRack)
+	minRacksForData := dataShardCount / maxDataPerRack // At least 5 racks needed for 10 data shards
+	if racksWithData < minRacksForData {
+		t.Errorf("data shards spread across only %d racks, expected at least %d", racksWithData, minRacksForData)
+	}
+
+	// Verify parity shards are spread across at least 4 racks (4 shards / 1 max per rack)
+	racksWithParity := len(parityPerRack)
+	if racksWithParity < parityShardCount {
+		t.Errorf("parity shards spread across only %d racks, expected at least %d", racksWithParity, parityShardCount)
+	}
+
+	t.Logf("Distribution after balancing:")
+	t.Logf("  Data shards per rack: %v (max allowed: %d)", dataPerRack, maxDataPerRack)
+	t.Logf("  Parity shards per rack: %v (max allowed: %d)", parityPerRack, maxParityPerRack)
+}
+
+// countDataAndParityShardsPerRack counts data and parity shards per rack
+func countDataAndParityShardsPerRack(ecNodes []*EcNode, vid needle.VolumeId, dataShardCount int) (dataPerRack, parityPerRack map[string]int) {
+	dataPerRack = make(map[string]int)
+	parityPerRack = make(map[string]int)
+
+	for _, ecNode := range ecNodes {
+		shardBits := findEcVolumeShards(ecNode, vid, types.HardDriveType)
+		for _, shardId := range shardBits.ShardIds() {
+			rackId := string(ecNode.rack)
+			if int(shardId) < dataShardCount {
+				dataPerRack[rackId]++
+			} else {
+				parityPerRack[rackId]++
+			}
+		}
+	}
+	return
+}
+
+// TestCommandEcBalanceMultipleVolumesEvenDistribution tests that multiple volumes
+// each get their data and parity shards evenly distributed
+func TestCommandEcBalanceMultipleVolumesEvenDistribution(t *testing.T) {
+	// Setup: Two volumes, each with all 14 shards on different starting racks
+	ecb := &ecBalancer{
+		ecNodes: []*EcNode{
+			// Volume 1: all shards on rack1
+			newEcNode("dc1", "rack1", "dn1", 100).addEcVolumeAndShardsForTest(1, "c1", []uint32{0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13}),
+			// Volume 2: all shards on rack2
+			newEcNode("dc1", "rack2", "dn2", 100).addEcVolumeAndShardsForTest(2, "c1", []uint32{0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13}),
+			// Empty nodes on other racks
+			newEcNode("dc1", "rack3", "dn3", 100),
+			newEcNode("dc1", "rack4", "dn4", 100),
+			newEcNode("dc1", "rack5", "dn5", 100),
+			newEcNode("dc1", "rack6", "dn6", 100),
+		},
+		applyBalancing: false, // Dry-run mode
+		diskType:       types.HardDriveType,
+	}
+
+	ecb.balanceEcVolumes("c1")
+
+	// Check both volumes
+	for _, vid := range []needle.VolumeId{1, 2} {
+		dataPerRack, parityPerRack := countDataAndParityShardsPerRack(ecb.ecNodes, vid, erasure_coding.DataShardsCount)
+
+		maxDataPerRack := ceilDivide(erasure_coding.DataShardsCount, 6)
+		maxParityPerRack := ceilDivide(erasure_coding.ParityShardsCount, 6)
+
+		for rackId, count := range dataPerRack {
+			if count > maxDataPerRack {
+				t.Errorf("volume %d: rack %s has %d data shards, expected max %d", vid, rackId, count, maxDataPerRack)
+			}
+		}
+		for rackId, count := range parityPerRack {
+			if count > maxParityPerRack {
+				t.Errorf("volume %d: rack %s has %d parity shards, expected max %d", vid, rackId, count, maxParityPerRack)
+			}
+		}

+		t.Logf("Volume %d - Data: %v, Parity: %v", vid, dataPerRack, parityPerRack)
+	}
+}
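The per-rack caps asserted in the tests above come down to one piece of arithmetic: ceiling division of shard count by rack count. A minimal sketch of that helper (a local stand-in for the `ceilDivide` the tests rely on):

```go
package main

import "fmt"

// ceilDivide rounds the quotient a/b up: the most shards any rack must hold
// when spreading a shards evenly over b racks.
func ceilDivide(a, b int) int {
	return (a + b - 1) / b
}

func main() {
	// With 6 racks: 10 data shards allow at most ceil(10/6)=2 per rack,
	// and 4 parity shards at most ceil(4/6)=1 per rack.
	fmt.Println(ceilDivide(10, 6), ceilDivide(4, 6)) // 2 1
}
```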
weed/shell/ec_proportional_rebalance.go (new file, 284 lines)
@@ -0,0 +1,284 @@
package shell

import (
	"fmt"

	"github.com/seaweedfs/seaweedfs/weed/storage/erasure_coding"
	"github.com/seaweedfs/seaweedfs/weed/storage/erasure_coding/distribution"
	"github.com/seaweedfs/seaweedfs/weed/storage/needle"
	"github.com/seaweedfs/seaweedfs/weed/storage/super_block"
	"github.com/seaweedfs/seaweedfs/weed/storage/types"
)

// ECDistribution is an alias to the distribution package type for backward compatibility
type ECDistribution = distribution.ECDistribution

// CalculateECDistribution computes the target EC shard distribution based on replication policy.
// This is a convenience wrapper that uses the default 10+4 EC configuration.
// For custom EC ratios, use the distribution package directly.
func CalculateECDistribution(totalShards, parityShards int, rp *super_block.ReplicaPlacement) *ECDistribution {
	ec := distribution.ECConfig{
		DataShards:   totalShards - parityShards,
		ParityShards: parityShards,
	}
	rep := distribution.NewReplicationConfig(rp)
	return distribution.CalculateDistribution(ec, rep)
}

// TopologyDistributionAnalysis holds the current shard distribution analysis.
// This wraps the distribution package's TopologyAnalysis with shell-specific EcNode handling.
type TopologyDistributionAnalysis struct {
	inner *distribution.TopologyAnalysis

	// Shell-specific mappings
	nodeMap map[string]*EcNode // nodeID -> EcNode
}

// NewTopologyDistributionAnalysis creates a new analysis structure
func NewTopologyDistributionAnalysis() *TopologyDistributionAnalysis {
	return &TopologyDistributionAnalysis{
		inner:   distribution.NewTopologyAnalysis(),
		nodeMap: make(map[string]*EcNode),
	}
}

// AddNode adds a node and its shards to the analysis
func (a *TopologyDistributionAnalysis) AddNode(node *EcNode, shardBits erasure_coding.ShardBits) {
	nodeId := node.info.Id

	// Create distribution.TopologyNode from EcNode
	topoNode := &distribution.TopologyNode{
		NodeID:      nodeId,
		DataCenter:  string(node.dc),
		Rack:        string(node.rack),
		FreeSlots:   node.freeEcSlot,
		TotalShards: shardBits.ShardIdCount(),
	}

	for _, shardId := range shardBits.ShardIds() {
		topoNode.ShardIDs = append(topoNode.ShardIDs, int(shardId))
	}

	a.inner.AddNode(topoNode)
	a.nodeMap[nodeId] = node

	// Add shard locations
	for _, shardId := range shardBits.ShardIds() {
		a.inner.AddShardLocation(distribution.ShardLocation{
			ShardID:    int(shardId),
			NodeID:     nodeId,
			DataCenter: string(node.dc),
			Rack:       string(node.rack),
		})
	}
}

// Finalize completes the analysis
func (a *TopologyDistributionAnalysis) Finalize() {
	a.inner.Finalize()
}

// String returns a summary
func (a *TopologyDistributionAnalysis) String() string {
	return a.inner.String()
}

// DetailedString returns detailed analysis
func (a *TopologyDistributionAnalysis) DetailedString() string {
	return a.inner.DetailedString()
}

// GetShardsByDC returns shard counts by DC
func (a *TopologyDistributionAnalysis) GetShardsByDC() map[DataCenterId]int {
	result := make(map[DataCenterId]int)
	for dc, count := range a.inner.ShardsByDC {
		result[DataCenterId(dc)] = count
	}
	return result
}

// GetShardsByRack returns shard counts by rack
func (a *TopologyDistributionAnalysis) GetShardsByRack() map[RackId]int {
	result := make(map[RackId]int)
	for rack, count := range a.inner.ShardsByRack {
		result[RackId(rack)] = count
	}
	return result
}

// GetShardsByNode returns shard counts by node
func (a *TopologyDistributionAnalysis) GetShardsByNode() map[EcNodeId]int {
	result := make(map[EcNodeId]int)
	for nodeId, count := range a.inner.ShardsByNode {
		result[EcNodeId(nodeId)] = count
	}
	return result
}

// AnalyzeVolumeDistribution creates an analysis of current shard distribution for a volume
func AnalyzeVolumeDistribution(volumeId needle.VolumeId, locations []*EcNode, diskType types.DiskType) *TopologyDistributionAnalysis {
	analysis := NewTopologyDistributionAnalysis()

	for _, node := range locations {
		shardBits := findEcVolumeShards(node, volumeId, diskType)
		if shardBits.ShardIdCount() > 0 {
			analysis.AddNode(node, shardBits)
		}
	}

	analysis.Finalize()
	return analysis
}

// ECShardMove represents a planned shard move (shell-specific with EcNode references)
type ECShardMove struct {
	VolumeId   needle.VolumeId
	ShardId    erasure_coding.ShardId
	SourceNode *EcNode
	DestNode   *EcNode
	Reason     string
}

// String returns a human-readable description
func (m ECShardMove) String() string {
	return fmt.Sprintf("volume %d shard %d: %s -> %s (%s)",
		m.VolumeId, m.ShardId, m.SourceNode.info.Id, m.DestNode.info.Id, m.Reason)
}

// ProportionalECRebalancer implements proportional shard distribution for shell commands
type ProportionalECRebalancer struct {
	ecNodes          []*EcNode
	replicaPlacement *super_block.ReplicaPlacement
	diskType         types.DiskType
	ecConfig         distribution.ECConfig
}

// NewProportionalECRebalancer creates a new proportional rebalancer with default EC config
func NewProportionalECRebalancer(
	ecNodes []*EcNode,
	rp *super_block.ReplicaPlacement,
	diskType types.DiskType,
) *ProportionalECRebalancer {
	return NewProportionalECRebalancerWithConfig(
		ecNodes,
		rp,
		diskType,
		distribution.DefaultECConfig(),
	)
}

// NewProportionalECRebalancerWithConfig creates a rebalancer with custom EC configuration
func NewProportionalECRebalancerWithConfig(
	ecNodes []*EcNode,
	rp *super_block.ReplicaPlacement,
	diskType types.DiskType,
	ecConfig distribution.ECConfig,
) *ProportionalECRebalancer {
	return &ProportionalECRebalancer{
		ecNodes:          ecNodes,
		replicaPlacement: rp,
		diskType:         diskType,
		ecConfig:         ecConfig,
	}
}

// PlanMoves generates a plan for moving shards to achieve proportional distribution
func (r *ProportionalECRebalancer) PlanMoves(
	volumeId needle.VolumeId,
	locations []*EcNode,
) ([]ECShardMove, error) {
	// Build topology analysis
	analysis := distribution.NewTopologyAnalysis()
	nodeMap := make(map[string]*EcNode)

	// Add all EC nodes to the analysis (even those without shards)
	for _, node := range r.ecNodes {
		nodeId := node.info.Id
		topoNode := &distribution.TopologyNode{
			NodeID:     nodeId,
			DataCenter: string(node.dc),
			Rack:       string(node.rack),
			FreeSlots:  node.freeEcSlot,
		}
		analysis.AddNode(topoNode)
		nodeMap[nodeId] = node
	}

	// Add shard locations from nodes that have shards
	for _, node := range locations {
		nodeId := node.info.Id
		shardBits := findEcVolumeShards(node, volumeId, r.diskType)
		for _, shardId := range shardBits.ShardIds() {
			analysis.AddShardLocation(distribution.ShardLocation{
				ShardID:    int(shardId),
				NodeID:     nodeId,
				DataCenter: string(node.dc),
				Rack:       string(node.rack),
			})
		}
		if _, exists := nodeMap[nodeId]; !exists {
			nodeMap[nodeId] = node
		}
	}

	analysis.Finalize()

	// Create rebalancer and plan moves
	rep := distribution.NewReplicationConfig(r.replicaPlacement)
	rebalancer := distribution.NewRebalancer(r.ecConfig, rep)

	plan, err := rebalancer.PlanRebalance(analysis)
	if err != nil {
		return nil, err
	}

	// Convert distribution moves to shell moves
	var moves []ECShardMove
	for _, move := range plan.Moves {
		srcNode := nodeMap[move.SourceNode.NodeID]
		destNode := nodeMap[move.DestNode.NodeID]
		if srcNode == nil || destNode == nil {
			continue
		}

		moves = append(moves, ECShardMove{
			VolumeId:   volumeId,
			ShardId:    erasure_coding.ShardId(move.ShardID),
			SourceNode: srcNode,
			DestNode:   destNode,
			Reason:     move.Reason,
		})
	}

	return moves, nil
}

// GetDistributionSummary returns a summary of the planned distribution
func GetDistributionSummary(rp *super_block.ReplicaPlacement) string {
	ec := distribution.DefaultECConfig()
	rep := distribution.NewReplicationConfig(rp)
	dist := distribution.CalculateDistribution(ec, rep)
	return dist.Summary()
}

// GetDistributionSummaryWithConfig returns a summary with custom EC configuration
func GetDistributionSummaryWithConfig(rp *super_block.ReplicaPlacement, ecConfig distribution.ECConfig) string {
	rep := distribution.NewReplicationConfig(rp)
	dist := distribution.CalculateDistribution(ecConfig, rep)
	return dist.Summary()
}

// GetFaultToleranceAnalysis returns fault tolerance analysis for the given configuration
func GetFaultToleranceAnalysis(rp *super_block.ReplicaPlacement) string {
	ec := distribution.DefaultECConfig()
	rep := distribution.NewReplicationConfig(rp)
	dist := distribution.CalculateDistribution(ec, rep)
	return dist.FaultToleranceAnalysis()
}

// GetFaultToleranceAnalysisWithConfig returns fault tolerance analysis with custom EC configuration
func GetFaultToleranceAnalysisWithConfig(rp *super_block.ReplicaPlacement, ecConfig distribution.ECConfig) string {
	rep := distribution.NewReplicationConfig(rp)
	dist := distribution.CalculateDistribution(ecConfig, rep)
	return dist.FaultToleranceAnalysis()
}
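The proportional target the distribution package computes for a replica placement like "110" (two data centers) reduces to ceiling division of the shard count by the minimum DC count. A hypothetical sketch of that calculation (`targetShardsPerDC` is an illustrative helper, not the package's real API):

```go
package main

import "fmt"

// targetShardsPerDC spreads totalShards as evenly as possible over
// minDataCenters: each DC holds at most ceil(totalShards/minDataCenters).
func targetShardsPerDC(totalShards, minDataCenters int) int {
	return (totalShards + minDataCenters - 1) / minDataCenters
}

func main() {
	// 14 shards over 2 DCs (placement "110"): at most 7 per DC,
	// matching the TargetShardsPerDC expectation in the tests below.
	fmt.Println(targetShardsPerDC(14, 2)) // 7
	// 12 shards (8+4 config) over 3 DCs (placement "200"): 4 per DC.
	fmt.Println(targetShardsPerDC(12, 3)) // 4
}
```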
251
weed/shell/ec_proportional_rebalance_test.go
Normal file
251
weed/shell/ec_proportional_rebalance_test.go
Normal file
@@ -0,0 +1,251 @@
|
|||||||
|
package shell
|
||||||
|
|
||||||
|
import (
|
||||||
|
"testing"
|
||||||
|
|
||||||
|
"github.com/seaweedfs/seaweedfs/weed/pb/master_pb"
|
||||||
|
"github.com/seaweedfs/seaweedfs/weed/storage/erasure_coding"
|
||||||
|
"github.com/seaweedfs/seaweedfs/weed/storage/erasure_coding/distribution"
|
||||||
|
"github.com/seaweedfs/seaweedfs/weed/storage/needle"
|
||||||
|
"github.com/seaweedfs/seaweedfs/weed/storage/super_block"
|
||||||
|
"github.com/seaweedfs/seaweedfs/weed/storage/types"
|
||||||
|
)
|
||||||
|
|
||||||
|
func TestCalculateECDistributionShell(t *testing.T) {
|
||||||
|
// Test the shell wrapper function
|
||||||
|
rp, _ := super_block.NewReplicaPlacementFromString("110")
|
||||||
|
|
||||||
|
dist := CalculateECDistribution(
|
||||||
|
erasure_coding.TotalShardsCount,
|
||||||
|
erasure_coding.ParityShardsCount,
|
||||||
|
rp,
|
||||||
|
)
|
||||||
|
|
||||||
|
if dist.ReplicationConfig.MinDataCenters != 2 {
|
||||||
|
t.Errorf("Expected 2 DCs, got %d", dist.ReplicationConfig.MinDataCenters)
|
||||||
|
}
|
||||||
|
if dist.TargetShardsPerDC != 7 {
|
||||||
|
t.Errorf("Expected 7 shards per DC, got %d", dist.TargetShardsPerDC)
|
||||||
|
}
|
||||||
|
|
||||||
|
t.Log(dist.Summary())
|
||||||
|
}
|
||||||
|
|
||||||
|
func TestAnalyzeVolumeDistributionShell(t *testing.T) {
|
||||||
|
diskType := types.HardDriveType
|
||||||
|
diskTypeKey := string(diskType)
|
||||||
|
|
||||||
|
// Build a topology with unbalanced distribution
|
||||||
|
node1 := &EcNode{
|
||||||
|
info: &master_pb.DataNodeInfo{
|
||||||
|
Id: "127.0.0.1:8080",
|
||||||
|
DiskInfos: map[string]*master_pb.DiskInfo{
|
||||||
|
diskTypeKey: {
|
||||||
|
Type: diskTypeKey,
|
||||||
|
MaxVolumeCount: 10,
|
||||||
|
EcShardInfos: []*master_pb.VolumeEcShardInformationMessage{
|
||||||
|
{
|
||||||
|
Id: 1,
|
||||||
|
Collection: "test",
|
||||||
|
EcIndexBits: 0x3FFF, // All 14 shards
|
||||||
|
},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
dc: "dc1",
|
||||||
|
rack: "rack1",
|
||||||
|
freeEcSlot: 5,
|
||||||
|
}
|
||||||
|
|
||||||
|
node2 := &EcNode{
|
||||||
|
info: &master_pb.DataNodeInfo{
|
||||||
|
Id: "127.0.0.1:8081",
|
||||||
|
DiskInfos: map[string]*master_pb.DiskInfo{
|
||||||
|
diskTypeKey: {
|
||||||
|
Type: diskTypeKey,
|
||||||
|
MaxVolumeCount: 10,
|
||||||
|
EcShardInfos: []*master_pb.VolumeEcShardInformationMessage{},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
dc: "dc2",
|
||||||
|
rack: "rack2",
|
||||||
|
freeEcSlot: 10,
|
||||||
|
}
|
||||||
|
|
||||||
|
locations := []*EcNode{node1, node2}
|
||||||
|
volumeId := needle.VolumeId(1)
|
||||||
|
|
||||||
|
analysis := AnalyzeVolumeDistribution(volumeId, locations, diskType)
|
||||||
|
|
||||||
|
shardsByDC := analysis.GetShardsByDC()
|
||||||
|
if shardsByDC["dc1"] != 14 {
|
||||||
|
t.Errorf("Expected 14 shards in dc1, got %d", shardsByDC["dc1"])
|
||||||
|
}
|
||||||
|
|
||||||
|
t.Log(analysis.DetailedString())
|
||||||
|
}
|
||||||
|
|
||||||
|
func TestProportionalRebalancerShell(t *testing.T) {
|
||||||
|
diskType := types.HardDriveType
|
||||||
|
diskTypeKey := string(diskType)
|
||||||
|
|
||||||
|
// Build topology: 2 DCs, 2 racks each, all shards on one node
|
||||||
|
nodes := []*EcNode{
|
||||||
|
{
|
||||||
|
info: &master_pb.DataNodeInfo{
|
||||||
|
Id: "dc1-rack1-node1",
|
||||||
|
DiskInfos: map[string]*master_pb.DiskInfo{
|
||||||
|
diskTypeKey: {
|
||||||
|
Type: diskTypeKey,
|
||||||
|
MaxVolumeCount: 10,
|
||||||
|
EcShardInfos: []*master_pb.VolumeEcShardInformationMessage{
|
||||||
|
{Id: 1, Collection: "test", EcIndexBits: 0x3FFF},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
dc: "dc1", rack: "dc1-rack1", freeEcSlot: 0,
|
||||||
|
},
|
||||||
|
{
|
||||||
|
info: &master_pb.DataNodeInfo{
|
||||||
|
Id: "dc1-rack2-node1",
|
||||||
|
DiskInfos: map[string]*master_pb.DiskInfo{
|
||||||
|
diskTypeKey: {Type: diskTypeKey, MaxVolumeCount: 10},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
dc: "dc1", rack: "dc1-rack2", freeEcSlot: 10,
|
||||||
|
},
|
||||||
|
{
|
||||||
|
info: &master_pb.DataNodeInfo{
|
||||||
|
Id: "dc2-rack1-node1",
|
||||||
|
DiskInfos: map[string]*master_pb.DiskInfo{
|
||||||
|
diskTypeKey: {Type: diskTypeKey, MaxVolumeCount: 10},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
dc: "dc2", rack: "dc2-rack1", freeEcSlot: 10,
|
||||||
|
},
|
||||||
|
{
|
||||||
|
info: &master_pb.DataNodeInfo{
|
||||||
|
Id: "dc2-rack2-node1",
|
||||||
|
DiskInfos: map[string]*master_pb.DiskInfo{
|
||||||
|
diskTypeKey: {Type: diskTypeKey, MaxVolumeCount: 10},
|
||||||
|
},
|
||||||
|
},
|
||||||
|
dc: "dc2", rack: "dc2-rack2", freeEcSlot: 10,
|
||||||
|
},
|
||||||
|
}
|
||||||
|
|
||||||
|
rp, _ := super_block.NewReplicaPlacementFromString("110")
|
||||||
|
rebalancer := NewProportionalECRebalancer(nodes, rp, diskType)
|
||||||
|
|
||||||
|
volumeId := needle.VolumeId(1)
|
||||||
|
moves, err := rebalancer.PlanMoves(volumeId, []*EcNode{nodes[0]})
|
||||||
|
|
||||||
|
if err != nil {
|
||||||
|
t.Fatalf("PlanMoves failed: %v", err)
|
||||||
|
}
|
||||||
|
|
||||||
|
t.Logf("Planned %d moves", len(moves))
|
||||||
|
for i, move := range moves {
|
||||||
|
t.Logf(" %d. %s", i+1, move.String())
|
||||||
|
}
|
||||||
|
|
||||||
|
// Verify moves to dc2
|
||||||
|
movedToDC2 := 0
|
||||||
|
for _, move := range moves {
|
||||||
|
if move.DestNode.dc == "dc2" {
|
||||||
|
movedToDC2++
|
||||||
|
}
|
||||||
|
}
|
||||||
|
|
||||||
|
if movedToDC2 == 0 {
|
||||||
|
t.Error("Expected some moves to dc2")
|
||||||
|
}
|
||||||
|
}
|
||||||
|
|
||||||
|
func TestCustomECConfigRebalancer(t *testing.T) {
|
||||||
|
diskType := types.HardDriveType
|
||||||
|
	diskTypeKey := string(diskType)

	// Test with custom 8+4 EC configuration
	ecConfig, err := distribution.NewECConfig(8, 4)
	if err != nil {
		t.Fatalf("Failed to create EC config: %v", err)
	}

	// Build topology for 12 shards (8+4)
	nodes := []*EcNode{
		{
			info: &master_pb.DataNodeInfo{
				Id: "dc1-node1",
				DiskInfos: map[string]*master_pb.DiskInfo{
					diskTypeKey: {
						Type:           diskTypeKey,
						MaxVolumeCount: 10,
						EcShardInfos: []*master_pb.VolumeEcShardInformationMessage{
							{Id: 1, Collection: "test", EcIndexBits: 0x0FFF}, // 12 shards (bits 0-11)
						},
					},
				},
			},
			dc: "dc1", rack: "dc1-rack1", freeEcSlot: 0,
		},
		{
			info: &master_pb.DataNodeInfo{
				Id: "dc2-node1",
				DiskInfos: map[string]*master_pb.DiskInfo{
					diskTypeKey: {Type: diskTypeKey, MaxVolumeCount: 10},
				},
			},
			dc: "dc2", rack: "dc2-rack1", freeEcSlot: 10,
		},
		{
			info: &master_pb.DataNodeInfo{
				Id: "dc3-node1",
				DiskInfos: map[string]*master_pb.DiskInfo{
					diskTypeKey: {Type: diskTypeKey, MaxVolumeCount: 10},
				},
			},
			dc: "dc3", rack: "dc3-rack1", freeEcSlot: 10,
		},
	}

	rp, _ := super_block.NewReplicaPlacementFromString("200") // 3 DCs
	rebalancer := NewProportionalECRebalancerWithConfig(nodes, rp, diskType, ecConfig)

	volumeId := needle.VolumeId(1)
	moves, err := rebalancer.PlanMoves(volumeId, []*EcNode{nodes[0]})
	if err != nil {
		t.Fatalf("PlanMoves failed: %v", err)
	}

	t.Logf("Custom 8+4 EC with 200 replication: planned %d moves", len(moves))

	// Get the distribution summary
	summary := GetDistributionSummaryWithConfig(rp, ecConfig)
	t.Log(summary)

	analysis := GetFaultToleranceAnalysisWithConfig(rp, ecConfig)
	t.Log(analysis)
}

func TestGetDistributionSummaryShell(t *testing.T) {
	rp, _ := super_block.NewReplicaPlacementFromString("110")

	summary := GetDistributionSummary(rp)
	t.Log(summary)
	if len(summary) == 0 {
		t.Error("Summary should not be empty")
	}

	analysis := GetFaultToleranceAnalysis(rp)
	t.Log(analysis)
	if len(analysis) == 0 {
		t.Error("Analysis should not be empty")
	}
}
weed/shell/ec_rebalance_slots_test.go (new file, 293 lines)
@@ -0,0 +1,293 @@
package shell

import (
	"testing"

	"github.com/seaweedfs/seaweedfs/weed/pb/master_pb"
	"github.com/seaweedfs/seaweedfs/weed/storage/erasure_coding"
	"github.com/seaweedfs/seaweedfs/weed/storage/types"
)

// TestECRebalanceWithLimitedSlots tests that EC rebalance handles the scenario
// where there are limited free slots on volume servers.
//
// This is a regression test for the error:
//
//	"no free ec shard slots. only 0 left"
//
// Scenario (from real usage):
//   - 6 volume servers in 6 racks
//   - Each server has max=10 volume slots
//   - 7 volumes were EC encoded (7 × 14 = 98 EC shards)
//   - All 14 shards per volume are on the original server (not yet distributed)
//
// Expected behavior:
//   - The rebalance algorithm should distribute shards across servers
//   - Even if perfect distribution isn't possible, it should do best-effort
//   - Currently fails with "no free ec shard slots" because the freeSlots
//     calculation doesn't account for shards being moved (freed slots on the
//     source, used slots on the target)
func TestECRebalanceWithLimitedSlots(t *testing.T) {
	// Build a topology matching the problematic scenario:
	// 6 servers, each with 2+ volumes worth of EC shards (all 14 shards per volume on the same server)
	topology := buildLimitedSlotsTopology()

	// Collect EC nodes from the topology
	ecNodes, totalFreeEcSlots := collectEcVolumeServersByDc(topology, "", types.HardDriveType)

	t.Logf("Topology summary:")
	t.Logf("  Number of EC nodes: %d", len(ecNodes))
	t.Logf("  Total free EC slots: %d", totalFreeEcSlots)

	// Log per-node details
	for _, node := range ecNodes {
		shardCount := 0
		for _, diskInfo := range node.info.DiskInfos {
			for _, ecShard := range diskInfo.EcShardInfos {
				shardCount += erasure_coding.ShardBits(ecShard.EcIndexBits).ShardIdCount()
			}
		}
		t.Logf("  Node %s (rack %s): %d shards, %d free slots",
			node.info.Id, node.rack, shardCount, node.freeEcSlot)
	}

	// Calculate total EC shards
	totalEcShards := 0
	for _, node := range ecNodes {
		for _, diskInfo := range node.info.DiskInfos {
			for _, ecShard := range diskInfo.EcShardInfos {
				totalEcShards += erasure_coding.ShardBits(ecShard.EcIndexBits).ShardIdCount()
			}
		}
	}
	t.Logf("  Total EC shards: %d", totalEcShards)

	// Document the issue:
	// With 98 EC shards (7 volumes × 14 shards) on 6 servers with max=10 each,
	// total capacity is 60 slots. But shards already occupy slots on their current servers.
	//
	// The current algorithm calculates free slots as:
	//
	//	freeSlots = maxVolumeCount - volumeCount - ecShardCount
	//
	// If all shards are on their original servers:
	//   - Server A has 28 shards (2 volumes × 14) → may have negative free slots
	//   - This causes totalFreeEcSlots to be 0 or negative
	//
	// The EXPECTED improvement:
	//   - Rebalance should recognize that moving a shard FREES a slot on the source
	//   - The algorithm should work iteratively, moving shards one at a time
	//   - Even if starting with 0 free slots, moving one shard opens a slot

	if totalFreeEcSlots < 1 {
		// This is the current (buggy) behavior we're documenting
		t.Logf("")
		t.Logf("KNOWN ISSUE: totalFreeEcSlots = %d (< 1)", totalFreeEcSlots)
		t.Logf("")
		t.Logf("This triggers the error: 'no free ec shard slots. only %d left'", totalFreeEcSlots)
		t.Logf("")
		t.Logf("Analysis:")
		t.Logf("  - %d EC shards across %d servers", totalEcShards, len(ecNodes))
		t.Logf("  - Shards are concentrated on original servers (not distributed)")
		t.Logf("  - Current slot calculation doesn't account for slots freed by moving shards")
		t.Logf("")
		t.Logf("Expected fix:")
		t.Logf("  1. Rebalance should work iteratively, moving one shard at a time")
		t.Logf("  2. Moving a shard from A to B: frees 1 slot on A, uses 1 slot on B")
		t.Logf("  3. The 'free slots' check should be per-move, not global")
		t.Logf("  4. Or: calculate 'redistributable slots' = total capacity - shards that must stay")

		// For now, document this as a known issue - don't fail the test.
		// When the fix is implemented, this test should be updated to verify it.
		return
	}

	// If we get here, the issue might have been fixed
	t.Logf("totalFreeEcSlots = %d, rebalance should be possible", totalFreeEcSlots)
}

// TestECRebalanceZeroFreeSlots tests the specific scenario where
// the topology appears to have free slots but rebalance fails.
//
// This can happen when the VolumeCount in the topology includes the original
// volumes that were EC-encoded, making the free slot calculation incorrect.
func TestECRebalanceZeroFreeSlots(t *testing.T) {
	// Build a topology where volumes were NOT deleted after EC encoding
	// (VolumeCount still reflects the original volumes)
	topology := buildZeroFreeSlotTopology()

	ecNodes, totalFreeEcSlots := collectEcVolumeServersByDc(topology, "", types.HardDriveType)

	t.Logf("Zero free slots scenario:")
	for _, node := range ecNodes {
		shardCount := 0
		for _, diskInfo := range node.info.DiskInfos {
			for _, ecShard := range diskInfo.EcShardInfos {
				shardCount += erasure_coding.ShardBits(ecShard.EcIndexBits).ShardIdCount()
			}
		}
		t.Logf("  Node %s: %d shards, %d free slots, volumeCount=%d, max=%d",
			node.info.Id, shardCount, node.freeEcSlot,
			node.info.DiskInfos[string(types.HardDriveType)].VolumeCount,
			node.info.DiskInfos[string(types.HardDriveType)].MaxVolumeCount)
	}
	t.Logf("  Total free slots: %d", totalFreeEcSlots)

	if totalFreeEcSlots == 0 {
		t.Logf("")
		t.Logf("SCENARIO REPRODUCED: totalFreeEcSlots = 0")
		t.Logf("This would trigger: 'no free ec shard slots. only 0 left'")
	}
}

// buildZeroFreeSlotTopology creates a topology where rebalance will fail
// because servers are at capacity (volumeCount equals maxVolumeCount)
func buildZeroFreeSlotTopology() *master_pb.TopologyInfo {
	diskTypeKey := string(types.HardDriveType)

	// Each server has max=10, volumeCount=10 (full capacity).
	// Free capacity = 10 - 10 = 0 per server.
	// This will trigger the "no free ec shard slots" error.
	return &master_pb.TopologyInfo{
		Id: "test_zero_free_slots",
		DataCenterInfos: []*master_pb.DataCenterInfo{
			{
				Id: "dc1",
				RackInfos: []*master_pb.RackInfo{
					{
						Id: "rack0",
						DataNodeInfos: []*master_pb.DataNodeInfo{
							{
								Id: "127.0.0.1:8080",
								DiskInfos: map[string]*master_pb.DiskInfo{
									diskTypeKey: {
										Type:           diskTypeKey,
										MaxVolumeCount: 10,
										VolumeCount:    10, // At full capacity
										EcShardInfos:   buildEcShards([]uint32{3, 4}),
									},
								},
							},
						},
					},
					{
						Id: "rack1",
						DataNodeInfos: []*master_pb.DataNodeInfo{
							{
								Id: "127.0.0.1:8081",
								DiskInfos: map[string]*master_pb.DiskInfo{
									diskTypeKey: {
										Type:           diskTypeKey,
										MaxVolumeCount: 10,
										VolumeCount:    10,
										EcShardInfos:   buildEcShards([]uint32{1, 7}),
									},
								},
							},
						},
					},
					{
						Id: "rack2",
						DataNodeInfos: []*master_pb.DataNodeInfo{
							{
								Id: "127.0.0.1:8082",
								DiskInfos: map[string]*master_pb.DiskInfo{
									diskTypeKey: {
										Type:           diskTypeKey,
										MaxVolumeCount: 10,
										VolumeCount:    10,
										EcShardInfos:   buildEcShards([]uint32{2}),
									},
								},
							},
						},
					},
					{
						Id: "rack3",
						DataNodeInfos: []*master_pb.DataNodeInfo{
							{
								Id: "127.0.0.1:8083",
								DiskInfos: map[string]*master_pb.DiskInfo{
									diskTypeKey: {
										Type:           diskTypeKey,
										MaxVolumeCount: 10,
										VolumeCount:    10,
										EcShardInfos:   buildEcShards([]uint32{5, 6}),
									},
								},
							},
						},
					},
				},
			},
		},
	}
}

// buildEcShards creates EC shard info messages for each volume ID, with all
// shards present (EcIndexBits has bits 0..TotalShardsCount-1 set).
func buildEcShards(volumeIds []uint32) []*master_pb.VolumeEcShardInformationMessage {
	var shards []*master_pb.VolumeEcShardInformationMessage
	for _, vid := range volumeIds {
		allShardBits := erasure_coding.ShardBits(0)
		for i := 0; i < erasure_coding.TotalShardsCount; i++ {
			allShardBits = allShardBits.AddShardId(erasure_coding.ShardId(i))
		}
		shards = append(shards, &master_pb.VolumeEcShardInformationMessage{
			Id:          vid,
			Collection:  "ectest",
			EcIndexBits: uint32(allShardBits),
		})
	}
	return shards
}

// buildLimitedSlotsTopology creates a topology matching the problematic scenario:
//   - 6 servers in 6 racks
//   - Each server has max=10 volume slots
//   - 7 volumes were EC encoded, shards distributed as follows:
//   - rack0 (8080): volumes 3,4 → 28 shards
//   - rack1 (8081): volumes 1,7 → 28 shards
//   - rack2 (8082): volume 2 → 14 shards
//   - rack3 (8083): volumes 5,6 → 28 shards
//   - rack4 (8084): (no volumes originally)
//   - rack5 (8085): (no volumes originally)
func buildLimitedSlotsTopology() *master_pb.TopologyInfo {
	return &master_pb.TopologyInfo{
		Id: "test_limited_slots",
		DataCenterInfos: []*master_pb.DataCenterInfo{
			{
				Id: "dc1",
				RackInfos: []*master_pb.RackInfo{
					buildRackWithEcShards("rack0", "127.0.0.1:8080", 10, []uint32{3, 4}),
					buildRackWithEcShards("rack1", "127.0.0.1:8081", 10, []uint32{1, 7}),
					buildRackWithEcShards("rack2", "127.0.0.1:8082", 10, []uint32{2}),
					buildRackWithEcShards("rack3", "127.0.0.1:8083", 10, []uint32{5, 6}),
					buildRackWithEcShards("rack4", "127.0.0.1:8084", 10, []uint32{}),
					buildRackWithEcShards("rack5", "127.0.0.1:8085", 10, []uint32{}),
				},
			},
		},
	}
}

// buildRackWithEcShards creates a rack with one data node containing EC shards
// for the specified volume IDs (all 14 shards per volume)
func buildRackWithEcShards(rackId, nodeId string, maxVolumes int64, volumeIds []uint32) *master_pb.RackInfo {
	// Note: types.HardDriveType is "" (empty string), so we use "" as the key
	diskTypeKey := string(types.HardDriveType)

	return &master_pb.RackInfo{
		Id: rackId,
		DataNodeInfos: []*master_pb.DataNodeInfo{
			{
				Id: nodeId,
				DiskInfos: map[string]*master_pb.DiskInfo{
					diskTypeKey: {
						Type:           diskTypeKey,
						MaxVolumeCount: maxVolumes,
						VolumeCount:    int64(len(volumeIds)), // Original volumes still counted
						EcShardInfos:   buildEcShards(volumeIds),
					},
				},
			},
		},
	}
}
weed/storage/erasure_coding/distribution/README.md (new file, 209 lines)
@@ -0,0 +1,209 @@
# EC Distribution Package

This package provides erasure coding (EC) shard distribution algorithms that are:

- **Configurable**: Works with any EC ratio (e.g., 10+4, 8+4, 6+3)
- **Reusable**: Used by shell commands, worker tasks, and seaweed-enterprise
- **Topology-aware**: Distributes shards across data centers, racks, and nodes proportionally

## Usage

### Basic Usage with Default 10+4 EC

```go
import (
	"github.com/seaweedfs/seaweedfs/weed/storage/erasure_coding/distribution"
)

// Parse replication policy
rep, _ := distribution.NewReplicationConfigFromString("110")

// Use default 10+4 EC configuration
ec := distribution.DefaultECConfig()

// Calculate distribution plan
dist := distribution.CalculateDistribution(ec, rep)

fmt.Println(dist.Summary())
// Output:
// EC Configuration: 10+4 (total: 14, can lose: 4)
// Replication: replication=110 (DCs:2, Racks/DC:2, Nodes/Rack:1)
// Distribution Plan:
// Data Centers: 2 (target 7 shards each, max 9)
// Racks per DC: 2 (target 4 shards each, max 6)
// Nodes per Rack: 1 (target 4 shards each, max 6)
```

### Custom EC Ratios (seaweed-enterprise)

```go
// Create custom 8+4 EC configuration
ec, err := distribution.NewECConfig(8, 4)
if err != nil {
	log.Fatal(err)
}

rep, _ := distribution.NewReplicationConfigFromString("200")
dist := distribution.CalculateDistribution(ec, rep)

// Check fault tolerance
fmt.Println(dist.FaultToleranceAnalysis())
// Output:
// Fault Tolerance Analysis for 8+4:
// DC Failure: SURVIVABLE ✓
// - Losing one DC loses ~4 shards
// - Remaining: 8 shards (need 8)
```

### Planning Shard Moves

```go
// Build topology analysis
analysis := distribution.NewTopologyAnalysis()

// Add nodes and their shard locations
for _, node := range nodes {
	analysis.AddNode(&distribution.TopologyNode{
		NodeID:     node.ID,
		DataCenter: node.DC,
		Rack:       node.Rack,
		FreeSlots:  node.FreeSlots,
	})
	for _, shardID := range node.ShardIDs {
		analysis.AddShardLocation(distribution.ShardLocation{
			ShardID:    shardID,
			NodeID:     node.ID,
			DataCenter: node.DC,
			Rack:       node.Rack,
		})
	}
}
analysis.Finalize()

// Create rebalancer and plan moves
rebalancer := distribution.NewRebalancer(ec, rep)
plan, err := rebalancer.PlanRebalance(analysis)

for _, move := range plan.Moves {
	fmt.Printf("Move shard %d from %s to %s\n",
		move.ShardID, move.SourceNode.NodeID, move.DestNode.NodeID)
}
```

## Algorithm

### Proportional Distribution

The replication policy `XYZ` is interpreted as a ratio:

| Replication | DCs | Racks/DC | Nodes/Rack | 14 Shards Distribution |
|-------------|-----|----------|------------|------------------------|
| `000`       | 1   | 1        | 1          | All in one place       |
| `001`       | 1   | 1        | 2          | 7 per node             |
| `010`       | 1   | 2        | 1          | 7 per rack             |
| `100`       | 2   | 1        | 1          | 7 per DC               |
| `110`       | 2   | 2        | 1          | 7/DC, 4/rack           |
| `200`       | 3   | 1        | 1          | 5 per DC               |

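The target counts in the table follow from dividing the 14 shards as evenly as possible at each level, rounding up so the busiest bucket absorbs the remainder. A minimal sketch of that arithmetic (`targets` is a hypothetical helper for illustration; the package itself exposes this through `CalculateDistribution`):

```go
package main

import "fmt"

// ceilDiv rounds the division up, so the fullest bucket gets the remainder.
func ceilDiv(a, b int) int { return (a + b - 1) / b }

// targets computes per-DC, per-rack, and per-node shard targets by dividing
// evenly at each successive topology level.
func targets(totalShards, dcs, racksPerDC, nodesPerRack int) (perDC, perRack, perNode int) {
	perDC = ceilDiv(totalShards, dcs)
	perRack = ceilDiv(perDC, racksPerDC)
	perNode = ceilDiv(perRack, nodesPerRack)
	return
}

func main() {
	// Replication "110": 2 DCs, 2 racks/DC, 1 node/rack → 7/DC, 4/rack.
	perDC, perRack, perNode := targets(14, 2, 2, 1)
	fmt.Println(perDC, perRack, perNode) // 7 4 4
}
```

The same computation reproduces the other table rows, e.g. `targets(14, 3, 1, 1)` gives 5 per DC for `200`.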
### Rebalancing Process

1. **DC-level balancing**: Move shards to achieve target shards per DC
2. **Rack-level balancing**: Within each DC, balance across racks
3. **Node-level balancing**: Within each rack, balance across nodes

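Each pass is the same greedy step applied at a different granularity: take a shard from whichever bucket is furthest over its target and give it to whichever is furthest under. A self-contained sketch of one such pass (illustrative only, not the package's actual implementation):

```go
package main

import "fmt"

// balance moves one shard at a time from the most-overloaded bucket to the
// most-underloaded one until no bucket exceeds the target, returning the moves.
func balance(counts map[string]int, target int) []string {
	var moves []string
	for {
		src, dst := "", ""
		for id, c := range counts {
			if c > target && (src == "" || c > counts[src]) {
				src = id
			}
			if c < target && (dst == "" || c < counts[dst]) {
				dst = id
			}
		}
		if src == "" || dst == "" {
			return moves // balanced, or nowhere left to move
		}
		counts[src]--
		counts[dst]++
		moves = append(moves, src+"->"+dst)
	}
}

func main() {
	// All 14 shards start in dc1; target is 7 per DC.
	counts := map[string]int{"dc1": 14, "dc2": 0}
	moves := balance(counts, 7)
	fmt.Println(len(moves), counts["dc1"], counts["dc2"]) // 7 7 7
}
```

Note that each move frees a slot on the source as it consumes one on the destination, which is exactly the accounting the iterative fix described in the tests relies on.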
### Shard Priority: Data First, Parity Moves First

When rebalancing, the algorithm prioritizes keeping data shards spread out:

- **Data shards (0 to DataShards-1)**: Serve read requests directly
- **Parity shards (DataShards to TotalShards-1)**: Only used for reconstruction

**Rebalancing Strategy**:
- When moving shards FROM an overloaded node, **parity shards are moved first**
- This keeps data shards in place on well-distributed nodes
- Result: Data shards remain spread out for optimal read performance

```go
// Check shard type
if ec.IsDataShard(shardID) {
	// Shard serves read requests
}
if ec.IsParityShard(shardID) {
	// Shard only used for reconstruction
}

// Sort shards for placement (data first for initial distribution)
sorted := ec.SortShardsDataFirst(shards)

// Sort shards for rebalancing (parity first to move them away)
sorted = ec.SortShardsParityFirst(shards)
```

### Fault Tolerance

The package provides fault tolerance analysis:

- **DC Failure**: Can the data survive complete DC loss?
- **Rack Failure**: Can the data survive complete rack loss?
- **Node Failure**: Can the data survive single node loss?

For example, with 10+4 EC (can lose 4 shards):
- Need 4+ DCs for DC-level fault tolerance
- Need 4+ racks for rack-level fault tolerance
- Usually survivable at node level

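These thresholds can be checked arithmetically: with shards spread evenly over n failure domains, losing one domain loses at most ceil(total/n) shards, and the volume survives as long as at least DataShards remain. A hedged sketch of that check (`canSurviveDomainFailure` is an assumed helper, not the package's `FaultToleranceAnalysis` itself):

```go
package main

import "fmt"

// canSurviveDomainFailure reports whether losing the fullest of n failure
// domains (DCs, racks, nodes) still leaves enough shards to reconstruct.
func canSurviveDomainFailure(dataShards, parityShards, domains int) bool {
	total := dataShards + parityShards
	worstLoss := (total + domains - 1) / domains // shards in the fullest domain
	return total-worstLoss >= dataShards
}

func main() {
	fmt.Println(canSurviveDomainFailure(10, 4, 3)) // false: lose 5, only 9 remain
	fmt.Println(canSurviveDomainFailure(10, 4, 4)) // true: lose 4, 10 remain
	fmt.Println(canSurviveDomainFailure(8, 4, 3))  // true: the 8+4 with "200" example
}
```

This reproduces both the "need 4+ DCs for 10+4" rule and the survivable 8+4 over 3 DCs case shown earlier.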
## API Reference

### Types

- `ECConfig`: EC configuration (data shards, parity shards)
- `ReplicationConfig`: Parsed replication policy
- `ECDistribution`: Calculated distribution plan
- `TopologyAnalysis`: Current shard distribution analysis
- `Rebalancer`: Plans shard moves
- `RebalancePlan`: List of planned moves
- `ShardMove`: Single shard move operation

### Key Functions

- `NewECConfig(data, parity int)`: Create EC configuration
- `DefaultECConfig()`: Returns 10+4 configuration
- `CalculateDistribution(ec, rep)`: Calculate distribution plan
- `NewRebalancer(ec, rep)`: Create rebalancer
- `PlanRebalance(analysis)`: Generate rebalancing plan

## Integration

### Shell Commands

The shell package wraps this distribution package for `ec.balance`:

```go
import "github.com/seaweedfs/seaweedfs/weed/shell"

rebalancer := shell.NewProportionalECRebalancer(nodes, rp, diskType)
moves, _ := rebalancer.PlanMoves(volumeId, locations)
```

### Worker Tasks

Worker tasks can use the distribution package directly:

```go
import "github.com/seaweedfs/seaweedfs/weed/storage/erasure_coding/distribution"

ec := distribution.ECConfig{DataShards: 8, ParityShards: 4}
rep := distribution.NewReplicationConfig(rp)
dist := distribution.CalculateDistribution(ec, rep)
```

### seaweed-enterprise

Enterprise features can provide custom EC configurations:

```go
// Custom EC ratio from license/config
ec, _ := distribution.NewECConfig(customData, customParity)
rebalancer := distribution.NewRebalancer(ec, rep)
```
weed/storage/erasure_coding/distribution/analysis.go (new file, 241 lines)
@@ -0,0 +1,241 @@
package distribution

import (
	"fmt"
	"slices"
)

// ShardLocation represents where a shard is located in the topology
type ShardLocation struct {
	ShardID    int
	NodeID     string
	DataCenter string
	Rack       string
}

// TopologyNode represents a node in the topology that can hold EC shards
type TopologyNode struct {
	NodeID      string
	DataCenter  string
	Rack        string
	FreeSlots   int   // Available slots for new shards
	ShardIDs    []int // Shard IDs currently on this node for a specific volume
	TotalShards int   // Total shards on this node (for all volumes)
}

// TopologyAnalysis holds the current shard distribution analysis for a volume
type TopologyAnalysis struct {
	// Shard counts at each level
	ShardsByDC   map[string]int
	ShardsByRack map[string]int
	ShardsByNode map[string]int

	// Detailed shard locations
	DCToShards   map[string][]int // DC -> list of shard IDs
	RackToShards map[string][]int // Rack -> list of shard IDs
	NodeToShards map[string][]int // NodeID -> list of shard IDs

	// Topology structure
	DCToRacks   map[string][]string        // DC -> list of rack IDs
	RackToNodes map[string][]*TopologyNode // Rack -> list of nodes
	AllNodes    map[string]*TopologyNode   // NodeID -> node info

	// Statistics
	TotalShards int
	TotalNodes  int
	TotalRacks  int
	TotalDCs    int
}

// NewTopologyAnalysis creates a new empty analysis
func NewTopologyAnalysis() *TopologyAnalysis {
	return &TopologyAnalysis{
		ShardsByDC:   make(map[string]int),
		ShardsByRack: make(map[string]int),
		ShardsByNode: make(map[string]int),
		DCToShards:   make(map[string][]int),
		RackToShards: make(map[string][]int),
		NodeToShards: make(map[string][]int),
		DCToRacks:    make(map[string][]string),
		RackToNodes:  make(map[string][]*TopologyNode),
		AllNodes:     make(map[string]*TopologyNode),
	}
}

// AddShardLocation adds a shard location to the analysis
func (a *TopologyAnalysis) AddShardLocation(loc ShardLocation) {
	// Update counts
	a.ShardsByDC[loc.DataCenter]++
	a.ShardsByRack[loc.Rack]++
	a.ShardsByNode[loc.NodeID]++

	// Update shard lists
	a.DCToShards[loc.DataCenter] = append(a.DCToShards[loc.DataCenter], loc.ShardID)
	a.RackToShards[loc.Rack] = append(a.RackToShards[loc.Rack], loc.ShardID)
	a.NodeToShards[loc.NodeID] = append(a.NodeToShards[loc.NodeID], loc.ShardID)

	a.TotalShards++
}

// AddNode adds a node to the topology (even if it has no shards)
func (a *TopologyAnalysis) AddNode(node *TopologyNode) {
	if _, exists := a.AllNodes[node.NodeID]; exists {
		return // Already added
	}

	a.AllNodes[node.NodeID] = node
	a.TotalNodes++

	// Update topology structure
	if !slices.Contains(a.DCToRacks[node.DataCenter], node.Rack) {
		a.DCToRacks[node.DataCenter] = append(a.DCToRacks[node.DataCenter], node.Rack)
	}
	a.RackToNodes[node.Rack] = append(a.RackToNodes[node.Rack], node)

	// Update counts
	if _, exists := a.ShardsByDC[node.DataCenter]; !exists {
		a.TotalDCs++
	}
	if _, exists := a.ShardsByRack[node.Rack]; !exists {
		a.TotalRacks++
	}
}

// Finalize computes final statistics after all data is added
func (a *TopologyAnalysis) Finalize() {
	// Ensure we have accurate DC and rack counts
	dcSet := make(map[string]bool)
	rackSet := make(map[string]bool)
	for _, node := range a.AllNodes {
		dcSet[node.DataCenter] = true
		rackSet[node.Rack] = true
	}
	a.TotalDCs = len(dcSet)
	a.TotalRacks = len(rackSet)
	a.TotalNodes = len(a.AllNodes)
}

// String returns a summary of the analysis
func (a *TopologyAnalysis) String() string {
	return fmt.Sprintf("TopologyAnalysis{shards:%d, nodes:%d, racks:%d, dcs:%d}",
		a.TotalShards, a.TotalNodes, a.TotalRacks, a.TotalDCs)
}

// DetailedString returns a detailed multi-line summary
func (a *TopologyAnalysis) DetailedString() string {
	s := "Topology Analysis:\n"
	s += fmt.Sprintf("  Total Shards: %d\n", a.TotalShards)
	s += fmt.Sprintf("  Data Centers: %d\n", a.TotalDCs)
	for dc, count := range a.ShardsByDC {
		s += fmt.Sprintf("    %s: %d shards\n", dc, count)
	}
	s += fmt.Sprintf("  Racks: %d\n", a.TotalRacks)
	for rack, count := range a.ShardsByRack {
		s += fmt.Sprintf("    %s: %d shards\n", rack, count)
	}
	s += fmt.Sprintf("  Nodes: %d\n", a.TotalNodes)
	for nodeID, count := range a.ShardsByNode {
		if count > 0 {
			s += fmt.Sprintf("    %s: %d shards\n", nodeID, count)
		}
	}
	return s
}

// TopologyExcess represents a topology level (DC/rack/node) with excess shards
type TopologyExcess struct {
	ID     string          // DC/rack/node ID
	Level  string          // "dc", "rack", or "node"
	Excess int             // Number of excess shards (above target)
	Shards []int           // Shard IDs at this level
	Nodes  []*TopologyNode // Nodes at this level (for finding sources)
}

// CalculateDCExcess returns DCs with more shards than the target
func CalculateDCExcess(analysis *TopologyAnalysis, dist *ECDistribution) []TopologyExcess {
	var excess []TopologyExcess

	for dc, count := range analysis.ShardsByDC {
		if count > dist.TargetShardsPerDC {
			// Collect nodes in this DC
			var nodes []*TopologyNode
			for _, rack := range analysis.DCToRacks[dc] {
				nodes = append(nodes, analysis.RackToNodes[rack]...)
			}
			excess = append(excess, TopologyExcess{
				ID:     dc,
				Level:  "dc",
				Excess: count - dist.TargetShardsPerDC,
				Shards: analysis.DCToShards[dc],
				Nodes:  nodes,
			})
		}
	}

	// Sort by excess (most excess first)
	slices.SortFunc(excess, func(a, b TopologyExcess) int {
		return b.Excess - a.Excess
	})

	return excess
}

// CalculateRackExcess returns racks with more shards than the target (within a DC)
func CalculateRackExcess(analysis *TopologyAnalysis, dc string, targetPerRack int) []TopologyExcess {
	var excess []TopologyExcess

	for _, rack := range analysis.DCToRacks[dc] {
		count := analysis.ShardsByRack[rack]
		if count > targetPerRack {
			excess = append(excess, TopologyExcess{
				ID:     rack,
				Level:  "rack",
				Excess: count - targetPerRack,
				Shards: analysis.RackToShards[rack],
				Nodes:  analysis.RackToNodes[rack],
			})
		}
	}

	slices.SortFunc(excess, func(a, b TopologyExcess) int {
		return b.Excess - a.Excess
	})

	return excess
}

// CalculateUnderservedDCs returns DCs that have fewer shards than target
func CalculateUnderservedDCs(analysis *TopologyAnalysis, dist *ECDistribution) []string {
	var underserved []string

	// Check existing DCs
	for dc, count := range analysis.ShardsByDC {
		if count < dist.TargetShardsPerDC {
			underserved = append(underserved, dc)
		}
	}

	// Check DCs with nodes but no shards
	for dc := range analysis.DCToRacks {
		if _, exists := analysis.ShardsByDC[dc]; !exists {
			underserved = append(underserved, dc)
		}
	}

	return underserved
}

// CalculateUnderservedRacks returns racks that have fewer shards than target
func CalculateUnderservedRacks(analysis *TopologyAnalysis, dc string, targetPerRack int) []string {
	var underserved []string

	for _, rack := range analysis.DCToRacks[dc] {
		count := analysis.ShardsByRack[rack]
		if count < targetPerRack {
			underserved = append(underserved, rack)
		}
	}

	return underserved
}
weed/storage/erasure_coding/distribution/config.go (new file, 171 lines)
@@ -0,0 +1,171 @@
// Package distribution provides EC shard distribution algorithms with configurable EC ratios.
package distribution

import (
	"fmt"

	"github.com/seaweedfs/seaweedfs/weed/storage/super_block"
)

// ECConfig holds erasure coding configuration parameters.
// It replaces hard-coded constants like DataShardsCount=10, ParityShardsCount=4.
type ECConfig struct {
	DataShards   int // Number of data shards (e.g., 10)
	ParityShards int // Number of parity shards (e.g., 4)
}

// DefaultECConfig returns the standard 10+4 EC configuration.
func DefaultECConfig() ECConfig {
	return ECConfig{
		DataShards:   10,
		ParityShards: 4,
	}
}

// NewECConfig creates a new EC configuration with validation.
func NewECConfig(dataShards, parityShards int) (ECConfig, error) {
	if dataShards <= 0 {
		return ECConfig{}, fmt.Errorf("dataShards must be positive, got %d", dataShards)
	}
	if parityShards <= 0 {
		return ECConfig{}, fmt.Errorf("parityShards must be positive, got %d", parityShards)
	}
	if dataShards+parityShards > 32 {
		return ECConfig{}, fmt.Errorf("total shards (%d+%d=%d) exceeds maximum of 32",
			dataShards, parityShards, dataShards+parityShards)
	}
	return ECConfig{
		DataShards:   dataShards,
		ParityShards: parityShards,
	}, nil
}

// TotalShards returns the total number of shards (data + parity).
func (c ECConfig) TotalShards() int {
	return c.DataShards + c.ParityShards
}

// MaxTolerableLoss returns the maximum number of shards that can be lost
// while still being able to reconstruct the data.
func (c ECConfig) MaxTolerableLoss() int {
	return c.ParityShards
}

// MinShardsForReconstruction returns the minimum number of shards needed
// to reconstruct the original data.
func (c ECConfig) MinShardsForReconstruction() int {
	return c.DataShards
}

// String returns a human-readable representation.
func (c ECConfig) String() string {
	return fmt.Sprintf("%d+%d (total: %d, can lose: %d)",
		c.DataShards, c.ParityShards, c.TotalShards(), c.MaxTolerableLoss())
}

// IsDataShard returns true if the shard ID is a data shard (0 to DataShards-1).
func (c ECConfig) IsDataShard(shardID int) bool {
	return shardID >= 0 && shardID < c.DataShards
}

// IsParityShard returns true if the shard ID is a parity shard (DataShards to TotalShards-1).
func (c ECConfig) IsParityShard(shardID int) bool {
	return shardID >= c.DataShards && shardID < c.TotalShards()
}

// SortShardsDataFirst returns a copy of shards sorted with data shards first.
// This is useful for initial placement, where data shards should be spread out first.
func (c ECConfig) SortShardsDataFirst(shards []int) []int {
	// Partition: data shards fill from the front, parity shards from the back.
	dataIdx := 0
	parityIdx := len(shards) - 1

	sorted := make([]int, len(shards))
	for _, s := range shards {
		if c.IsDataShard(s) {
			sorted[dataIdx] = s
			dataIdx++
		} else {
			sorted[parityIdx] = s
			parityIdx--
		}
	}

	return sorted
}

// SortShardsParityFirst returns a copy of shards sorted with parity shards first.
// This is useful for rebalancing, where we prefer to move parity shards.
func (c ECConfig) SortShardsParityFirst(shards []int) []int {
	// Partition: parity shards fill from the front, data shards from the back.
	parityIdx := 0
	dataIdx := len(shards) - 1

	sorted := make([]int, len(shards))
	for _, s := range shards {
		if c.IsParityShard(s) {
			sorted[parityIdx] = s
			parityIdx++
		} else {
			sorted[dataIdx] = s
			dataIdx--
		}
	}

	return sorted
}

// ReplicationConfig holds the parsed replication policy.
type ReplicationConfig struct {
	MinDataCenters  int // X+1 from XYZ replication (minimum DCs to use)
	MinRacksPerDC   int // Y+1 from XYZ replication (minimum racks per DC)
	MinNodesPerRack int // Z+1 from XYZ replication (minimum nodes per rack)

	// Original replication string (for logging/debugging)
	Original string
}

// NewReplicationConfig creates a ReplicationConfig from a ReplicaPlacement.
func NewReplicationConfig(rp *super_block.ReplicaPlacement) ReplicationConfig {
	if rp == nil {
		return ReplicationConfig{
			MinDataCenters:  1,
			MinRacksPerDC:   1,
			MinNodesPerRack: 1,
			Original:        "000",
		}
	}
	return ReplicationConfig{
		MinDataCenters:  rp.DiffDataCenterCount + 1,
		MinRacksPerDC:   rp.DiffRackCount + 1,
		MinNodesPerRack: rp.SameRackCount + 1,
		Original:        rp.String(),
	}
}

// NewReplicationConfigFromString creates a ReplicationConfig from a replication string.
func NewReplicationConfigFromString(replication string) (ReplicationConfig, error) {
	rp, err := super_block.NewReplicaPlacementFromString(replication)
	if err != nil {
		return ReplicationConfig{}, err
	}
	return NewReplicationConfig(rp), nil
}

// TotalPlacementSlots returns the minimum number of unique placement locations
// implied by the replication policy.
func (r ReplicationConfig) TotalPlacementSlots() int {
	return r.MinDataCenters * r.MinRacksPerDC * r.MinNodesPerRack
}

// String returns a human-readable representation.
func (r ReplicationConfig) String() string {
	return fmt.Sprintf("replication=%s (DCs:%d, Racks/DC:%d, Nodes/Rack:%d)",
		r.Original, r.MinDataCenters, r.MinRacksPerDC, r.MinNodesPerRack)
}
weed/storage/erasure_coding/distribution/distribution.go (new file, 161 lines)
@@ -0,0 +1,161 @@
package distribution

import (
	"fmt"
)

// ECDistribution represents the target distribution of EC shards
// based on EC configuration and replication policy.
type ECDistribution struct {
	// EC configuration
	ECConfig ECConfig

	// Replication configuration
	ReplicationConfig ReplicationConfig

	// Target shard counts per topology level (balanced distribution)
	TargetShardsPerDC   int
	TargetShardsPerRack int
	TargetShardsPerNode int

	// Maximum shard counts per topology level (fault tolerance limits).
	// These prevent any single failure domain from holding too many shards.
	MaxShardsPerDC   int
	MaxShardsPerRack int
	MaxShardsPerNode int
}

// CalculateDistribution computes the target EC shard distribution based on
// EC configuration and replication policy.
//
// The algorithm:
//  1. Uses the replication policy to determine the minimum topology spread
//  2. Calculates target shards per level (evenly distributed)
//  3. Calculates max shards per level (for fault tolerance)
func CalculateDistribution(ec ECConfig, rep ReplicationConfig) *ECDistribution {
	totalShards := ec.TotalShards()

	// Target distribution (balanced, rounded up so that all shards get placed)
	targetShardsPerDC := ceilDivide(totalShards, rep.MinDataCenters)
	targetShardsPerRack := ceilDivide(targetShardsPerDC, rep.MinRacksPerDC)
	targetShardsPerNode := ceilDivide(targetShardsPerRack, rep.MinNodesPerRack)

	// Maximum limits for fault tolerance.
	// Ideally, losing one failure domain should cost no more than ParityShards,
	// which would cap each domain at ParityShards shards. That is often
	// impossible on small topologies, so we use a looser hard cap of
	// totalShards-ParityShards+1, plus a small tolerance (+2) over the
	// balanced target to allow for imbalanced topologies.
	faultToleranceLimit := totalShards - ec.ParityShards + 1

	maxShardsPerDC := min(faultToleranceLimit, targetShardsPerDC+2)
	maxShardsPerRack := min(faultToleranceLimit, targetShardsPerRack+2)
	maxShardsPerNode := min(faultToleranceLimit, targetShardsPerNode+2)

	return &ECDistribution{
		ECConfig:            ec,
		ReplicationConfig:   rep,
		TargetShardsPerDC:   targetShardsPerDC,
		TargetShardsPerRack: targetShardsPerRack,
		TargetShardsPerNode: targetShardsPerNode,
		MaxShardsPerDC:      maxShardsPerDC,
		MaxShardsPerRack:    maxShardsPerRack,
		MaxShardsPerNode:    maxShardsPerNode,
	}
}

// String returns a human-readable description of the distribution.
func (d *ECDistribution) String() string {
	return fmt.Sprintf(
		"ECDistribution{EC:%s, DCs:%d (target:%d/max:%d), Racks/DC:%d (target:%d/max:%d), Nodes/Rack:%d (target:%d/max:%d)}",
		d.ECConfig.String(),
		d.ReplicationConfig.MinDataCenters, d.TargetShardsPerDC, d.MaxShardsPerDC,
		d.ReplicationConfig.MinRacksPerDC, d.TargetShardsPerRack, d.MaxShardsPerRack,
		d.ReplicationConfig.MinNodesPerRack, d.TargetShardsPerNode, d.MaxShardsPerNode,
	)
}

// Summary returns a multi-line summary of the distribution plan.
func (d *ECDistribution) Summary() string {
	summary := fmt.Sprintf("EC Configuration: %s\n", d.ECConfig.String())
	summary += fmt.Sprintf("Replication: %s\n", d.ReplicationConfig.String())
	summary += "Distribution Plan:\n"
	summary += fmt.Sprintf("  Data Centers: %d (target %d shards each, max %d)\n",
		d.ReplicationConfig.MinDataCenters, d.TargetShardsPerDC, d.MaxShardsPerDC)
	summary += fmt.Sprintf("  Racks per DC: %d (target %d shards each, max %d)\n",
		d.ReplicationConfig.MinRacksPerDC, d.TargetShardsPerRack, d.MaxShardsPerRack)
	summary += fmt.Sprintf("  Nodes per Rack: %d (target %d shards each, max %d)\n",
		d.ReplicationConfig.MinNodesPerRack, d.TargetShardsPerNode, d.MaxShardsPerNode)
	return summary
}

// CanSurviveDCFailure returns true if the distribution can survive
// the complete loss of one data center.
func (d *ECDistribution) CanSurviveDCFailure() bool {
	// After losing one DC's target share, check that the remaining shards suffice.
	remainingAfterDCLoss := d.ECConfig.TotalShards() - d.TargetShardsPerDC
	return remainingAfterDCLoss >= d.ECConfig.MinShardsForReconstruction()
}

// CanSurviveRackFailure returns true if the distribution can survive
// the complete loss of one rack.
func (d *ECDistribution) CanSurviveRackFailure() bool {
	remainingAfterRackLoss := d.ECConfig.TotalShards() - d.TargetShardsPerRack
	return remainingAfterRackLoss >= d.ECConfig.MinShardsForReconstruction()
}

// MinDCsForDCFaultTolerance calculates the minimum number of DCs needed
// to survive a complete DC failure with this EC configuration.
func (d *ECDistribution) MinDCsForDCFaultTolerance() int {
	// To survive a DC failure, no DC may hold more than ParityShards shards.
	maxShardsPerDC := d.ECConfig.MaxTolerableLoss()
	if maxShardsPerDC == 0 {
		return d.ECConfig.TotalShards() // Would need one DC per shard
	}
	return ceilDivide(d.ECConfig.TotalShards(), maxShardsPerDC)
}

// FaultToleranceAnalysis returns a detailed analysis of fault tolerance.
func (d *ECDistribution) FaultToleranceAnalysis() string {
	analysis := fmt.Sprintf("Fault Tolerance Analysis for %s:\n", d.ECConfig.String())

	// DC failure
	dcSurvive := d.CanSurviveDCFailure()
	shardsAfterDC := d.ECConfig.TotalShards() - d.TargetShardsPerDC
	analysis += fmt.Sprintf("  DC Failure: %s\n", boolToResult(dcSurvive))
	analysis += fmt.Sprintf("    - Losing one DC loses ~%d shards\n", d.TargetShardsPerDC)
	analysis += fmt.Sprintf("    - Remaining: %d shards (need %d)\n", shardsAfterDC, d.ECConfig.DataShards)
	if !dcSurvive {
		analysis += fmt.Sprintf("    - Need at least %d DCs for DC fault tolerance\n", d.MinDCsForDCFaultTolerance())
	}

	// Rack failure
	rackSurvive := d.CanSurviveRackFailure()
	shardsAfterRack := d.ECConfig.TotalShards() - d.TargetShardsPerRack
	analysis += fmt.Sprintf("  Rack Failure: %s\n", boolToResult(rackSurvive))
	analysis += fmt.Sprintf("    - Losing one rack loses ~%d shards\n", d.TargetShardsPerRack)
	analysis += fmt.Sprintf("    - Remaining: %d shards (need %d)\n", shardsAfterRack, d.ECConfig.DataShards)

	// Node failure (usually survivable)
	shardsAfterNode := d.ECConfig.TotalShards() - d.TargetShardsPerNode
	nodeSurvive := shardsAfterNode >= d.ECConfig.DataShards
	analysis += fmt.Sprintf("  Node Failure: %s\n", boolToResult(nodeSurvive))
	analysis += fmt.Sprintf("    - Losing one node loses ~%d shards\n", d.TargetShardsPerNode)
	analysis += fmt.Sprintf("    - Remaining: %d shards (need %d)\n", shardsAfterNode, d.ECConfig.DataShards)

	return analysis
}

func boolToResult(b bool) string {
	if b {
		return "SURVIVABLE ✓"
	}
	return "NOT SURVIVABLE ✗"
}

// ceilDivide performs ceiling division.
func ceilDivide(a, b int) int {
	if b <= 0 {
		return a
	}
	return (a + b - 1) / b
}
weed/storage/erasure_coding/distribution/distribution_test.go (new file, 565 lines)
@@ -0,0 +1,565 @@
package distribution

import (
	"testing"
)

func TestNewECConfig(t *testing.T) {
	tests := []struct {
		name         string
		dataShards   int
		parityShards int
		wantErr      bool
	}{
		{"valid 10+4", 10, 4, false},
		{"valid 8+4", 8, 4, false},
		{"valid 6+3", 6, 3, false},
		{"valid 4+2", 4, 2, false},
		{"invalid data=0", 0, 4, true},
		{"invalid parity=0", 10, 0, true},
		{"invalid total>32", 20, 15, true},
	}

	for _, tt := range tests {
		t.Run(tt.name, func(t *testing.T) {
			config, err := NewECConfig(tt.dataShards, tt.parityShards)
			if (err != nil) != tt.wantErr {
				t.Errorf("NewECConfig() error = %v, wantErr %v", err, tt.wantErr)
				return
			}
			if !tt.wantErr {
				if config.DataShards != tt.dataShards {
					t.Errorf("DataShards = %d, want %d", config.DataShards, tt.dataShards)
				}
				if config.ParityShards != tt.parityShards {
					t.Errorf("ParityShards = %d, want %d", config.ParityShards, tt.parityShards)
				}
				if config.TotalShards() != tt.dataShards+tt.parityShards {
					t.Errorf("TotalShards() = %d, want %d", config.TotalShards(), tt.dataShards+tt.parityShards)
				}
			}
		})
	}
}
func TestCalculateDistribution(t *testing.T) {
	tests := []struct {
		name                    string
		ecConfig                ECConfig
		replication             string
		expectedMinDCs          int
		expectedMinRacksPerDC   int
		expectedMinNodesPerRack int
		expectedTargetPerDC     int
		expectedTargetPerRack   int
		expectedTargetPerNode   int
	}{
		{
			name:                    "10+4 with 000",
			ecConfig:                DefaultECConfig(),
			replication:             "000",
			expectedMinDCs:          1,
			expectedMinRacksPerDC:   1,
			expectedMinNodesPerRack: 1,
			expectedTargetPerDC:     14,
			expectedTargetPerRack:   14,
			expectedTargetPerNode:   14,
		},
		{
			name:                    "10+4 with 100",
			ecConfig:                DefaultECConfig(),
			replication:             "100",
			expectedMinDCs:          2,
			expectedMinRacksPerDC:   1,
			expectedMinNodesPerRack: 1,
			expectedTargetPerDC:     7,
			expectedTargetPerRack:   7,
			expectedTargetPerNode:   7,
		},
		{
			name:                    "10+4 with 110",
			ecConfig:                DefaultECConfig(),
			replication:             "110",
			expectedMinDCs:          2,
			expectedMinRacksPerDC:   2,
			expectedMinNodesPerRack: 1,
			expectedTargetPerDC:     7,
			expectedTargetPerRack:   4,
			expectedTargetPerNode:   4,
		},
		{
			name:                    "10+4 with 200",
			ecConfig:                DefaultECConfig(),
			replication:             "200",
			expectedMinDCs:          3,
			expectedMinRacksPerDC:   1,
			expectedMinNodesPerRack: 1,
			expectedTargetPerDC:     5,
			expectedTargetPerRack:   5,
			expectedTargetPerNode:   5,
		},
		{
			name: "8+4 with 110",
			ecConfig: ECConfig{
				DataShards:   8,
				ParityShards: 4,
			},
			replication:             "110",
			expectedMinDCs:          2,
			expectedMinRacksPerDC:   2,
			expectedMinNodesPerRack: 1,
			expectedTargetPerDC:     6, // 12/2 = 6
			expectedTargetPerRack:   3, // 6/2 = 3
			expectedTargetPerNode:   3,
		},
		{
			name: "6+3 with 100",
			ecConfig: ECConfig{
				DataShards:   6,
				ParityShards: 3,
			},
			replication:             "100",
			expectedMinDCs:          2,
			expectedMinRacksPerDC:   1,
			expectedMinNodesPerRack: 1,
			expectedTargetPerDC:     5, // ceil(9/2) = 5
			expectedTargetPerRack:   5,
			expectedTargetPerNode:   5,
		},
	}

	for _, tt := range tests {
		t.Run(tt.name, func(t *testing.T) {
			rep, err := NewReplicationConfigFromString(tt.replication)
			if err != nil {
				t.Fatalf("Failed to parse replication %s: %v", tt.replication, err)
			}

			dist := CalculateDistribution(tt.ecConfig, rep)

			if dist.ReplicationConfig.MinDataCenters != tt.expectedMinDCs {
				t.Errorf("MinDataCenters = %d, want %d", dist.ReplicationConfig.MinDataCenters, tt.expectedMinDCs)
			}
			if dist.ReplicationConfig.MinRacksPerDC != tt.expectedMinRacksPerDC {
				t.Errorf("MinRacksPerDC = %d, want %d", dist.ReplicationConfig.MinRacksPerDC, tt.expectedMinRacksPerDC)
			}
			if dist.ReplicationConfig.MinNodesPerRack != tt.expectedMinNodesPerRack {
				t.Errorf("MinNodesPerRack = %d, want %d", dist.ReplicationConfig.MinNodesPerRack, tt.expectedMinNodesPerRack)
			}
			if dist.TargetShardsPerDC != tt.expectedTargetPerDC {
				t.Errorf("TargetShardsPerDC = %d, want %d", dist.TargetShardsPerDC, tt.expectedTargetPerDC)
			}
			if dist.TargetShardsPerRack != tt.expectedTargetPerRack {
				t.Errorf("TargetShardsPerRack = %d, want %d", dist.TargetShardsPerRack, tt.expectedTargetPerRack)
			}
			if dist.TargetShardsPerNode != tt.expectedTargetPerNode {
				t.Errorf("TargetShardsPerNode = %d, want %d", dist.TargetShardsPerNode, tt.expectedTargetPerNode)
			}

			t.Logf("Distribution for %s: %s", tt.name, dist.String())
		})
	}
}
func TestFaultToleranceAnalysis(t *testing.T) {
	tests := []struct {
		name           string
		ecConfig       ECConfig
		replication    string
		canSurviveDC   bool
		canSurviveRack bool
	}{
		// 10+4 = 14 shards, need 10 to reconstruct, can lose 4
		{"10+4 000", DefaultECConfig(), "000", false, false}, // All in one place; any failure is fatal
		{"10+4 100", DefaultECConfig(), "100", false, false}, // 7 per DC/rack, 7 remaining < 10
		{"10+4 200", DefaultECConfig(), "200", false, false}, // 5 per DC/rack, 9 remaining < 10
		{"10+4 110", DefaultECConfig(), "110", false, true},  // 4 per rack, 10 remaining = enough for rack

		// 8+4 = 12 shards, need 8 to reconstruct, can lose 4
		{"8+4 100", ECConfig{8, 4}, "100", false, false}, // 6 per DC/rack, 6 remaining < 8
		{"8+4 200", ECConfig{8, 4}, "200", true, true},   // 4 per DC/rack, 8 remaining = enough
	}

	for _, tt := range tests {
		t.Run(tt.name, func(t *testing.T) {
			rep, _ := NewReplicationConfigFromString(tt.replication)
			dist := CalculateDistribution(tt.ecConfig, rep)

			if dist.CanSurviveDCFailure() != tt.canSurviveDC {
				t.Errorf("CanSurviveDCFailure() = %v, want %v", dist.CanSurviveDCFailure(), tt.canSurviveDC)
			}
			if dist.CanSurviveRackFailure() != tt.canSurviveRack {
				t.Errorf("CanSurviveRackFailure() = %v, want %v", dist.CanSurviveRackFailure(), tt.canSurviveRack)
			}

			t.Log(dist.FaultToleranceAnalysis())
		})
	}
}

func TestMinDCsForDCFaultTolerance(t *testing.T) {
	tests := []struct {
		name     string
		ecConfig ECConfig
		minDCs   int
	}{
		// 10+4: can lose 4, so max 4 per DC, ceil(14/4) = 4 DCs needed
		{"10+4", DefaultECConfig(), 4},
		// 8+4: can lose 4, so max 4 per DC, 12/4 = 3 DCs needed
		{"8+4", ECConfig{8, 4}, 3},
		// 6+3: can lose 3, so max 3 per DC, 9/3 = 3 DCs needed
		{"6+3", ECConfig{6, 3}, 3},
		// 4+2: can lose 2, so max 2 per DC, 6/2 = 3 DCs needed
		{"4+2", ECConfig{4, 2}, 3},
	}

	for _, tt := range tests {
		t.Run(tt.name, func(t *testing.T) {
			rep, _ := NewReplicationConfigFromString("000")
			dist := CalculateDistribution(tt.ecConfig, rep)

			if dist.MinDCsForDCFaultTolerance() != tt.minDCs {
				t.Errorf("MinDCsForDCFaultTolerance() = %d, want %d",
					dist.MinDCsForDCFaultTolerance(), tt.minDCs)
			}

			t.Logf("%s: needs %d DCs for DC fault tolerance", tt.name, dist.MinDCsForDCFaultTolerance())
		})
	}
}
func TestTopologyAnalysis(t *testing.T) {
	analysis := NewTopologyAnalysis()

	// Add nodes to the topology
	node1 := &TopologyNode{
		NodeID:     "node1",
		DataCenter: "dc1",
		Rack:       "rack1",
		FreeSlots:  5,
	}
	node2 := &TopologyNode{
		NodeID:     "node2",
		DataCenter: "dc1",
		Rack:       "rack2",
		FreeSlots:  10,
	}
	node3 := &TopologyNode{
		NodeID:     "node3",
		DataCenter: "dc2",
		Rack:       "rack3",
		FreeSlots:  10,
	}

	analysis.AddNode(node1)
	analysis.AddNode(node2)
	analysis.AddNode(node3)

	// Add shard locations (all on node1)
	for i := 0; i < 14; i++ {
		analysis.AddShardLocation(ShardLocation{
			ShardID:    i,
			NodeID:     "node1",
			DataCenter: "dc1",
			Rack:       "rack1",
		})
	}

	analysis.Finalize()

	// Verify counts
	if analysis.TotalShards != 14 {
		t.Errorf("TotalShards = %d, want 14", analysis.TotalShards)
	}
	if analysis.ShardsByDC["dc1"] != 14 {
		t.Errorf("ShardsByDC[dc1] = %d, want 14", analysis.ShardsByDC["dc1"])
	}
	if analysis.ShardsByRack["rack1"] != 14 {
		t.Errorf("ShardsByRack[rack1] = %d, want 14", analysis.ShardsByRack["rack1"])
	}
	if analysis.ShardsByNode["node1"] != 14 {
		t.Errorf("ShardsByNode[node1] = %d, want 14", analysis.ShardsByNode["node1"])
	}

	t.Log(analysis.DetailedString())
}
func TestRebalancer(t *testing.T) {
	// Build topology: 2 DCs, 2 racks each, all shards on one node
	analysis := NewTopologyAnalysis()

	// Add nodes
	nodes := []*TopologyNode{
		{NodeID: "dc1-rack1-node1", DataCenter: "dc1", Rack: "dc1-rack1", FreeSlots: 0},
		{NodeID: "dc1-rack2-node1", DataCenter: "dc1", Rack: "dc1-rack2", FreeSlots: 10},
		{NodeID: "dc2-rack1-node1", DataCenter: "dc2", Rack: "dc2-rack1", FreeSlots: 10},
		{NodeID: "dc2-rack2-node1", DataCenter: "dc2", Rack: "dc2-rack2", FreeSlots: 10},
	}
	for _, node := range nodes {
		analysis.AddNode(node)
	}

	// Add all 14 shards to the first node
	for i := 0; i < 14; i++ {
		analysis.AddShardLocation(ShardLocation{
			ShardID:    i,
			NodeID:     "dc1-rack1-node1",
			DataCenter: "dc1",
			Rack:       "dc1-rack1",
		})
	}
	analysis.Finalize()

	// Create a rebalancer with 110 replication (2 DCs, 2 racks each)
	ec := DefaultECConfig()
	rep, _ := NewReplicationConfigFromString("110")
	rebalancer := NewRebalancer(ec, rep)

	plan, err := rebalancer.PlanRebalance(analysis)
	if err != nil {
		t.Fatalf("PlanRebalance failed: %v", err)
	}

	t.Logf("Planned %d moves", plan.TotalMoves)
	t.Log(plan.DetailedString())

	// Verify we're moving shards to dc2
	movedToDC2 := 0
	for _, move := range plan.Moves {
		if move.DestNode.DataCenter == "dc2" {
			movedToDC2++
		}
	}

	if movedToDC2 == 0 {
		t.Error("Expected some moves to dc2")
	}

	// With "110" replication, the target is 7 shards per DC.
	// Starting with 14 in dc1, the plan should move 7 to dc2.
	if plan.MovesAcrossDC < 7 {
		t.Errorf("Expected at least 7 cross-DC moves for 110 replication, got %d", plan.MovesAcrossDC)
	}
}
func TestCustomECRatios(t *testing.T) {
	// Test various custom EC ratios that seaweed-enterprise might use
	ratios := []struct {
		name   string
		data   int
		parity int
	}{
		{"4+2", 4, 2},
		{"6+3", 6, 3},
		{"8+2", 8, 2},
		{"8+4", 8, 4},
		{"10+4", 10, 4},
		{"12+4", 12, 4},
		{"16+4", 16, 4},
	}

	for _, ratio := range ratios {
		t.Run(ratio.name, func(t *testing.T) {
			ec, err := NewECConfig(ratio.data, ratio.parity)
			if err != nil {
				t.Fatalf("Failed to create EC config: %v", err)
			}

			rep, _ := NewReplicationConfigFromString("110")
			dist := CalculateDistribution(ec, rep)

			t.Logf("EC %s with replication 110:", ratio.name)
			t.Logf("  Total shards: %d", ec.TotalShards())
			t.Logf("  Can lose: %d shards", ec.MaxTolerableLoss())
			t.Logf("  Target per DC: %d", dist.TargetShardsPerDC)
			t.Logf("  Target per rack: %d", dist.TargetShardsPerRack)
			t.Logf("  Min DCs for DC fault tolerance: %d", dist.MinDCsForDCFaultTolerance())

			// Verify basic sanity
			if dist.TargetShardsPerDC*2 < ec.TotalShards() {
				t.Errorf("Target per DC (%d) * 2 should be >= total (%d)",
					dist.TargetShardsPerDC, ec.TotalShards())
			}
		})
	}
}
func TestShardClassification(t *testing.T) {
	ec := DefaultECConfig() // 10+4

	// Test IsDataShard
	for i := 0; i < 10; i++ {
		if !ec.IsDataShard(i) {
			t.Errorf("Shard %d should be a data shard", i)
		}
		if ec.IsParityShard(i) {
			t.Errorf("Shard %d should not be a parity shard", i)
		}
	}

	// Test IsParityShard
	for i := 10; i < 14; i++ {
		if ec.IsDataShard(i) {
			t.Errorf("Shard %d should not be a data shard", i)
		}
		if !ec.IsParityShard(i) {
			t.Errorf("Shard %d should be a parity shard", i)
		}
	}

	// Test with a custom 8+4 EC
	ec84, _ := NewECConfig(8, 4)
	for i := 0; i < 8; i++ {
		if !ec84.IsDataShard(i) {
			t.Errorf("8+4 EC: Shard %d should be a data shard", i)
		}
	}
	for i := 8; i < 12; i++ {
		if !ec84.IsParityShard(i) {
			t.Errorf("8+4 EC: Shard %d should be a parity shard", i)
		}
	}
}
func TestSortShardsDataFirst(t *testing.T) {
	ec := DefaultECConfig() // 10+4

	// Mixed shards: [0, 10, 5, 11, 2, 12, 7, 13]
	shards := []int{0, 10, 5, 11, 2, 12, 7, 13}
	sorted := ec.SortShardsDataFirst(shards)

	t.Logf("Original: %v", shards)
	t.Logf("Sorted (data first): %v", sorted)

	// The first 4 should be data shards (0, 5, 2, 7)
	for i := 0; i < 4; i++ {
		if !ec.IsDataShard(sorted[i]) {
			t.Errorf("Position %d should be a data shard, got %d", i, sorted[i])
		}
	}

	// The last 4 should be parity shards (10, 11, 12, 13)
	for i := 4; i < 8; i++ {
		if !ec.IsParityShard(sorted[i]) {
			t.Errorf("Position %d should be a parity shard, got %d", i, sorted[i])
		}
	}
}

func TestSortShardsParityFirst(t *testing.T) {
	ec := DefaultECConfig() // 10+4

	// Mixed shards: [0, 10, 5, 11, 2, 12, 7, 13]
	shards := []int{0, 10, 5, 11, 2, 12, 7, 13}
	sorted := ec.SortShardsParityFirst(shards)

	t.Logf("Original: %v", shards)
	t.Logf("Sorted (parity first): %v", sorted)

	// The first 4 should be parity shards (10, 11, 12, 13)
	for i := 0; i < 4; i++ {
		if !ec.IsParityShard(sorted[i]) {
			t.Errorf("Position %d should be a parity shard, got %d", i, sorted[i])
		}
	}

	// The last 4 should be data shards (0, 5, 2, 7)
	for i := 4; i < 8; i++ {
		if !ec.IsDataShard(sorted[i]) {
			t.Errorf("Position %d should be a data shard, got %d", i, sorted[i])
		}
	}
}
func TestRebalancerPrefersMovingParityShards(t *testing.T) {
	// Build a topology where one node holds all shards (a mix of data and parity)
	analysis := NewTopologyAnalysis()

	// Node 1: has all 14 shards
	node1 := &TopologyNode{
		NodeID:     "node1",
		DataCenter: "dc1",
		Rack:       "rack1",
		FreeSlots:  0,
	}
	analysis.AddNode(node1)

	// Node 2: empty, ready to receive
	node2 := &TopologyNode{
		NodeID:     "node2",
		DataCenter: "dc1",
		Rack:       "rack1",
		FreeSlots:  10,
	}
	analysis.AddNode(node2)

	// Add all 14 shards to node1
	for i := 0; i < 14; i++ {
		analysis.AddShardLocation(ShardLocation{
			ShardID:    i,
			NodeID:     "node1",
			DataCenter: "dc1",
			Rack:       "rack1",
		})
	}
	analysis.Finalize()

	// Create the rebalancer
	ec := DefaultECConfig()
	rep, _ := NewReplicationConfigFromString("000")
	rebalancer := NewRebalancer(ec, rep)

	plan, err := rebalancer.PlanRebalance(analysis)
	if err != nil {
		t.Fatalf("PlanRebalance failed: %v", err)
	}

	t.Logf("Planned %d moves", len(plan.Moves))

	// Check that parity shards are moved first
	parityMovesFirst := 0
	dataMovesFirst := 0
	seenDataMove := false

	for _, move := range plan.Moves {
		isParity := ec.IsParityShard(move.ShardID)
		t.Logf("Move shard %d (parity=%v): %s -> %s",
			move.ShardID, isParity, move.SourceNode.NodeID, move.DestNode.NodeID)

		if isParity && !seenDataMove {
			parityMovesFirst++
		} else if !isParity {
			seenDataMove = true
			dataMovesFirst++
		}
	}
|
t.Logf("Parity moves before first data move: %d", parityMovesFirst)
|
||||||
|
t.Logf("Data moves: %d", dataMovesFirst)
|
||||||
|
|
||||||
|
// With 10+4 EC, there are 4 parity shards
|
||||||
|
// They should be moved before data shards when possible
|
||||||
|
if parityMovesFirst < 4 && len(plan.Moves) >= 4 {
|
||||||
|
t.Logf("Note: Expected parity shards to be moved first, but got %d parity moves before data moves", parityMovesFirst)
|
||||||
|
}
|
||||||
|
}
|
||||||
|
|
||||||
|
func TestDistributionSummary(t *testing.T) {
|
||||||
|
ec := DefaultECConfig()
|
||||||
|
rep, _ := NewReplicationConfigFromString("110")
|
||||||
|
dist := CalculateDistribution(ec, rep)
|
||||||
|
|
||||||
|
summary := dist.Summary()
|
||||||
|
t.Log(summary)
|
||||||
|
|
||||||
|
if len(summary) == 0 {
|
||||||
|
t.Error("Summary should not be empty")
|
||||||
|
}
|
||||||
|
|
||||||
|
analysis := dist.FaultToleranceAnalysis()
|
||||||
|
t.Log(analysis)
|
||||||
|
|
||||||
|
if len(analysis) == 0 {
|
||||||
|
t.Error("Fault tolerance analysis should not be empty")
|
||||||
|
}
|
||||||
|
}
|
||||||
weed/storage/erasure_coding/distribution/rebalancer.go (new file, 378 lines)
@@ -0,0 +1,378 @@
package distribution

import (
	"fmt"
	"slices"
)

// ShardMove represents a planned shard move
type ShardMove struct {
	ShardID    int
	SourceNode *TopologyNode
	DestNode   *TopologyNode
	Reason     string
}

// String returns a human-readable description of the move
func (m ShardMove) String() string {
	return fmt.Sprintf("shard %d: %s -> %s (%s)",
		m.ShardID, m.SourceNode.NodeID, m.DestNode.NodeID, m.Reason)
}

// RebalancePlan contains the complete plan for rebalancing EC shards
type RebalancePlan struct {
	Moves        []ShardMove
	Distribution *ECDistribution
	Analysis     *TopologyAnalysis

	// Statistics
	TotalMoves      int
	MovesAcrossDC   int
	MovesAcrossRack int
	MovesWithinRack int
}

// String returns a summary of the plan
func (p *RebalancePlan) String() string {
	return fmt.Sprintf("RebalancePlan{moves:%d, acrossDC:%d, acrossRack:%d, withinRack:%d}",
		p.TotalMoves, p.MovesAcrossDC, p.MovesAcrossRack, p.MovesWithinRack)
}

// DetailedString returns a detailed multi-line summary
func (p *RebalancePlan) DetailedString() string {
	s := fmt.Sprintf("Rebalance Plan:\n")
	s += fmt.Sprintf("  Total Moves: %d\n", p.TotalMoves)
	s += fmt.Sprintf("  Across DC: %d\n", p.MovesAcrossDC)
	s += fmt.Sprintf("  Across Rack: %d\n", p.MovesAcrossRack)
	s += fmt.Sprintf("  Within Rack: %d\n", p.MovesWithinRack)
	s += fmt.Sprintf("\nMoves:\n")
	for i, move := range p.Moves {
		s += fmt.Sprintf("  %d. %s\n", i+1, move.String())
	}
	return s
}

// Rebalancer plans shard moves to achieve proportional distribution
type Rebalancer struct {
	ecConfig  ECConfig
	repConfig ReplicationConfig
}

// NewRebalancer creates a new rebalancer with the given configuration
func NewRebalancer(ec ECConfig, rep ReplicationConfig) *Rebalancer {
	return &Rebalancer{
		ecConfig:  ec,
		repConfig: rep,
	}
}

// PlanRebalance creates a rebalancing plan based on current topology analysis
func (r *Rebalancer) PlanRebalance(analysis *TopologyAnalysis) (*RebalancePlan, error) {
	dist := CalculateDistribution(r.ecConfig, r.repConfig)

	plan := &RebalancePlan{
		Distribution: dist,
		Analysis:     analysis,
	}

	// Step 1: Balance across data centers
	dcMoves := r.planDCMoves(analysis, dist)
	for _, move := range dcMoves {
		plan.Moves = append(plan.Moves, move)
		plan.MovesAcrossDC++
	}

	// Update analysis after DC moves (for planning purposes)
	r.applyMovesToAnalysis(analysis, dcMoves)

	// Step 2: Balance across racks within each DC
	rackMoves := r.planRackMoves(analysis, dist)
	for _, move := range rackMoves {
		plan.Moves = append(plan.Moves, move)
		plan.MovesAcrossRack++
	}

	// Update analysis after rack moves
	r.applyMovesToAnalysis(analysis, rackMoves)

	// Step 3: Balance across nodes within each rack
	nodeMoves := r.planNodeMoves(analysis, dist)
	for _, move := range nodeMoves {
		plan.Moves = append(plan.Moves, move)
		plan.MovesWithinRack++
	}

	plan.TotalMoves = len(plan.Moves)

	return plan, nil
}

// planDCMoves plans moves to balance shards across data centers
func (r *Rebalancer) planDCMoves(analysis *TopologyAnalysis, dist *ECDistribution) []ShardMove {
	var moves []ShardMove

	overDCs := CalculateDCExcess(analysis, dist)
	underDCs := CalculateUnderservedDCs(analysis, dist)

	underIdx := 0
	for _, over := range overDCs {
		for over.Excess > 0 && underIdx < len(underDCs) {
			destDC := underDCs[underIdx]

			// Find a shard and source node
			shardID, srcNode := r.pickShardToMove(analysis, over.Nodes)
			if srcNode == nil {
				break
			}

			// Find destination node in target DC
			destNode := r.pickBestDestination(analysis, destDC, "", dist)
			if destNode == nil {
				underIdx++
				continue
			}

			moves = append(moves, ShardMove{
				ShardID:    shardID,
				SourceNode: srcNode,
				DestNode:   destNode,
				Reason:     fmt.Sprintf("balance DC: %s -> %s", srcNode.DataCenter, destDC),
			})

			over.Excess--
			analysis.ShardsByDC[srcNode.DataCenter]--
			analysis.ShardsByDC[destDC]++

			// Check if destDC reached target
			if analysis.ShardsByDC[destDC] >= dist.TargetShardsPerDC {
				underIdx++
			}
		}
	}

	return moves
}

// planRackMoves plans moves to balance shards across racks within each DC
func (r *Rebalancer) planRackMoves(analysis *TopologyAnalysis, dist *ECDistribution) []ShardMove {
	var moves []ShardMove

	for dc := range analysis.DCToRacks {
		dcShards := analysis.ShardsByDC[dc]
		numRacks := len(analysis.DCToRacks[dc])
		if numRacks == 0 {
			continue
		}

		targetPerRack := ceilDivide(dcShards, max(numRacks, dist.ReplicationConfig.MinRacksPerDC))

		overRacks := CalculateRackExcess(analysis, dc, targetPerRack)
		underRacks := CalculateUnderservedRacks(analysis, dc, targetPerRack)

		underIdx := 0
		for _, over := range overRacks {
			for over.Excess > 0 && underIdx < len(underRacks) {
				destRack := underRacks[underIdx]

				// Find shard and source node
				shardID, srcNode := r.pickShardToMove(analysis, over.Nodes)
				if srcNode == nil {
					break
				}

				// Find destination node in target rack
				destNode := r.pickBestDestination(analysis, dc, destRack, dist)
				if destNode == nil {
					underIdx++
					continue
				}

				moves = append(moves, ShardMove{
					ShardID:    shardID,
					SourceNode: srcNode,
					DestNode:   destNode,
					Reason:     fmt.Sprintf("balance rack: %s -> %s", srcNode.Rack, destRack),
				})

				over.Excess--
				analysis.ShardsByRack[srcNode.Rack]--
				analysis.ShardsByRack[destRack]++

				if analysis.ShardsByRack[destRack] >= targetPerRack {
					underIdx++
				}
			}
		}
	}

	return moves
}

// planNodeMoves plans moves to balance shards across nodes within each rack
func (r *Rebalancer) planNodeMoves(analysis *TopologyAnalysis, dist *ECDistribution) []ShardMove {
	var moves []ShardMove

	for rack, nodes := range analysis.RackToNodes {
		if len(nodes) <= 1 {
			continue
		}

		rackShards := analysis.ShardsByRack[rack]
		targetPerNode := ceilDivide(rackShards, max(len(nodes), dist.ReplicationConfig.MinNodesPerRack))

		// Find over and under nodes
		var overNodes []*TopologyNode
		var underNodes []*TopologyNode

		for _, node := range nodes {
			count := analysis.ShardsByNode[node.NodeID]
			if count > targetPerNode {
				overNodes = append(overNodes, node)
			} else if count < targetPerNode {
				underNodes = append(underNodes, node)
			}
		}

		// Sort by excess/deficit
		slices.SortFunc(overNodes, func(a, b *TopologyNode) int {
			return analysis.ShardsByNode[b.NodeID] - analysis.ShardsByNode[a.NodeID]
		})

		underIdx := 0
		for _, srcNode := range overNodes {
			excess := analysis.ShardsByNode[srcNode.NodeID] - targetPerNode

			for excess > 0 && underIdx < len(underNodes) {
				destNode := underNodes[underIdx]

				// Pick a shard from this node, preferring parity shards
				shards := analysis.NodeToShards[srcNode.NodeID]
				if len(shards) == 0 {
					break
				}

				// Find a parity shard first, fallback to data shard
				shardID := -1
				shardIdx := -1
				for i, s := range shards {
					if r.ecConfig.IsParityShard(s) {
						shardID = s
						shardIdx = i
						break
					}
				}
				if shardID == -1 {
					shardID = shards[0]
					shardIdx = 0
				}

				moves = append(moves, ShardMove{
					ShardID:    shardID,
					SourceNode: srcNode,
					DestNode:   destNode,
					Reason:     fmt.Sprintf("balance node: %s -> %s", srcNode.NodeID, destNode.NodeID),
				})

				excess--
				analysis.ShardsByNode[srcNode.NodeID]--
				analysis.ShardsByNode[destNode.NodeID]++

				// Update shard lists - remove the specific shard we picked
				analysis.NodeToShards[srcNode.NodeID] = append(
					shards[:shardIdx], shards[shardIdx+1:]...)
				analysis.NodeToShards[destNode.NodeID] = append(
					analysis.NodeToShards[destNode.NodeID], shardID)

				if analysis.ShardsByNode[destNode.NodeID] >= targetPerNode {
					underIdx++
				}
			}
		}
	}

	return moves
}

// pickShardToMove selects a shard and its node from the given nodes.
// It prefers to move parity shards first, keeping data shards spread out
// since data shards serve read requests while parity shards are only for reconstruction.
func (r *Rebalancer) pickShardToMove(analysis *TopologyAnalysis, nodes []*TopologyNode) (int, *TopologyNode) {
	// Sort by shard count (most shards first)
	slices.SortFunc(nodes, func(a, b *TopologyNode) int {
		return analysis.ShardsByNode[b.NodeID] - analysis.ShardsByNode[a.NodeID]
	})

	// First pass: try to find a parity shard to move (prefer moving parity)
	for _, node := range nodes {
		shards := analysis.NodeToShards[node.NodeID]
		for _, shardID := range shards {
			if r.ecConfig.IsParityShard(shardID) {
				return shardID, node
			}
		}
	}

	// Second pass: if no parity shards, move a data shard
	for _, node := range nodes {
		shards := analysis.NodeToShards[node.NodeID]
		if len(shards) > 0 {
			return shards[0], node
		}
	}

	return -1, nil
}

// pickBestDestination selects the best destination node
func (r *Rebalancer) pickBestDestination(analysis *TopologyAnalysis, targetDC, targetRack string, dist *ECDistribution) *TopologyNode {
	var candidates []*TopologyNode

	// Collect candidates
	for _, node := range analysis.AllNodes {
		// Filter by DC if specified
		if targetDC != "" && node.DataCenter != targetDC {
			continue
		}
		// Filter by rack if specified
		if targetRack != "" && node.Rack != targetRack {
			continue
		}
		// Check capacity
		if node.FreeSlots <= 0 {
			continue
		}
		// Check max shards limit
		if analysis.ShardsByNode[node.NodeID] >= dist.MaxShardsPerNode {
			continue
		}

		candidates = append(candidates, node)
	}

	if len(candidates) == 0 {
		return nil
	}

	// Sort by: 1) fewer shards, 2) more free slots
	slices.SortFunc(candidates, func(a, b *TopologyNode) int {
		aShards := analysis.ShardsByNode[a.NodeID]
		bShards := analysis.ShardsByNode[b.NodeID]
		if aShards != bShards {
			return aShards - bShards
		}
		return b.FreeSlots - a.FreeSlots
	})

	return candidates[0]
}

// applyMovesToAnalysis is a no-op placeholder for potential future use.
// Note: All planners (planDCMoves, planRackMoves, planNodeMoves) update
// their respective counts (ShardsByDC, ShardsByRack, ShardsByNode) and
// shard lists (NodeToShards) inline during planning. This avoids duplicate
// updates that would occur if we also updated counts here.
func (r *Rebalancer) applyMovesToAnalysis(analysis *TopologyAnalysis, moves []ShardMove) {
	// Counts are already updated by the individual planners.
	// This function is kept for API compatibility and potential future use.
}