seaweedFS

Author	SHA1	Message	Date
Chris Lu	4c88fbfd5e	Fix nil pointer crash during concurrent vacuum compaction (#8592) * check for nil needle map before compaction sync When CommitCompact runs concurrently, it sets v.nm = nil under dataFileAccessLock. CompactByIndex does not hold that lock, so v.nm.Sync() can hit a nil pointer. Add an early nil check to return an error instead of crashing. Fixes #8591 * guard copyDataBasedOnIndexFile size check against nil needle map The post-compaction size validation at line 538 accesses v.nm.ContentSize() and v.nm.DeletedSize(). If CommitCompact has concurrently set v.nm to nil, this causes a SIGSEGV. Skip the validation when v.nm is nil since the actual data copy uses local needle maps (oldNm/newNm) and is unaffected. Fixes #8591 * use atomic.Bool for compaction flags to prevent concurrent vacuum races The isCompacting and isCommitCompacting flags were plain bools read and written from multiple goroutines without synchronization. This allowed concurrent vacuums on the same volume to pass the guard checks and run simultaneously, leading to the nil pointer crash. Using atomic.Bool with CompareAndSwap ensures only one compaction or commit can run per volume at a time. Fixes #8591 * use go-version-file in CI workflows instead of hardcoded versions Use go-version-file: 'go.mod' so CI automatically picks up the Go version from go.mod, avoiding future version drift. Reordered checkout before setup-go in go.yml and e2e.yml so go.mod is available. Removed the now-unused GO_VERSION env vars. * capture v.nm locally in CompactByIndex to close TOCTOU race A bare nil check on v.nm followed by v.nm.Sync() has a race window where CommitCompact can set v.nm = nil between the two. Snapshot the pointer into a local variable so the nil check and Sync operate on the same reference.
* add dynamic timeouts to plugin worker vacuum gRPC calls All vacuum gRPC calls used context.Background() with no deadline, so the plugin scheduler's execution timeout could kill a job while a large volume compact was still in progress. Use volume-size-scaled timeouts matching the topology vacuum approach: 3 min/GB for compact, 1 min/GB for check, commit, and cleanup. Fixes #8591 * Revert "add dynamic timeouts to plugin worker vacuum gRPC calls" This reverts commit 80951934c37416bc4f6c1472a5d3f8d204a637d9. * unify compaction lifecycle into single atomic flag Replace separate isCompacting and isCommitCompacting flags with a single isCompactionInProgress atomic.Bool. This ensures CompactBy*, CommitCompact, Close, and Destroy are mutually exclusive — only one can run at a time per volume. Key changes: - All entry points use CompareAndSwap(false, true) to claim exclusive access. CompactByVolumeData and CompactByIndex now also guard v.nm and v.DataBackend with local captures. - Close() waits for the flag outside dataFileAccessLock to avoid deadlocking with CommitCompact (which holds the flag while waiting for the lock). It claims the flag before acquiring the lock so no new compaction can start. - Destroy() uses CAS instead of a racy Load check, preventing concurrent compaction from racing with volume teardown. - unmountVolumeByCollection no longer deletes from the map; DeleteCollectionFromDiskLocation removes entries only after successful Destroy, preventing orphaned volumes on failure. Fixes #8591	2026-03-10 13:31:45 -07:00
Chris Lu	891a2fb6eb	Admin: misc improvements on admin server and workers. EC now works. (#7055) * initial design * added simulation as tests * reorganized the codebase to move the simulation framework and tests into their own dedicated package * integration test. ec worker task * remove "enhanced" reference * start master, volume servers, filer Current Status ✅ Master: Healthy and running (port 9333) ✅ Filer: Healthy and running (port 8888) ✅ Volume Servers: All 6 servers running (ports 8080-8085) 🔄 Admin/Workers: Will start when dependencies are ready * generate write load * tasks are assigned * admin start with grpc port. worker has its own working directory * Update .gitignore * working worker and admin. Task detection is not working yet. * compiles, detection uses volumeSizeLimitMB from master * compiles * worker retries connecting to admin * build and restart * rendering pending tasks * skip task ID column * sticky worker id * test canScheduleTaskNow * worker reconnect to admin * clean up logs * worker register itself first * worker can run ec work and report status but: 1. one volume should not be repeatedly worked on. 2. ec shards needs to be distributed and source data should be deleted. * move ec task logic * listing ec shards * local copy, ec. Need to distribute.
* ec is mostly working now * distribution of ec shards needs improvement * need configuration to enable ec * show ec volumes * interval field UI component * rename * integration test with vacuuming * garbage percentage threshold * fix warning * display ec shard sizes * fix ec volumes list * Update ui.go * show default values * ensure correct default value * MaintenanceConfig use ConfigField * use schema defined defaults * config * reduce duplication * refactor to use BaseUIProvider * each task register its schema * checkECEncodingCandidate use ecDetector * use vacuumDetector * use volumeSizeLimitMB * remove remove * remove unused * refactor * use new framework * remove v2 reference * refactor * left menu can scroll now * The maintenance manager was not being initialized when no data directory was configured for persistent storage. * saving config * Update task_config_schema_templ.go * enable/disable tasks * protobuf encoded task configurations * fix system settings * use ui component * remove logs * interface{} Reduction * reduce interface{} * reduce interface{} * avoid from/to map * reduce interface{} * refactor * keep it DRY * added logging * debug messages * debug level * debug * show the log caller line * use configured task policy * log level * handle admin heartbeat response * Update worker.go * fix EC rack and dc count * Report task status to admin server * fix task logging, simplify interface checking, use erasure_coding constants * factor in empty volume server during task planning * volume.list adds disk id * track disk id also * fix locking scheduled and manual scanning * add active topology * simplify task detector * ec task completed, but shards are not showing up * implement ec in ec_typed.go * adjust log level * dedup * implementing ec copying shards and only ecx files * use disk id when distributing ec shards 🎯 Planning: ActiveTopology creates DestinationPlan with specific TargetDisk 📦 Task Creation: maintenance_integration.go creates ECDestination
with DiskId 🚀 Task Execution: EC task passes DiskId in VolumeEcShardsCopyRequest 💾 Volume Server: Receives disk_id and stores shards on specific disk (vs.store.Locations[req.DiskId]) 📂 File System: EC shards and metadata land in the exact disk directory planned * Delete original volume from all locations * clean up existing shard locations * local encoding and distributing * Update docker/admin_integration/EC-TESTING-README.md Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> * check volume id range * simplify * fix tests * fix types * clean up logs and tests --------- Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>	2025-07-30 12:38:03 -07:00
chrislu	2f1b3d68d7	pass volume version when creating a volume	2025-06-19 01:15:25 -07:00
Bruce	f9e141a412	persist readonly state to volume info (#5977)	2024-09-05 07:58:24 -07:00
steve.wei	0bdf121e51	rename VolumeServerVolumeGauge (#5504)	2024-04-17 04:49:50 -07:00
chrislu	6ebe26a765	Revert "Revert "Revert "Add disk type to prometheus metrics" (#4777)"" This reverts commit `567d788928`.	2023-10-03 08:28:52 -07:00
chrislu	567d788928	Revert "Revert "Add disk type to prometheus metrics" (#4777)" This reverts commit `9215ba24be`.	2023-10-02 11:49:54 -07:00
chrislu	2aa59ab37c	fix copying level db files during commi fix https://github.com/seaweedfs/seaweedfs/issues/4635	2023-07-04 11:28:12 -07:00
Bai Jie	3b88ab42aa	remove duplicate `fileCount` query (#4588)	2023-06-18 00:14:14 -07:00
Bai Jie	44b9d72ef0	doIsEmpty() return error if v.DataBackend is nil (#4587)	2023-06-18 00:13:40 -07:00
柏杰	0b0fb9b9e4	avoid data race read volume.IsEmpty (#4574) * avoid data race read volume.IsEmpty - avoid phantom read isEmpty for onlyEmpty - use `v.DataBackend.GetStat()` in v.dataFileAccessLock scope * add Destroy(onlyEmpty: true) test * add Destroy(onlyEmpty: false) test * remove unused `IsEmpty()` * change literal `8` to `SuperBlockSize`	2023-06-14 14:39:58 -07:00
Konstantin Lebedev	1e22d5caf2	fix get file stats for IsEmpty (#4576)	2023-06-14 01:43:30 -07:00
Konstantin Lebedev	4527ead295	fix from comment delete volume is empty (#4573) * fix from comments https://github.com/seaweedfs/seaweedfs/pull/4561 * fix tests --------- Co-authored-by: Konstantin Lebedev <9497591+kmlebedev@users.noreply.github.co>	2023-06-12 22:22:46 -07:00
Konstantin Lebedev	25535e9c36	Delete volume is empty (#4561) * use onlyEmpty for deleteVolume https://github.com/seaweedfs/seaweedfs/issues/4559 * fix IsEmpty * fix test --------- Co-authored-by: Konstantin Lebedev <9497591+kmlebedev@users.noreply.github.co>	2023-06-12 10:42:44 -07:00
Guo Lei	5b905fb2b7	Lazy loading (#3958) * types packages is imported more than once * lazy-loading * fix bugs * fix bugs * fix unit tests * fix test error * rename function * unload ldb after initial startup * Don't load ldb when starting volume server if ldbtimeout is set. * remove unnecessary unloadldb * Update weed/command/server.go Co-authored-by: Chris Lu <chrislusf@users.noreply.github.com> * Update weed/command/volume.go Co-authored-by: Chris Lu <chrislusf@users.noreply.github.com> Co-authored-by: guol-fnst <goul-fnst@fujitsu.com> Co-authored-by: Chris Lu <chrislusf@users.noreply.github.com>	2022-11-14 00:19:27 -08:00
Konstantin Lebedev	1f7e52c63e	vacuum metrics and force sync dst files (#3832)	2022-10-13 00:51:20 -07:00
chrislu	7c6324b114	adjust log level	2022-09-04 22:21:24 -07:00
Konstantin Lebedev	ade94b0d0a	avoid race conditions access to SuperBlock.Version (#3539) * avoid race conditions access to SuperBlock.Version https://github.com/seaweedfs/seaweedfs/issues/3515 * superBlockAccessLock replace to sync.Mutex	2022-08-30 00:08:00 -07:00
Guo Lei	c57c79a0ab	optimize committing compact (#3388) * optimize vacuuming volume * fix bugs * rename parameters * fix conflict * change copyDataBasedOnIndexFile to an instance method * close needlemap * optimize committing Vacuum volume for leveldb index * fix bugs * fix leveldb loading bugs * refactor * fix leveldb loading bug * add leveldb recovery * add test case for levelDB * modify test case to cover all the new branches * use one tmpNm instead of two instances * refactor * refactor * move setWatermark to the end * add test for watermark and updating leveldb * fix error logic * refactor, add test * check nil before close needlemapeer add test case fix metric bug * add tests, fix bugs * adjust log level remove wrong test case refactor * avoid duplicate updating metric for leveldb index	2022-08-23 23:53:35 -07:00
chrislu	26dbc6c905	move to https://github.com/seaweedfs/seaweedfs	2022-07-29 00:17:28 -07:00
chrislu	9f8b72a54d	Revert "Merge pull request #3159 from shichanglin5/_duplicateUUID" This reverts commit `37da689319`, reversing changes made to `00d53c34c4`.	2022-06-10 06:38:17 -07:00
shichanglin5	f5b0c04b14	perf: Optimized volume handling duplicateUUID logic to avoid quitting when volume is actually normal Under normal circumstances, there will be no problems, but when the master is debugged in the local environment, the volume client cannot communicate with the master normally, so the sendHeartBeat logic is restarted, and a new connection is created to report the heartbeat. If the master has not cleared the uuid of the volume at this time, then the master will respond to volume duplicateUUIDS, and the volume service will exit, but in fact the uuid of the volume is not duplicated	2022-06-09 20:41:16 +08:00
chrislu	70e5a1b632	volume close should wait for committing compaction	2022-04-26 23:34:05 -07:00
Chris Lu	ffe028f8d0	Merge pull request #2974 from kmlebedev/wait_volume_closed_compression wait volume being closed during compression idx	2022-04-26 23:29:22 -07:00
chrislu	37ab8909b0	use two flags: v.isCompacting and v.isCommitCompacting	2022-04-26 23:28:34 -07:00
chrislu	94f824e1ce	volume: sync to disk before copying volume files address https://github.com/chrislusf/seaweedfs/issues/2976	2022-04-26 13:03:43 -07:00
Konstantin Lebedev	7315d1d039	wait volume being closed during compression idx	2022-04-26 13:40:42 +05:00
Konstantin Lebedev	9438738693	avoid invalid memory address or nil pointer dereference	2022-04-18 12:10:22 +05:00
chrislu	a129bda7d9	sync data first before stopping	2022-02-16 09:11:34 -08:00
Konstantin Lebedev	99ef280c7c	avoid data loss after restarting a container with a volume server	2021-06-02 17:07:19 +05:00
Chris Lu	972327f966	prevent nil volume nm	2021-03-13 11:04:51 -08:00
Chris Lu	f8446b42ab	this can compile now!!!	2021-02-16 02:47:02 -08:00
Chris Lu	821c46edf1	Merge branch 'master' into support_ssd_volume	2021-02-09 11:37:07 -08:00
bingoohuang	94ea3bd3a5	renaming NeedleMapType to NeedleMapKind	2021-02-07 09:00:03 +08:00
Chris Lu	4f31c1bb94	go fmt	2020-12-22 02:34:08 -08:00
Chris Lu	94525aa0fd	allocate volume by disk type	2020-12-13 23:08:21 -08:00
Chris Lu	0d2ec832e2	rename from volumeType to diskType	2020-12-13 11:59:32 -08:00
Chris Lu	d156c74ec0	volume server set volume type and heartbeat to the master	2020-12-13 03:11:24 -08:00
Chris Lu	ae655033ac	adjust logging	2020-12-11 16:57:53 -08:00
Chris Lu	2c913dde04	volume: detect and drop volumes with disk IO error from Jethro in slack: is it possible to make the assign request a bit smarter? Currently I’m in the state that a disk failed but all assign request are being send to this volume. It would be cool if the master sees this and stopped using this volume. e=HTTP(http://x:8089/913,045a782b63176edf) not 200 but 500 Internal Server Error Body={"size":740167,"error":"failed to write to local disk: write /mnt/v9/913.dat: input/output error","eTag":"ee4381e202212ff3aee647704c036689"} e=HTTP(http://x:8089/913,045a782c90240077) not 200 but 500 Internal Server Error Body={"size":792779,"error":"failed to write to local disk: write /mnt/v9/913.dat: input/output error","eTag":"c43463ccc11eb6eb2fc306f407a6a953"} e=HTTP(http://x:8089/913,045a782e6b7901ea) not 200 but 500 Internal Server Error Body={"size":3962392,"error":"failed to write to local disk: write /mnt/v9/913.dat: input/output error","eTag":"04c91198e9b276c81f11dbf189af5d28"}	2020-11-28 00:09:29 -08:00
Chris Lu	6d30b21b10	volume: add "-dir.idx" option for separate index storage fix https://github.com/chrislusf/seaweedfs/issues/1265	2020-11-27 03:17:10 -08:00
Chris Lu	9104cfa744	reduce locks	2020-10-24 19:40:35 -07:00
James Hartig	3ccfa4c6ad	Added VolumeMarkWritable and VolumeStatus grpc methods This is necessary for copy to mark as read-only and then restore the original state afterwards.	2020-08-19 11:42:56 -04:00
Chris Lu	faa5c2e89a	refactoring	2020-07-03 16:34:31 -07:00
Evgenii Kozlov	0e0db70f55	Set volumes ReadOnly if low free disk space	2020-06-05 18:18:15 +03:00
Chris Lu	5568395edd	Revert "Revert "Merge pull request #1299 from song-zhang/master"" This reverts commit `afb6a1dbb4`.	2020-05-06 15:37:17 -07:00
Chris Lu	afb6a1dbb4	Revert "Merge pull request #1299 from song-zhang/master" This reverts commit `9016fa19ba`, reversing changes made to `47234760f4`.	2020-05-04 20:34:26 -07:00
zhangsong	f9e8702bb4	use async write to persistent file to disk - part1	2020-05-04 17:39:44 +08:00
Chris Lu	c3cb6fa1d7	volume: compaction can cause readonly volumes address https://github.com/chrislusf/seaweedfs/issues/1233	2020-03-17 09:43:57 -07:00
Chris Lu	89eb05b50f	filer: support TTL for all filer stores	2020-03-09 01:02:01 -07:00

114 Commits