syncthing

mirror of https://github.com/octoleo/syncthing.git synced 2024-11-09 14:50:56 +00:00

Author	SHA1	Message	Date
greatroar	42917d707d	lib/scanner: Remove unused field, move WaitGroup.Add out of loop (#7323 )	2021-02-03 14:25:24 +01:00
greatroar	fbe52faf49	lib/scanner: Allocate structure for final partial block (#7310 ) Benchmark results on Linux/amd64, using updated benchmark for old and new: name old time/op new time/op delta HashFile-8 88.6ms ± 1% 88.3ms ± 1% -0.33% (p=0.046 n=19+19) name old speed new speed delta HashFile-8 201MB/s ± 1% 202MB/s ± 1% +0.33% (p=0.044 n=19+19) name old alloc/op new alloc/op delta HashFile-8 59.4kB ± 0% 46.1kB ± 0% -22.47% (p=0.000 n=14+20) name old allocs/op new allocs/op delta HashFile-8 29.0 ± 0% 27.0 ± 0% -6.90% (p=0.000 n=20+20) Co-authored-by: greatroar <@>	2021-01-28 14:23:24 +01:00
Simon Frei	9524b51708	all: Implement suture v4-api (#6947 )	2020-11-17 13:19:04 +01:00
Simon Frei	31559e908b	all: Add untrusted folders behind feature flag (ref #62 ) (#7055 )	2020-11-09 15:33:32 +01:00
Audrius Butkevicius	e027175446	all: Move remaining protos to use the vanity plugin (#7009 )	2020-10-02 08:07:05 +02:00
Simon Frei	8452fd2ab4	lib/model, lib/scanner: Prevent races aborting scans (fixes #6994 ) (#6997 )	2020-09-25 11:27:44 +02:00
Simon Frei	932d8c69de	lib/fs: Properly handle case insensitive systems (fixes #1787 , fixes #2739 , fixes #5708 ) With this change we emulate a case sensitive filesystem on top of insensitive filesystems. This means we correctly pick up case-only renames and throw a case conflict error when there would be multiple files differing only in case. This safety check has a small performance hit (about 20% more filesystem operations when scanning for changes). The new advanced folder option `caseSensitiveFS` can be used to disable the safety checks, retaining the previous behavior on systems known to be fully case sensitive. Co-authored-by: Jakob Borg <jakob@kastelo.net>	2020-07-28 11:15:11 +02:00
Jakob Borg	fc1dac5196	lib/scanner: Less strict validation (fixes #6827 ) (#6828 ) This fixes the change in #6674 where the weak hash became a deciding factor. Now we again just use it to accept a block, but don't take a negative as meaning the block is bad.	2020-07-11 14:48:45 +02:00
Simon Frei	90e248615f	lib/scanner: Test weak hash consistency (ref #5556 ) (#6794 ) Relevant much earlier changes: `9b1c592fb7` `bd1c29ee32` Make sure vanilla and rolling adler are consistent. And that they match with scanner.Validate.	2020-06-25 14:47:35 +02:00
greatroar	d985aa9e4b	lib/scanner: Fix Validate docs (#6776 ) Co-authored-by: greatroar <@>	2020-06-21 19:23:06 +01:00
Simon Frei	a47546a1f1	lib/scanner, lib/model: Improve error handling when scanning (#6736 )	2020-06-16 09:25:41 +02:00
Simon Frei	0b65a616ba	lib/scanner: Save one stat call per file (#6715 )	2020-06-08 08:14:50 +02:00
greatroar	9c0825c0d9	lib/scanner: Simplify, optimize and document Validate (#6674 ) (#6688 )	2020-05-27 22:23:12 +02:00
Simon Frei	07ce3572a0	lib/ignore: Only skip for toplevel includes (fixes #6487 ) (#6508 )	2020-04-07 10:23:38 +02:00
Jakob Borg	7956e7d0ef	lib/{fs,scanner}: gofmt from Go 1.14 (#6509 )	2020-04-07 09:31:29 +02:00
Jakob Borg	dd92b2b8f4	all: Tweak error creation (#6391 ) - In the few places where we wrap errors, use the new Go 1.13 "%w" construction instead of %s or %v. - Where we create errors with constant strings, consistently use errors.New and not fmt.Errorf. - Remove capitalization from errors in the few places where we had that.	2020-03-03 22:40:00 +01:00
Jakob Borg	8fc2dfad0c	lib/db: Deduplicate block lists in database (fixes #5898 ) (#6283 ) * lib/db: Deduplicate block lists in database (fixes #5898) This moves the block list in the database out from being just a field on the FileInfo to being an object of its own. When putting a FileInfo we marshal the block list separately and store it keyed by the sha256 of the marshalled block list. When getting, if we are not doing a "truncated" get, we do an extra read and unmarshal for the block list. Old block lists are cleared out by a periodic GC sweep. The alternative would be to use refcounting, but: - There is a larger risk of getting that wrong and either dropping a block list in error or keeping them around forever. - It's tricky with our current database, as we don't have dirty reads. This means that if we update two FileInfos with identical block lists in the same transaction we can't just do read/modify/write for the ref counters as we wouldn't see our own first update. See above about tracking this and risks about getting it wrong. GC uses a bloom filter for keys to avoid heavy RAM usage. GC can't run concurrently with FileInfo updates so there is a new lock around those operation at the lowlevel. The end result is a much more compact database, especially for setups with many peers where files get duplicated many times. This is per-key-class stats for a large database I'm currently working with, under the current schema: ``` 0x00: 9138161 items, 870876 KB keys + 7397482 KB data, 95 B + 809 B avg, 1637651 B max 0x01: 185656 items, 10388 KB keys + 1790909 KB data, 55 B + 9646 B avg, 924525 B max 0x02: 916890 items, 84795 KB keys + 3667 KB data, 92 B + 4 B avg, 192 B max 0x03: 384 items, 27 KB keys + 5 KB data, 72 B + 15 B avg, 87 B max 0x04: 1109 items, 17 KB keys + 17 KB data, 15 B + 15 B avg, 69 B max 0x06: 383 items, 3 KB keys + 0 KB data, 9 B + 2 B avg, 18 B max 0x07: 510 items, 4 KB keys + 12 KB data, 9 B + 24 B avg, 41 B max 0x08: 1349 items, 12 KB keys + 10 KB data, 9 B + 8 B avg, 17 B max 0x09: 194 items, 0 KB keys + 123 KB data, 5 B + 634 B avg, 11484 B max 0x0a: 3 items, 0 KB keys + 0 KB data, 14 B + 7 B avg, 30 B max 0x0b: 181836 items, 2363 KB keys + 10694 KB data, 13 B + 58 B avg, 173 B max Total 10426475 items, 968490 KB keys + 9202925 KB data. ``` Note 7.4 GB of data in class 00, total size 9.2 GB. After running the migration we get this instead: ``` 0x00: 9138161 items, 870876 KB keys + 2611392 KB data, 95 B + 285 B avg, 4788 B max 0x01: 185656 items, 10388 KB keys + 1790909 KB data, 55 B + 9646 B avg, 924525 B max 0x02: 916890 items, 84795 KB keys + 3667 KB data, 92 B + 4 B avg, 192 B max 0x03: 384 items, 27 KB keys + 5 KB data, 72 B + 15 B avg, 87 B max 0x04: 1109 items, 17 KB keys + 17 KB data, 15 B + 15 B avg, 69 B max 0x06: 383 items, 3 KB keys + 0 KB data, 9 B + 2 B avg, 18 B max 0x07: 510 items, 4 KB keys + 12 KB data, 9 B + 24 B avg, 41 B max 0x09: 194 items, 0 KB keys + 123 KB data, 5 B + 634 B avg, 11484 B max 0x0a: 3 items, 0 KB keys + 0 KB data, 14 B + 17 B avg, 51 B max 0x0b: 181836 items, 2363 KB keys + 10694 KB data, 13 B + 58 B avg, 173 B max 0x0d: 44282 items, 1461 KB keys + 61081 KB data, 33 B + 1379 B avg, 1637399 B max Total 10469408 items, 969939 KB keys + 4477905 KB data. ``` Class 00 is now down to 2.6 GB, with just 61 MB added in class 0d. There will be some additional reads in some cases which theoretically hurts performance, but this will be more than compensated for by smaller writes and better compaction. On my own home setup which just has three devices and a handful of folders the difference is smaller in absolute numbers of course, but still less than half the old size: ``` 0x00: 297122 items, 20894 KB keys + 306860 KB data, 70 B + 1032 B avg, 103237 B max 0x01: 115299 items, 7738 KB keys + 17542 KB data, 67 B + 152 B avg, 419 B max 0x02: 1430537 items, 121223 KB keys + 5722 KB data, 84 B + 4 B avg, 253 B max ... Total 1947412 items, 151268 KB keys + 337485 KB data. ``` to: ``` 0x00: 297122 items, 20894 KB keys + 37038 KB data, 70 B + 124 B avg, 520 B max 0x01: 115299 items, 7738 KB keys + 17542 KB data, 67 B + 152 B avg, 419 B max 0x02: 1430537 items, 121223 KB keys + 5722 KB data, 84 B + 4 B avg, 253 B max ... 0x0d: 18041 items, 595 KB keys + 71964 KB data, 33 B + 3988 B avg, 101109 B max Total 1965447 items, 151863 KB keys + 139628 KB data. ``` * wip * wip * wip * wip	2020-01-24 08:35:44 +01:00
Simon Frei	f747ba6d69	lib/ignore: Keep skipping ignored dirs for rooted patterns (#6151 ) * lib/ignore: Keep skipping ignored dirs for rooted patterns * review * clarify comment and lint * glob.QuoteMeta * review	2019-11-26 07:37:41 +00:00
Simon Frei	72194d137c	lib/scanner: Don't scan if input path is below symlink (fixes #6090 ) (#6101 )	2019-10-22 11:12:21 +02:00
Lukas Lihotzki	96bb1c8e29	all, lib/logger: Refactor SetDebug calls (#6054 )	2019-10-04 13:03:34 +02:00
Jakob Borg	24d4290d03	lib/model, lib/scanner: Pass a valid event logger (fixes #5970 ) (#5971 )	2019-08-21 08:05:43 +02:00
Simon Frei	b1c74860e8	all: Remove global events.Default (ref #4085 ) (#5886 )	2019-08-15 16:29:37 +02:00
Simon Frei	7a4c88d4e4	lib: Add mtime window when comparing files (#5852 )	2019-07-23 21:48:53 +02:00
Aurélien Rainone	f1a7dd766e	all: Add comment to ensure correct atomics alignment (fixes #5813 ) Per the sync/atomic bug note: > On ARM, x86-32, and 32-bit MIPS, it is the caller's > responsibility to arrange for 64-bit alignment of 64-bit words > accessed atomically. The first word in a variable or in an > allocated struct, array, or slice can be relied upon to be > 64-bit aligned. All atomic accesses of 64-bit variables in syncthing code base are currently ok (i.e they are all 64-bit aligned). Generally, the bug is triggered because of incorrect alignement of struct fields. Free variables (declared in a function) are guaranteed to be 64-bit aligned by the Go compiler. To ensure the code remains correct upon further addition/removal of fields, which would change the currently correct alignment, I added the following comment where required: // atomic, must remain 64-bit aligned See https://golang.org/pkg/sync/atomic/#pkg-note-BUG.	2019-07-13 14:05:39 +01:00
Jakob Borg	997bb5e7e1	all: Remove "large blocks" config (#5763 ) We now always use large / variable blocks.	2019-06-06 15:57:38 +01:00
Simon Frei	926e9228ed	lib/scanner, lib/model: File -> item when logging error (#5664 )	2019-04-21 16:19:59 +01:00
Simon Frei	0f80318ef6	lib/scanner: Consistenlty use CreateFileInfo and remove outdated comment (#5574 )	2019-03-04 13:27:33 +01:00
Audrius Butkevicius	fafd30f804	lib/scanner: Use standard adler32 when we don't need rolling (#5556 ) * lib/scanner: Use standard adler32 when we don't need rolling Seems the rolling adler32 implementation is super slow when executed on large blocks, even tho I can't explain why. BenchmarkFind1MFile-16 100 18991667 ns/op 55.21 MB/s 398844 B/op 20 allocs/op BenchmarkBlock/adler32-131072/#00-16 200 9726519 ns/op 1078.06 MB/s 2654936 B/op 163 allocs/op BenchmarkBlock/bozo32-131072/#00-16 20 73435540 ns/op 142.79 MB/s 2654928 B/op 163 allocs/op BenchmarkBlock/buzhash32-131072/#00-16 20 61482005 ns/op 170.55 MB/s 2654928 B/op 163 allocs/op BenchmarkBlock/buzhash64-131072/#00-16 20 61673660 ns/op 170.02 MB/s 2654928 B/op 163 allocs/op BenchmarkBlock/vanilla-adler32-131072/#00-16 300 4377307 ns/op 2395.48 MB/s 2654935 B/op 163 allocs/op BenchmarkBlock/adler32-16777216/#00-16 2 544010100 ns/op 19.27 MB/s 65624 B/op 5 allocs/op BenchmarkBlock/bozo32-16777216/#00-16 1 4678108500 ns/op 2.24 MB/s 51970144 B/op 24 allocs/op BenchmarkBlock/buzhash32-16777216/#00-16 1 3880370700 ns/op 2.70 MB/s 51970144 B/op 24 allocs/op BenchmarkBlock/buzhash64-16777216/#00-16 1 3875911700 ns/op 2.71 MB/s 51970144 B/op 24 allocs/op BenchmarkBlock/vanilla-adler32-16777216/#00-16 300 4010279 ns/op 2614.72 MB/s 65624 B/op 5 allocs/op BenchmarkRoll/adler32-131072/#00-16 2000 974279 ns/op 134.53 MB/s 270 B/op 0 allocs/op BenchmarkRoll/bozo32-131072/#00-16 2000 791770 ns/op 165.54 MB/s 270 B/op 0 allocs/op BenchmarkRoll/buzhash32-131072/#00-16 2000 917409 ns/op 142.87 MB/s 270 B/op 0 allocs/op BenchmarkRoll/buzhash64-131072/#00-16 2000 881125 ns/op 148.76 MB/s 270 B/op 0 allocs/op BenchmarkRoll/adler32-16777216/#00-16 10 124000400 ns/op 135.30 MB/s 7548937 B/op 0 allocs/op BenchmarkRoll/bozo32-16777216/#00-16 10 118008080 ns/op 142.17 MB/s 7548928 B/op 0 allocs/op BenchmarkRoll/buzhash32-16777216/#00-16 10 126794440 ns/op 132.32 MB/s 7548928 B/op 0 allocs/op BenchmarkRoll/buzhash64-16777216/#00-16 10 126631960 ns/op 132.49 MB/s 7548928 B/op 0 allocs/op * Update benchmark_test.go * gofmt * fixup benchmark	2019-02-25 13:29:31 +04:00
Jakob Borg	c2ddc83509	all: Revert the underscore sillyness	2019-02-02 12:16:27 +01:00
Jakob Borg	0b2cabbc31	all: Even more boring linter fixes (#5501 )	2019-02-02 11:45:17 +01:00
Jakob Borg	75dcff0a0e	all: Copy owner/group from parent (fixes #5445 ) (#5479 ) This adds a folder option "CopyOwnershipFromParent" which, when set, makes Syncthing attempt to retain the owner/group information when syncing files. Specifically, at the finisher stage we look at the parent dir to get owner/group and then attempt a Lchown call on the temp file. For this to succeed Syncthing must be running with the appropriate permissions. On Linux this is CAP_FOWNER, which can be granted by the service manager on startup or set on the binary in the filesystem. Other operating systems do other things, but often it's not required to run as full "root". On Windows this patch does nothing - ownership works differently there and is generally less of a deal, as permissions are inherited as ACLs anyway. There are unit tests on the Lchown functionality, which requires the above permissions to run. There is also a unit test on the folder which uses the fake filesystem and hence does not need special permissions.	2019-01-25 09:52:21 +01:00
Simon Frei	99c9d65ddf	lib/scanner: Check ignore patterns before reporting error (fixes #5397 ) (#5398 )	2018-12-21 12:08:15 +01:00
Simon Frei	c40c9a8d6a	lib/scanner: Don't report error on missing items (fixes #5385 ) (#5387 )	2018-12-17 14:52:15 +01:00
Simon Frei	c934918347	lib/scanner: Fix empty path on normalization error (fixes #5369 ) (#5370 )	2018-12-12 12:09:39 +01:00
Simon Frei	2f9840ddae	lib: Introduce fs.IsParent (fixes #5324 ) (#5326 )	2018-11-22 11:16:45 +01:00
Jakob Borg	c0a26c918a	lib/scanner, vendor: Update github.com/chmduquesne/rollinghash (fixes #5334 ) (#5335 ) Updates the package and fixes a test that depended on the old behavior of Write() being equivalent to Reset()+Write() which is no longer the case. The scanner already did resets after each block write, so this is fine.	2018-11-22 08:50:06 +01:00
Simon Frei	d510e3cca3	all: Display errors while scanning in web UI (fixes #4480 ) (#5215 )	2018-11-07 11:04:41 +01:00
Simon Frei	4f4781d254	lib/scanner: Centralise protocol.FileInfo creation (#5202 )	2018-09-18 15:34:17 +02:00
Simon Frei	272fb3b444	all: Adjust windows perms in fs package (#5200 )	2018-09-16 16:09:56 +02:00
Simon Frei	c8652222ef	all: Check files on disk/in db when deleting/renaming (fixes #5194 ) (#5195 )	2018-09-16 09:48:14 +02:00
Jakob Borg	f822b10550	all: Add receive only folder type (#5027 ) Adds a receive only folder type that does not send changes, and where the user can optionally revert local changes. Also changes some of the icons to make the three folder types distinguishable.	2018-07-12 11:15:57 +03:00
Jakob Borg	b1b68ceedb	Add LocalFlags to FileInfo (#4952 ) We have the invalid bit to indicate that a file isn't good. That's enough for remote devices. For ourselves, it would be good to know sometimes why the file isn't good - because it's an unsupported type, because it matches an ignore pattern, or because we detected the data is bad and we need to rescan it. Or, and this is the main future reason for the PR, because it's a change detected on a receive only device. We will want something like the invalid flag for those changes, but marking them as invalid today means the scanner will rehash them. Hence something more fine grained is required. This introduces a LocalFlags fields to the FileInfo where we can stash things that we care about locally. For example, FlagLocalUnsupported = 1 << 0 // The kind is unsupported, e.g. symlinks on Windows FlagLocalIgnored = 1 << 1 // Matches local ignore patterns FlagLocalMustRescan = 1 << 2 // Doesn't match content on disk, must be rechecked fully The LocalFlags fields isn't sent over the wire; instead the Invalid attribute is calculated based on the flags at index sending time. It's on the FileInfo anyway because that's what we serialize to database etc. The actual Invalid flag should after this just be considered when building the global state and figuring out availability for remote devices. It is not used for local file index entries.	2018-06-24 09:50:18 +02:00
Simon Frei	8ff7ceeddc	lib/ignore, lib/scanner: Fix recursion to catch included paths (fixes #5009 ) (#5010 )	2018-06-18 08:22:19 +02:00
Jakob Borg	8b15624f7d	lib/scanner: Skip block size hysteresis test in -short mode This burns like a percent of my laptop battery on every invocation...	2018-06-02 13:10:05 +00:00
Audrius Butkevicius	d60f0e734c	lib/scanner: Copy execute bits from previous version on Windows (fixes #4969 ) (#4970 )	2018-05-29 07:01:23 +01:00
Simon Frei	d59aecba31	lib/ignore, lib/scanner: Catch included items below ignored ones (#4811 )	2018-05-14 09:47:23 +02:00
Audrius Butkevicius	ef0dcea6a4	lib/model: Verify request content against weak (and possibly strong) hash (#4767 )	2018-05-05 10:24:44 +02:00
Jakob Borg	19c7cd99f5	all: Implement variable sized blocks (fixes #4807 )	2018-04-16 19:08:50 +01:00
Simon Frei	69f2c26d50	lib/scanner, lib/model: Actually assign version when un-ignoring (fixes #4841 ) (#4842 ) This fixes a mistake introduced in #4750 and #4776 and is relevant to v0.14.46-rc1	2018-03-27 16:24:20 -04:00
Simon Frei	8b4346c3ec	lib/scanner, lib/fs: Don't create file infos with abs paths (fixes #4799 ) (#4800 )	2018-03-12 13:18:59 +01:00

1 2 3

129 Commits