Systemd

Author	SHA1	Message	Date
Zbigniew Jędrzejewski-Szmek	5a271b08b3	journal: remove unused args from journal_file_copy_entry()	2018-05-31 14:30:23 +02:00
Zbigniew Jędrzejewski-Szmek	c627395366	journal: refuse an entry with invalid timestamp fields The journal verification functions would reject such an entry. It would probably still display fine (because we prefer _SOURCE_REALTIME_TIMESTAMP= if present), but it seems wrong to create an entry that would not pass verification.	2018-05-31 14:30:23 +02:00
Zbigniew Jędrzejewski-Szmek	fd4885df94	journal: allow writing journal files even if machine-id is missing The code to open journal files seems like the wrong place to enforce this. We already check during boot and refuse to boot if machine-id is missing, no need to enforce this here. In particular, it seems better to write logs from journald even if they are not completely functional rather than refuse to operate at all, and systemd-journal-remote also writes journal files and may even be run on a system without systemd at all. The docker image that oss-fuzz uses has an empty /etc/machine-id. Obviously this is an error in the docker, but docker is a fact of life, and it seems better for systemd-journal-remote to work in such an incomplete environment.	2018-05-31 13:04:18 +02:00
Vito Caputo	83bf6b6741	journal-file: avoid joining offline thread In journal_file_set_online() the offline thread doesn't need to be joined if it's been canceled before actually reaching the phase of writing the offline state.	2018-05-29 17:01:23 +02:00
Lennart Poettering	cf409d15fa	tree-wide: use newa() rather than alloca() where we can	2018-04-27 14:29:06 +02:00
Zbigniew Jędrzejewski-Szmek	11a1589223	tree-wide: drop license boilerplate Files which are installed as-is (any .service and other unit files, .conf files, .policy files, etc), are left as is. My assumption is that SPDX identifiers are not yet that well known, so it's better to retain the extended header to avoid any doubt. I also kept any copyright lines. We can probably remove them, but it'd be nice to obtain explicit acks from all involved authors before doing that.	2018-04-06 18:58:55 +02:00
Yu Watanabe	1cc6c93a95	tree-wide: use TAKE_PTR() and TAKE_FD() macros	2018-04-05 14:26:26 +09:00
Lennart Poettering	96d4d0244b	journal-file: we can't use a chain cache entry if we don't know where it starts (#8542) It might happen that we try to bisect through a chain of offset arrays in the journal whose last element was just allocated but no item yet written to. In that case that array will be all NUL, but it might still end up in our array chain cache. If it does, we cannot use it for bisection, since for bisection we need to know the value of the first entry in that array, but if it's uninitialized it does not have a first value. Hence, as a simple fix, in this unlikely case, simply ignore the chain cache. This is supposed to fix the issue pointed out in #8432, but in a more permissive way, as this case isn't strictly a badly formatted journal but actually a valid state (though one within a very short time window), and we should make the best of it, and handle it gracefully. Background: in each journal file entries are linked up in large arrays of offsets. In each array the entries are strictly ordered by the offsets of the entries, which permits search by bisection. These arrays are allocated with a fixed size and then filled up as entries are added to the journal file. If an array is fully filled up, a new array (double in size as the old one) is appended to the journal file, and linked up. This means, the journal file will contain a series of chained up arrays, each time doubling in size, and strictly ordered. When looking for an entry we maintain a "chain cache", which allows us to bypass traversing the chain in full if we look for entries close to each other in a short time. With the fix above we make sure we don't erroneously use a chain cache item that doesn't carry enough information for this bisection to work. Original issue identified (with patch) by @Kxuan. Replaces: #8432	2018-03-27 09:36:49 +02:00
Zbigniew Jędrzejewski-Szmek	55c36ec0c1	Merge pull request #8508 from poettering/more-cocci two new coccinelle rules files and their results	2018-03-21 12:50:49 +01:00
Lennart Poettering	d9a43665eb	Merge pull request #8313 from alexgartrell/compression-threshold Compression threshold	2018-03-21 12:37:54 +01:00
Lennart Poettering	ffe535e43e	journal-file: drop unused tail_entry_monotonic_valid field. As pointed out by Matthijs van Duin: https://lists.freedesktop.org/archives/systemd-devel/2018-March/040499.html	2018-03-20 23:31:11 +01:00
Lennart Poettering	be6b0c2165	coccinelle: make use of DIV_ROUND_UP() wherever appropriate Let's use our macros where we can	2018-03-20 20:59:02 +01:00
Alex Gartrell	57850536d5	journal: provide compress_threshold_bytes parameter Previously the compression threshold was hardcoded to 512, which meant that smaller values wouldn't be compressed. This left some storage savings on the table, so instead, we make that number tunable.	2018-03-20 11:48:52 -07:00
Lennart Poettering	4c2e1b399f	xattr-util: use crtime/btime if statx() is available for implementation of fd_setcrtime() and friends The Linux kernel exposes the birth time now for files through statx() hence make use of it where available. We keep the xattr logic in place for this however, since only a subset of file systems on Linux currently expose the birth time. NFS and tmpfs for example do not support it. OTOH there are other file systems that do support the birth time but might not support xattrs (smb…), hence make the best of the two, in particular in order to deal with journal files copied between file system types and to maintain compatibility with older file systems that are updated to newer version of the file system.	2018-02-20 15:41:49 +01:00
Lennart Poettering	8fc58f1ad3	journal-file: fix typo in log message	2018-02-20 15:39:31 +01:00
Lennart Poettering	11b29a96e9	fs-util: move fsync_directory_of_file() into generic code This function used by the journal code is pretty useful generically, let's move it to fs-util.c to make it useful for other code too.	2018-02-20 15:39:31 +01:00
Lennart Poettering	3cc4411403	stat-util: unify code that checks whether something is a regular file Let's add a common implementation for regular file checks, that are careful to return the right error code (EISDIR/EISLNK/EBADFD) when we are encountering a wrong file node.	2018-02-20 15:39:31 +01:00
Lennart Poettering	817b1c5b1e	journal-file: add O_NONBLOCK for paranoia when opening journal files	2018-02-20 15:39:21 +01:00
Lennart Poettering	8d6a4d33e1	journal-file: refuse opening non-regular journal files Let's check the file node type when we open/stat journal files: refuse anything that is not a regular file...	2018-02-20 12:53:10 +01:00
Lennart Poettering	6eda13d3ba	journal: loosen restrictions on journal file suffix (#8013) Previously, we'd refuse to open journal files with suffixes that aren't either .journal or .journal~. With this change we only care when we are creating the journal file. I looked over the sources to see whether we ever pass files discovered by directory enumeration to journal_file_open() without first checking the suffix (in which case the old check made sense), but I couldn't find any. Hence I am pretty sure removing this check is safe. Fixes: #7972	2018-01-27 17:32:36 +09:00
Lennart Poettering	5e9f01e8a6	tree-wide: in all threads we fork off in library code, block all signals This ensures that in all threads we fork off in the background in our code we mask out all signals, so that our thread won't end up getting signals delivered the main process should be getting. We always set the signal mask before forking off the thread, so that the thread has the right mask set from its earliest existence on.	2018-01-04 13:27:27 +01:00
Lennart Poettering	fa7ff4cf03	tree-wide: properly name all threads we fork off	2017-12-25 11:48:21 +01:00
Lennart Poettering	fbd0b64f44	tree-wide: make use of new STRLEN() macro everywhere (#7639) Let's employ coccinelle to do this for us. Follow-up for #7625.	2017-12-14 19:02:29 +01:00
Zbigniew Jędrzejewski-Szmek	f916819053	journal: use new helpers with journal_file_close journal_file_close_set() is not necessary anymore.	2017-11-28 21:34:50 +01:00
Shawn Landden	4831981d89	tree-wide: adjust fall through comments so that gcc is happy Distcc removes comments, making the comment silencing not work. I know there was a decision against a macro in commit `ec251fe7d5`	2017-11-20 13:06:25 -08:00
Zbigniew Jędrzejewski-Szmek	53e1b68390	Add SPDX license identifiers to source files under the LGPL This follows what the kernel is doing, c.f. https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=5fd54ace4721fc5ce2bb5aef6318fcf17f421460.	2017-11-19 19:08:15 +01:00
Zbigniew Jędrzejewski-Szmek	5180446051	journal: disable -Waddress-of-packed-member under clang clang warns about a few sites like this: ../src/journal/journal-file.c:1780:48: warning: taking address of packed member 'entry_offset' of class or structure 'DataObject' may result in an unaligned pointer value [-Waddress-of-packed-member] &o->data.entry_offset, ^~~~~~~~~~~~~~~~~~~~ but DataObject.entry_offset will always be 8-byte aligned as long as the DataObject structure is aligned. Similarly in other cases, the field is always aligned. Let's just silence the warning to avoid noise. gcc does not know -Waddress-of-packed-member, and would warn about an unknown warning, so we need to conditionalize on __clang__.	2017-11-01 23:10:25 +01:00
Zbigniew Jędrzejewski-Szmek	349cc4a507	build-sys: use #if Y instead of #ifdef Y everywhere The advantage is that if the name is misspelled, cpp will warn us. $ git grep -Ee "conf.set\('(HAVE|ENABLE)_" -l|xargs sed -r -i "s/conf.set\('(HAVE|ENABLE)_/conf.set10('\1_/" $ git grep -Ee '#ifn?def (HAVE|ENABLE)' -l|xargs sed -r -i 's/#ifdef (HAVE|ENABLE)/#if \1/; s/#ifndef (HAVE|ENABLE)/#if ! \1/;' $ git grep -Ee 'if.*defined\(HAVE' -l|xargs sed -i -r 's/defined\((HAVE_[A-Z0-9_]*)\)/\1/g' $ git grep -Ee 'if.*defined\(ENABLE' -l|xargs sed -i -r 's/defined\((ENABLE_[A-Z0-9_]*)\)/\1/g' + manual changes to meson.build squash! build-sys: use #if Y instead of #ifdef Y everywhere v2: - fix incorrect setting of HAVE_LIBIDN2	2017-10-04 12:09:29 +02:00
Andreas Rammhold	ec2ce0c5d7	tree-wide: use `!IN_SET(..)` for `a != b && a != c && …` The included cocci was used to generate the changes. Thanks to @flo-wer for pointing this case out.	2017-10-02 13:09:56 +02:00
Andreas Rammhold	3742095b27	tree-wide: use IN_SET where possible In addition to the changes from #6933 this handles cases that could be matched with the included cocci file.	2017-10-02 13:09:54 +02:00
Tommi Rantala	10e8445bcc	journal: add missing le64toh() calls in journal_file_check_object() Lennart Poettering noticed missing le64toh() calls.	2017-09-24 11:56:52 +03:00
Tommi Rantala	24754f3694	journal: add object sanity check to journal_file_move_to_object() Introduce journal_file_check_object(), which does lightweight object sanity checks, and use it in journal_file_move_to_object(), so that we will catch certain corrupted objects in the journal file. This fixes #6447, where we had only partially written out OBJECT_ENTRY (ObjectHeader written, but rest of object zero bytes), causing "journalctl --list-boots" to fail. $ builddir.vanilla/journalctl --list-boots -D bug6447/ Failed to determine boots: No data available $ builddir.patched/journalctl --list-boots -D bug6447/ -52 22633da1c5374a728d6c215e2c301dc2 Mon 2017-07-10 05:29:21 EEST—Mon 2017-07-10 05:31:51 EEST -51 2253aab9ea7e4a2598f2abda82939eff Mon 2017-07-10 05:32:22 EEST—Mon 2017-07-10 05:36:49 EEST -50 ef0d85d35c74486fa4104f9d6391b6ba Mon 2017-07-10 05:40:33 EEST—Mon 2017-07-10 05:40:40 EEST [...] Note that journal_file_check_object() is similar to journal_file_object_verify(). The most expensive checks are omitted, as they would slow down every journal_file_move_to_object() call too much. With this implementation, the added overhead is small, for example when dumping some journal content to /dev/null (built with -Dbuildtype=debugoptimized -Db_ndebug=true): Performance counter stats for 'builddir.vanilla/journalctl -D 76f4d4c3406945f9a60d3ca8763aa754/': 12542,311634 task-clock:u (msec) # 1,000 CPUs utilized 0 context-switches:u # 0,000 K/sec 0 cpu-migrations:u # 0,000 K/sec 80 100 page-faults:u # 0,006 M/sec 41 786 963 456 cycles:u # 3,332 GHz 105 453 864 770 instructions:u # 2,52 insn per cycle 24 342 227 334 branches:u # 1940,809 M/sec 105 709 217 branch-misses:u # 0,43% of all branches 12,545199291 seconds time elapsed Performance counter stats for 'builddir.patched/journalctl -D 76f4d4c3406945f9a60d3ca8763aa754/': 12734,723233 task-clock:u (msec) # 1,000 CPUs utilized 0 context-switches:u # 0,000 K/sec 0 cpu-migrations:u # 0,000 K/sec 80 693 page-faults:u # 0,006 M/sec 42 661 017 429 cycles:u # 3,350 GHz 107 696 985 865 instructions:u # 2,52 insn per cycle 24 950 526 745 branches:u # 1959,252 M/sec 101 762 806 branch-misses:u # 0,41% of all branches 12,737527327 seconds time elapsed Fixes #6447.	2017-09-22 10:32:20 +03:00
Vito Caputo	b439282e0b	journal: avoid unnecessary mmap_cache_get() calls journal_file_move_to_object() can skip the second journal_file_move_to() call if the first one already mapped a sufficiently large area. Now that mmap_cache_get() returns the size of the mapped area when asked, ask for the size and only perform the second call if the required size exceeds the mapped size instead of the object header size. This results in a nice performance boost in my testing, even with a corpus of many small logs burning much CPU time elsewhere: Before: # time ./journalctl -b -1 --no-pager > /dev/null real 0m16.330s user 0m16.281s sys 0m0.046s # time ./journalctl -b -1 --no-pager > /dev/null real 0m16.409s user 0m16.358s sys 0m0.048s # time ./journalctl -b -1 --no-pager > /dev/null real 0m16.625s user 0m16.558s sys 0m0.061s After: # time ./journalctl -b -1 --no-pager > /dev/null real 0m15.311s user 0m15.257s sys 0m0.046s # time ./journalctl -b -1 --no-pager > /dev/null real 0m15.201s user 0m15.135s sys 0m0.062s # time ./journalctl -b -1 --no-pager > /dev/null real 0m15.170s user 0m15.113s sys 0m0.053s	2017-07-12 23:59:29 -07:00
Vito Caputo	b42549ad69	journal: return mapped size from mmap_cache_get() If requested, return the actual mapping size to the caller in addition to the address. journal_file_move_to_object() often performs two successive mmap_cache_get() calls via journal_file_move_to(); one to get the object header, then another to get the entire object when it's larger than the header's size. If mmap_cache_get() returned the actual mapping's size, it's probable that the second mmap_cache_get() could be skipped when the established mapping already encompassed the desired size.	2017-07-12 23:58:48 -07:00
Vito Caputo	be7cdd8ec9	journal: explicitly add fds to mmap-cache (#6307) This way we have a MMapFileDescriptor reference external to the cache, and can supply the handle directly to mmap_cache_get(), eliminating hashmap lookups entirely from the hot path.	2017-07-10 19:24:56 -04:00
Yusuke Nojima	5b3cc0c86a	journald: fix assertion failure on journal_file_link_data. (#5843) When some error occurs during the initialization of JournalFile, the JournalFile can be left without hash tables created. When later trying to append an entry to that file, the assertion in journal_file_link_data() fails, and journald crashes. This patch fixes this issue by checking *_hash_table_size in journal_file_verify_header().	2017-04-29 19:37:53 +02:00
Tobias Stoeckmann	6f94e420e8	journal: prevent integer overflow while validating header (#5569) It is possible to overflow uint64_t while validating the header of a journal file. To prevent this, the addition itself is checked to be within the limits of UINT64_MAX first. To keep this readable, I have introduced two stack variables which hold the converted values during validation.	2017-03-13 08:14:42 +01:00
AsciiWolf	13e785f7a0	Fix missing space in comments (#5439)	2017-02-24 18:14:02 +01:00
Lennart Poettering	486b3d08db	Merge pull request #5204 from keszybz/masked-warning-cleanup Cleanup of error code mismatch for masked units	2017-02-02 11:47:30 +01:00
Zbigniew Jędrzejewski-Szmek	b288cdeb2d	Consistently use ERFKILL for masked units `76ec966f0e` changed the code from ESHUTDOWN to ERFKILL, but missed one spot in bus-common-errors.c. Fix that. The code in transaction.c was checking for ERFKILL, but I'm not sure if this mismatch had any effect, i.e. if there were any code paths in which the wrong code actually made a difference. Also add comments when ESHUTDOWN is used in the journal code, so it's easy to distinguish those cases when grepping. Standardize on the same capitalization. (There's also a bunch of uses in sd-bus.c, but that's clearly different.)	2017-02-01 19:47:23 -05:00
Lennart Poettering	ef2f4f911b	Merge pull request #5151 from keszybz/journal-flags More information about unsupported journal file flags	2017-02-02 01:01:45 +01:00
Zbigniew Jędrzejewski-Szmek	869a3458cb	Merge pull request #5191 from keszybz/tweaks	2017-02-01 10:27:32 -05:00
Zbigniew Jędrzejewski-Szmek	a6c5909665	Revert "Trivial typo fixes and code refactorings (#5191)" Let's do a merge to preserve all the commit messages. This reverts commit `785d345145`.	2017-02-01 10:26:50 -05:00
Zbigniew Jędrzejewski-Szmek	785d345145	Trivial typo fixes and code refactorings (#5191) * logind: trivial simplification free_and_strdup() handles NULL arg, so make use of that. * boot: fix two typos * pid1: rewrite check in ignore_proc() to not check condition twice It's harmless, but it seems nicer to evaluate a condition just a single time. * core/execute: reformat exec_context_named_iofds() for legibility * core/execute.c: check asprintf return value in the usual fashion This is unlikely to fail, but we cannot rely on asprintf return value on failure, so let's just be correct here. CID #1368227. * core/timer: use (void) CID #1368234. * journal-file: check asprintf return value in the usual fashion This is unlikely to fail, but we cannot rely on asprintf return value on failure, so let's just be correct here. CID #1368236. * shared/cgroup-show: use (void) CID #1368243. * cryptsetup: do not return uninitialized value on error CID #1368416.	2017-02-01 15:04:27 +01:00
Zbigniew Jędrzejewski-Szmek	ec251fe7d5	tree-wide: adjust fall through comments so that gcc is happy gcc 7 adds -Wimplicit-fallthrough=3 to -Wextra. There are a few ways we could deal with that. After we take into account the need to stay compatible with older versions of the compiler (and other compilers), I don't think adding __attribute__((fallthrough)), even as a macro, is worth the trouble. It sticks out too much, a comment is just as good. But gcc has some very specific requirements for how the comment should look. Adjust it to the specific form that it likes. I don't think the extra stuff we had in those comments was adding much value. (Note: the documentation seems to be wrong, and seems to describe a different pattern from the one that is actually used. I guess either the docs or the code will have to change before gcc 7 is finalized.)	2017-01-31 14:04:55 -05:00
Zbigniew Jędrzejewski-Szmek	7645c77b9b	journal-file: check asprintf return value in the usual fashion This is unlikely to fail, but we cannot rely on asprintf return value on failure, so let's just be correct here. CID #1368236.	2017-01-31 11:41:46 -05:00
Zbigniew Jędrzejewski-Szmek	4761fd0ffb	journal-file, journalctl: provide better hint about unsupported features https://bugzilla.redhat.com/show_bug.cgi?id=1416201 $ journalctl -b Journal file /var/log/journal/ad18f69b80264b52bb3b766240742383/system@0005467d92e23784-a6571c8b69d09124.journal~ uses an unsupported feature, ignoring file. Use SYSTEMD_LOG_LEVEL=debug journalctl --file=/var/log/journal/ad18f69b80264b52bb3b766240742383/system@0005467d92e23784-a6571c8b69d09124.journal~ to see the details. -- No entries -- $ journalctl --file=/var/log/journal/ad18f69b80264b52bb3b766240742383/system@0005467d92e23784-a6571c8b69d09124.journal~ Journal file /var/log/journal/ad18f69b80264b52bb3b766240742383/system@0005467d92e23784-a6571c8b69d09124.journal~ uses incompatible flag lz4-compressed disabled at compilation time. Failed to open journal file /var/log/journal/ad18f69b80264b52bb3b766240742383/system@0005467d92e23784-a6571c8b69d09124.journal~: Protocol not supported mmap cache statistics: 0 hit, 1 miss Failed to open files: Protocol not supported	2017-01-24 19:19:33 -05:00
Zbigniew Jędrzejewski-Szmek	4214009f8a	journal-file: factor out helper function In preparation for later changes.	2017-01-24 19:00:23 -05:00
Zbigniew Jędrzejewski-Szmek	6b430fdb7c	tree-wide: use mfree more	2016-10-16 23:35:39 -04:00
Lennart Poettering	ae739cc1ed	journal: refuse opening journal files from the future for writing Never permit that we write to journal files that have newer timestamps than our local wallclock has. If we'd accept that, then the entries in the file might end up not being ordered strictly. Let's refuse this with ETXTBSY, and then immediately rotate to use a new file, so that each file remains strictly ordered also by wallclock internally.	2016-10-12 20:25:20 +02:00
Lennart Poettering	989793d341	journal: when iterating through entry arrays and we hit an invalid one keep going When iterating through partially synced journal files we need to be prepared for hitting invalid entries (specifically: non-initialized). Instead of generating an error and giving up, let's simply try to proceed with the next one that is valid (and debug log about this). This reworks the logic introduced with `caeab8f626` to iteration in both directions, and tries to look for valid entries located after the invalid one. It also extends the behaviour to both iterating through the global entry array and per-data object entry arrays. Fixes: #4088	2016-10-12 20:25:20 +02:00
Lennart Poettering	1c69f0966a	journal: add an explicit check for uninitialized objects Let's make dissecting of borked journal files more expressive: if we encounter an object whose first 8 bytes are all zeroes, then let's assume the object was simply never initialized, and say so. Previously, this would be detected as "overly short object", which is true too in a way, but it's a lot more helpful printing different debug options for the case where the size is not initialized at all and where the size is initialized to some bogus value. No functional behaviour change, only different log messages for both cases.	2016-10-12 20:25:20 +02:00
Lennart Poettering	ded5034e7a	journal: also check that our entry arrays are properly ordered Let's add an extra check, reusing check_properly_ordered() also for journal_file_next_entry_for_data().	2016-10-12 20:25:20 +02:00
Lennart Poettering	b6da4ed045	journal: split out check for properly ordered arrays into its own function This adds a new call check_properly_ordered(), which we can reuse later, and makes the code a bit more readable.	2016-10-12 20:25:20 +02:00
Lennart Poettering	aa598ba5b6	journal: split out array index inc/dec code into a new call bump_array_index() This allows us to share a bit more code between journal_file_next_entry() and journal_file_next_entry_for_data().	2016-10-12 20:25:20 +02:00
Lennart Poettering	202fd896e5	journal: when we encounter a broken journal file, add some debug logging Let's make it easier to figure out when we see an invalid journal file, why we consider it invalid, and add some minimal debug logging for it. This log output is normally not seen (after all, this all is library code), unless debug logging is explicitly turned on.	2016-10-12 20:25:20 +02:00
Franck Bui	33685a5a3a	journal: fix HMAC calculation when appending a data object Since commit `5996c7c295` (v190 !), the calculation of the HMAC is broken because the hash for a data object including a field is done in the wrong order: the field object is hashed before the data object is. However during verification, the hash is done in the opposite order as objects are scanned sequentially.	2016-09-23 14:59:51 +02:00
Franck Bui	43cd879483	journal: warn when we fail to append a tag to a journal We shouldn't silently fail when appending the tag to a journal file since FSS protection will simply be disabled in this case.	2016-09-23 14:59:00 +02:00
Torstein Husebø	f8e2f4d6a0	treewide: fix typos (#3187)	2016-05-04 11:26:17 +02:00
Lennart Poettering	a67d68b848	tree-wide: fix invocations of chattr_path() chattr_path() takes two bitmasks, and no booleans. Fix the various invocations to do this properly.	2016-05-02 11:15:30 +02:00
Lennart Poettering	1fcefd8815	journal-file: when rotating a journal file, fsync directory too As suggested by: https://github.com/systemd/systemd/pull/3126#discussion_r61125474	2016-04-29 12:24:09 +02:00
Lennart Poettering	a0fe2a2d20	journal: when creating a new journal file, fsync() the directory it is created in too Fixes: #2831	2016-04-29 12:23:34 +02:00
Vito Caputo	8eb851711f	journal: set STATE_ARCHIVED as part of offlining (#2740) The only code path which makes a journal durable is via journal_file_set_offline(). When we perform a rotate the journal's header->state is being set to STATE_ARCHIVED prior to journal_file_set_offline() being called. In journal_file_set_offline(), we short-circuit the entire offline when f->header->state != STATE_ONLINE. This all results in none of the journal_file_set_offline() fsync() calls being reached when rotate archives a journal, so archived journals are never explicitly made durable. What we do now is instead of setting the f->header->state to STATE_ARCHIVED directly in journal_file_rotate() prior to journal_file_close(), we set an archive flag in f->archive for the journal_file_set_offline() machinery to honor by committing STATE_ARCHIVED instead of STATE_OFFLINE when set. Prior to this, rotated journals were never getting fsync() explicitly performed on them, since journal_file_set_offline() short-circuited. Obviously this is undesirable, and depends entirely on the underlying filesystem as to how much durability was achieved when simply closing the file. Note that this problem existed prior to the recent asynchronous fsync changes, but those changes do facilitate our performing this durable offline on rotate without blocking, regardless of the underlying filesystem sync-on-close semantics.	2016-04-27 08:29:43 +02:00
Lennart Poettering	bee6a29198	journal-file: make seeking in corrupted files work Previously, when we used a bisection table for seeking through a corrupted file, and the end of the bisection table was corrupted we'd most likely fail the entire seek operation. Improve the situation: if we encounter invalid entries in a bisection table, linearly go backwards until we find a working entry again.	2016-04-26 12:00:49 +02:00
Lennart Poettering	caeab8f626	journal-file: when iterating through a partly corrupted journal file, treat error like EOF When we linearly iterate through a corrupted journal file, and we encounter a read error, don't consider this fatal, but merely as EOF condition (and log about it).	2016-04-26 12:00:49 +02:00
Lennart Poettering	bd30fdf213	journal-file: always generate the same error when encountering corrupted files Let's make sure EBADMSG is the one error we throw when we encounter corrupted data, so that we can neatly test for it.	2016-04-26 12:00:03 +02:00
Lennart Poettering	50809d7a9c	sd-journal: detect earlier if we try to read an object from an invalid offset Specifically, detect early if we try to read from offset 0, i.e. are using uninitialized offset data.	2016-04-26 12:00:02 +02:00
Zbigniew Jędrzejewski-Szmek	47005cf1cf	Merge pull request #3109 from poettering/journal-by-fd rework "journalctl -M"	2016-04-25 15:57:36 -04:00
Zbigniew Jędrzejewski-Szmek	61837e19c6	Merge pull request #3114 from poettering/journalctl-b Fix endless loops in journalctl --list-boots (closes #617).	2016-04-25 15:56:17 -04:00
Vito Caputo	b8f99e27e1	journal: fix already offline check and thread leak (#2810) Early in journal_file_set_offline() f->header->state is tested to see if it's != STATE_ONLINE, and since there's no need to do anything if the journal isn't online, the function simply returned here. Since moving part of the offlining process to a separate thread, there are two problems here: 1. We can't simply check f->header->state, because if there is an offline thread active it may modify f->header->state. 2. Even if the journal is deemed offline, the thread responsible may still need joining, so a bare return may leak the thread's resources like its stack. To address #1, the helper journal_file_is_offlining() is called prior to accessing f->header->state. If journal_file_is_offlining() returns true, f->header->state isn't even checked, because an offlining journal is obviously online, and we'll just continue with the normal set offline code path. If journal_file_is_offlining() returns false, then it's safe to check f->header->state, because the offline_state is beyond the point of modifying f->header->state, and there's a memory barrier in the helper. If we find f->header->state is != STATE_ONLINE, then we call the idempotent journal_file_set_offline_thread_join() on the way out of the function, to join a potential lingering offline thread.	2016-04-25 19:58:16 +02:00
Lennart Poettering	0808b92f02	journalctl: improve output of --header a bit Show the various timestamps in hexadecimal too. This is useful for matching the timestamps included in cursor strings (which are encoded in hex, too), with the references in the journal header.	2016-04-25 18:06:47 +02:00
Lennart Poettering	5d1ce25728	sd-journal: add API for opening journal files or directories by fd Also, expose this via the "journalctl --file=-" syntax for STDIN. This feature remains undocumented though, as it is probably not too useful in real-life as this still requires fds that support mmaping and seeking, i.e. does not work for pipes, for which reading from STDIN is most commonly used.	2016-04-25 15:24:46 +02:00
Lennart Poettering	d971033f6b	Merge pull request #2708 from vcaputo/journal-restore-offline-state-on-error journal: restore offline state on error	2016-02-23 16:55:16 +01:00
Vito Caputo	313cefa1d9	tree-wide: make ++/-- usage consistent WRT spacing Throughout the tree there's spurious use of spaces separating ++ and -- operators from their respective operands. Make ++ and -- operator consistent with the majority of existing uses; discard the spaces.	2016-02-22 20:32:04 -08:00
Vito Caputo	ec9ffa2cdd	journal: restore offline state on error If we fail to create the thread, technically we should leave the offline_state as OFFLINE_JOINED, not OFFLINE_SYNCING.	2016-02-22 20:00:13 -08:00
Vito Caputo	b58c888f30	journal: defer journal closes on rotate When we rotate journals, we must set offline and close the current one, but don't generally need to wait for this to complete. Instead, we'll initiate an asynchronous offline via journal_file_set_offline(oldfile, false), and add the file to a per-server set of deferred closes to be closed later when they won't block. There's one complication however; journal_file_open() via journal_file_verify_header() assumes that any writable journal in the online state is the product of an unclean shutdown or other form of corruption. Thus there's a need for journal_file_open() to be aware of deferred closes and synchronize with their completion when opening preexisting journals for writing. To facilitate this the deferred closes set is supplied to the journal_file_open() function where the deferred closes may be closed synchronously before verifying the header in such circumstances.	2016-02-19 18:50:20 -08:00
Vito Caputo	ac2e41f510	journal: asynchronous journal_file_set_offline() This adds a wait flag to journal_file_set_offline(), when false the offline is performed asynchronously in a separate thread. When wait is true, if an asynchronous offline is already in-progress it is restarted and waited for. Otherwise the offline is performed synchronously without the use of a thread. journal_file_set_online() cancels or waits for the asynchronous offline to complete if in-flight, depending on where in the offline process the thread happens to be. If the thread is in the fsync() phase, it is cancelled and waiting is unnecessary. Otherwise, the thread is joined before proceeding. A new offline_state member is added to JournalFile which is used via atomic operations for communicating between the offline thread and the journal_file_set_{offline,online}() functions.	2016-02-19 18:50:20 -08:00
Vito Caputo	69a3a6fd3d	journal: add void cast to journal_file_close() calls	2016-02-19 18:50:16 -08:00
Vito Caputo	fb42603752	journal: add void cast to fsync() calls	2016-02-19 16:54:19 -08:00
Lennart Poettering	91ba5ac7d0	Merge pull request #2589 from keszybz/resolve-tool-2 Better support of OPENPGPKEY, CAA, TLSA packets and tests	2016-02-13 11:15:41 +01:00
Zbigniew Jędrzejewski-Szmek	75f32f047c	Add memcpy_safe ISO/IEC 9899:1999 §7.21.1/2 says: Where an argument declared as size_t n specifies the length of the array for a function, n can have the value zero on a call to that function. Unless explicitly stated otherwise in the description of a particular function in this subclause, pointer arguments on such a call shall still have valid values, as described in 7.1.4. In base64_append_width memcpy was called as memcpy(x, NULL, 0). GCC 4.9 started making use of this and assumes This worked fine under -O0, but does something strange under -O3. This patch fixes a bug in base64_append_width(), fixes a possible bug in journal_file_append_entry_internal(), and makes use of the new function to simplify the code in other places.	2016-02-11 13:07:02 -05:00
Daniel Mack	b26fa1a2fb	tree-wide: remove Emacs lines from all files This should be handled fine now by .dir-locals.el, so no need to carry that stuff in every file.	2016-02-10 13:41:57 +01:00
Klearchos Chaloulos	ecb6105a1b	journal: Drop monotonicity check when appending to journal file Remove the check that triggers rotation of the journal file when the arriving log entry has a monotonic timestamp smaller than that of the previous log entry. This check causes unnecessary rotations when journal-remote is receiving from multiple senders, since monotonicity cannot be guaranteed in that case. Also, it does not offer any useful functionality for systemd-journald.	2016-02-09 12:14:54 +02:00
Vito Caputo	31981791c5	journal: add missing space to switch statement	2016-02-06 03:51:14 -08:00
Vito Caputo	90d222c190	journal: add asserts on f->(data|field)_hash_table Functions dereferencing these members should assert their non-NULL state.	2016-02-05 07:43:46 -08:00
Vito Caputo	c88cc6af70	journal: add asserts for f->header Just some additional asserts in functions dereferencing f->header.	2016-02-05 07:43:46 -08:00
Lennart Poettering	e167d7fd8d	journald: minor fixes This primarily contains some minor coding style fixups for `7a24f3bf2f` and earlier changes. Specifically: * Don't log at log levels above LOG_DEBUG from "library" code like journal-file.c * Don't negate errno values before passing them to log_debug_errno(), as the call can handle this fine anyway * Cast some calls we knowingly ignore the return values of to (void) * Don't clobber function call-by-ref return values on failure * Don't mix function calls and variable declarations in one line There's also one more relevant change: when failing to enqueue a journal change fs event, we'll run it immediately.	2016-01-26 14:13:30 +01:00
Zbigniew Jędrzejewski-Szmek	9d5a981398	Merge pull request #2318 from vcaputo/coalesce-ftruncates-redux journal: coalesce ftruncate()s in 250ms windows	2016-01-23 22:09:51 -05:00
Vito Caputo	7a24f3bf2f	journal: coalesce ftruncate()s in 250ms windows Prior to this change every journal append causes an ftruncate() for the sake of inotify propagation of the mmap-based writes. With this change the notification is deferred up to ~250ms, coalescing any repeated journal writes during the deferred period into a single ftruncate(). The ftruncate() call isn't free and doing it on every append adds unnecessary overhead and latency in the journald event loop. Introduces journal_file_enable_post_change_timer() which manages a timer on the provided sd-event instance for scheduling coalesced ftruncates. The ftruncate() behavior is unchanged unless journal_file_enable_post_change_timer() is called on the JournalFile. While not a tremendous improvement, profiling systemd-journald event loop latencies using instrumentation as introduced by `34b8751` it was observed that coalescing the ftruncates was low-hanging fruit worth pursuing. Note orders 12 and 13 shifting left into order 11 and order 6 dipping into order 5:
Unmodified:
log2(us)          1    2    3    4    5    6    7    8    9   10   11   12   13   14   15   16   17   18   19
-------------------------------------------------------------------------------------------------------------
[10685.414572]    0    0    0    0   38  602   61    2  290   60 1643 2554   13    1    4    1    0    0    1
[10690.415114]    0    0    0    0    0  646   54    7  309   44 2073 2148   17    1    3    0    0    0    1
[10695.415509]    0    0    0    0    1  650   73    3  324   37 2071 2270    9    0    0    1    0    1    0
[10700.416297]    0    0    0    0    0  659   50    4  318   38 2111 2152    6    0    1    0    0    1    1
[10705.417136]    0    0    0    0    2  660   48    4  320   38 2129 2146   12    1    1    0    0    1    1
[10710.489114]    0    0    0    0    0  673   38    3  321   37 1925 2339    7    0    0    0    0    1    1
[10715.489613]    0    0    0    0    3  656   64    8  317   48 2365 2007    7    0    0    0    0    0    1
Coalesced:
log2(us)          1    2    3    4    5    6    7    8    9   10   11   12   13   14   15   16   17   18   19
-------------------------------------------------------------------------------------------------------------
[ 6169.161360]    0    0    0    1   24  786   54   11  389   24 4192  771    6    4    0    0    1    0    1
[ 6174.161705]    0    0    0    1   18  800   35    6  380   27 3977  893    3    1    0    0    1    0    1
[ 6179.162741]    0    0    0    1   28  768   51    4  391   16 3998  831    5    3    0    0    0    0    2
[ 6184.162856]    0    0    0    0   19  770   60    2  376   26 3795 1004    9    5    1    0    1    0    1
[ 6189.163279]    0    0    0    0   28  761   49    7  372   27 3729 1056    3    2    0    0    1    0    1
[ 6194.164255]    0    0    0    0   25  785   49    7  394   19 3996  908    6    3    2    0    0    0    1
[ 6199.164658]    0    0    0    0   29  797   35    5  389   18 3995  898    3    4    1    1    1    0    1
The remaining high-order delays are a result of the synchronous fsyncs in systemd-journald, beyond the scope of this commit.	2016-01-14 16:36:07 -08:00
Lennart Poettering	838c669055	Merge pull request #2158 from keszybz/journal-decompression Journal decompression fixes	2015-12-23 21:31:07 +01:00
Zbigniew Jędrzejewski-Szmek	5d6f46b6bf	journal: add dst_allocated_size parameter for compress_blob compress_blob took src, src_size, dst and dst_size, but dst_size wasn't used as an input parameter with the size of dst, but only as an output parameter. dst was implicitly assumed to be at least src_size-1. This code wasn't wrong, because the only real caller in journal-file.c got it right. But it was misleading, and the tests in test-compress.c got it wrong, and worked only because the output buffer happened to be the same size as the input buffer. So add a separate dst_allocated_size parameter to make it explicit what the size of the buffer is, and to allow the tests to proceed with different output buffer sizes.	2015-12-13 14:54:47 -05:00
Lennart Poettering	f649045c10	journal: make mmap_cache_unref() a NOP when NULL is passed, like all other destructors	2015-12-10 11:35:52 +01:00
Michael Olbrich	16098e9379	journal: reduce minimum journal file size to 512 KiB For low end embedded systems 4 MiB for each journal file is a lot of memory. Journald will use at least 512 KiB even if JOURNAL_FILE_SIZE_MIN is set to less than that so just use 512 KiB.	2015-11-06 12:10:34 +01:00
Zbigniew Jędrzejewski-Szmek	cfb571f30f	journal: return better error for empty files When reading stuff, we should only return EIO when an actual read error occurred, not when we don't like the data for whatever reason. We already return ENODATA for all other kinds of file truncation, hence do the same for the most obvious kind, so that callers know what ENODATA means.	2015-11-03 00:02:00 +01:00
Lennart Poettering	b5efdb8af4	util-lib: split out allocation calls into alloc-util.[ch]	2015-10-27 13:45:53 +01:00
Lennart Poettering	c8b3094de5	util-lib: split out file attribute calls to chattr-util.[ch]	2015-10-27 13:25:56 +01:00
Lennart Poettering	89a5a90cb0	util-lib: split xattr-related calls into xattr-util.[ch]	2015-10-27 13:25:56 +01:00
Lennart Poettering	6bedfcbb29	util-lib: split string parsing related calls from util.[ch] into parse-util.[ch]	2015-10-27 13:25:55 +01:00
Tom Gundersen	7c8871d315	Merge pull request #1654 from poettering/util-lib Various changes to src/basic/	2015-10-25 14:22:43 +01:00
Lennart Poettering	3ffd4af220	util-lib: split out fd-related operations into fd-util.[ch] There are more than enough to deserve their own .c file, hence move them over.	2015-10-25 13:19:18 +01:00

344 commits