When iterating through partially synced journal files we need to be prepared
for hitting invalid entries (specifically: non-initialized ones). Instead of
generating an error and giving up, let's simply try to proceed with the next
entry that is valid (and log about this at debug level).
This reworks the logic introduced with caeab8f626
to work for iteration in both directions, looking for valid entries located
after the invalid one. It also extends the behaviour to cover both iterating
through the global entry array and the per-data-object entry arrays.
Fixes: #4088
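A reduced sketch of the skip-ahead loop described above (all names are
illustrative stand-ins, not the actual journal-file.c code):

    #include <errno.h>
    #include <inttypes.h>
    #include <stdint.h>

    typedef struct JournalFile JournalFile;
    typedef struct Object Object;
    extern int move_to_entry_by_index(JournalFile *f, uint64_t i, Object **o);
    extern void log_debug(const char *format, ...);

    /* Try entries starting at index i; skip invalid ones instead of failing. */
    static int next_valid_entry(JournalFile *f, uint64_t i, uint64_t n, Object **o) {
            for (; i < n; i++) {
                    int r = move_to_entry_by_index(f, i, o);
                    if (r == -EBADMSG) {
                            log_debug("Entry item %" PRIu64 " invalid, skipping.", i);
                            continue;
                    }
                    return r; /* valid entry, or a real error */
            }
            return 0; /* ran out of entries */
    }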
Let's make dissecting of borked journal files more expressive: if we encounter
an object whose first 8 bytes are all zeroes, then let's assume the object was
simply never initialized, and say so.
Previously, this would be detected as "overly short object", which is true too
in a way, but it's a lot more helpful to print different debug messages for the
case where the size is not initialized at all and the case where the size is
initialized to some bogus value.
No functional behaviour change, only different log messages for the two cases.
Let's make it easier to figure out when we see an invalid journal file, why we
consider it invalid, and add some minimal debug logging for it.
This log output is normally not seen (after all, this all is library code),
unless debug logging is explicitly turned on.
Since commit 5996c7c295 (v190!), the
calculation of the HMAC is broken because the hash for a data object
including a field is done in the wrong order: the field object is
hashed before the data object is.
However, during verification, the hash is computed in the opposite order, as
objects are scanned sequentially.
The only code path which makes a journal durable is via
journal_file_set_offline().
When we perform a rotate the journal's header->state is being set to
STATE_ARCHIVED prior to journal_file_set_offline() being called.
In journal_file_set_offline(), we short-circuit the entire offline when
f->header->state != STATE_ONLINE.
This all results in none of the journal_file_set_offline() fsync() calls
being reached when rotate archives a journal, so archived journals are
never explicitly made durable.
What we do now is instead of setting the f->header->state to
STATE_ARCHIVED directly in journal_file_rotate() prior to
journal_file_close(), we set an archive flag in f->archive for the
journal_file_set_offline() machinery to honor by committing
STATE_ARCHIVED instead of STATE_OFFLINE when set.
Prior to this, rotated journals were never getting fsync() explicitly
performed on them, since journal_file_set_offline() short-circuited.
Obviously this is undesirable, and it depends entirely on the underlying
filesystem how much durability was achieved by simply closing the file.
Note that this problem existed prior to the recent asynchronous fsync
changes, but those changes do facilitate our performing this durable
offline on rotate without blocking, regardless of the underlying
filesystem sync-on-close semantics.
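In heavily reduced form the handover looks like this (types are minimal
stand-ins; the real journal-file.c fields differ in detail):

    #include <stdbool.h>
    #include <stdint.h>
    #include <unistd.h>

    enum { STATE_OFFLINE, STATE_ONLINE, STATE_ARCHIVED };

    typedef struct JournalFile {
            int fd;
            bool archive; /* set by journal_file_rotate() instead of
                           * writing STATE_ARCHIVED to the header directly */
            struct { uint8_t state; } *header;
    } JournalFile;

    /* The offline machinery commits the final state and makes it durable: */
    static void offline_commit(JournalFile *f) {
            f->header->state = f->archive ? STATE_ARCHIVED : STATE_OFFLINE;
            (void) fsync(f->fd);
    }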
Previously, when we used a bisection table for seeking through a corrupted
file, and the end of the bisection table was corrupted we'd most likely fail
the entire seek operation. Improve the situation: if we encounter invalid
entries in a bisection table, linearly go backwards until we find a working
entry again.
When we linearly iterate through a corrupted journal file, and we encounter a
read error, don't consider this fatal, but merely an EOF condition (and log
about it).
Early in journal_file_set_offline() f->header->state is tested to see if
it's != STATE_ONLINE, and since there's no need to do anything if the
journal isn't online, the function simply returned here.
Since moving part of the offlining process to a separate thread, there
are two problems here:
1. We can't simply check f->header->state, because if there is an
offline thread active it may modify f->header->state.
2. Even if the journal is deemed offline, the thread responsible may
still need joining, so a bare return may leak the thread's resources
like its stack.
To address #1, the helper journal_file_is_offlining() is called prior to
accessing f->header->state.
If journal_file_is_offlining() returns true, f->header->state isn't even
checked, because an offlining journal is obviously online, and we'll
just continue with the normal set offline code path.
If journal_file_is_offlining() returns false, then it's safe to check
f->header->state, because the offline_state is beyond the point of
modifying f->header->state, and there's a memory barrier in the helper.
If we find f->header->state is != STATE_ONLINE, then we call the
idempotent journal_file_set_offline_thread_join() on the way out of the
function, to join a potential lingering offline thread.
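In reduced form, the resulting entry check looks roughly like this (a sketch
using the helper names from the description; surrounding code elided):

    int journal_file_set_offline(JournalFile *f, bool wait) {
            /* Only look at f->header->state once we know no offline
             * thread can still be modifying it (the helper contains the
             * memory barrier mentioned above): */
            if (!journal_file_is_offlining(f) &&
                f->header->state != STATE_ONLINE)
                    /* Already offline, but a finished thread may still
                     * need joining so its resources aren't leaked: */
                    return journal_file_set_offline_thread_join(f);

            /* ... continue with the normal set-offline path ... */
    }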
Show the various timestamps in hexadecimal too. This is useful for matching the
timestamps included in cursor strings (which are encoded in hex, too), with the
references in the journal header.
Also, expose this via the "journalctl --file=-" syntax for STDIN. This feature
remains undocumented though, as it is probably not too useful in real life: it
still requires fds that support mmapping and seeking, i.e. it does not work
for pipes, for which reading from STDIN is most commonly used.
Throughout the tree there's spurious use of spaces separating ++ and --
operators from their respective operands. Make ++ and -- operator
consistent with the majority of existing uses; discard the spaces.
When we rotate journals, we must set offline and close the current one,
but don't generally need to wait for this to complete.
Instead, we'll initiate an asynchronous offline via
journal_file_set_offline(oldfile, false), and add the file to a
per-server set of deferred closes to be closed later when they
won't block.
There's one complication, however: journal_file_open() via
journal_file_verify_header() assumes that any writable journal in the
online state is the product of an unclean shutdown or other form of
corruption.
Thus there's a need for journal_file_open() to be aware of deferred
closes and synchronize with their completion when opening preexisting
journals for writing. To facilitate this the deferred closes set is
supplied to the journal_file_open() function where the deferred closes
may be closed synchronously before verifying the header in such
circumstances.
This adds a wait flag to journal_file_set_offline(); when false, the offline is
performed asynchronously in a separate thread.
When wait is true, if an asynchronous offline is already in-progress it is
restarted and waited for. Otherwise the offline is performed synchronously
without the use of a thread.
journal_file_set_online() cancels or waits for the asynchronous offline to
complete if in-flight, depending on where in the offline process the thread
happens to be. If the thread is in the fsync() phase, it is cancelled and
waiting is unnecessary. Otherwise, the thread is joined before proceeding.
A new offline_state member is added to JournalFile which is used via
atomic operations for communicating between the offline thread and the
journal_file_set_{offline,online}() functions.
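The states could look roughly like this (names are illustrative; the actual set
in the source may differ), with transitions done via atomic compare-and-exchange:

    #include <stdbool.h>

    typedef enum OfflineState {
            OFFLINE_JOINED,    /* no offline thread, nothing to join */
            OFFLINE_SYNCING,   /* thread is fsync()ing journal contents */
            OFFLINE_OFFLINING, /* thread is committing the offline state */
            OFFLINE_CANCEL,    /* set_online() asked the thread to abort */
            OFFLINE_DONE,      /* thread finished and awaits joining */
    } OfflineState;

    /* e.g. set_online() may cancel an in-flight offline, but only while
     * it is still in the fsync() phase: */
    static bool try_cancel_offline(OfflineState *state) {
            OfflineState expected = OFFLINE_SYNCING;
            return __atomic_compare_exchange_n(state, &expected,
                                               OFFLINE_CANCEL, false,
                                               __ATOMIC_SEQ_CST,
                                               __ATOMIC_SEQ_CST);
    }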
ISO/IEC 9899:1999 §7.21.1/2 says:
Where an argument declared as size_t n specifies the length of the array
for a function, n can have the value zero on a call to that
function. Unless explicitly stated otherwise in the description of a
particular function in this subclause, pointer arguments on such a call
shall still have valid values, as described in 7.1.4.
In base64_append_width(), memcpy was called as memcpy(x, NULL, 0). GCC 4.9
started making use of this, assuming the pointer arguments are never NULL.
This worked fine under -O0, but does something strange under -O3.
This patch fixes a bug in base64_append_width(), fixes a possible bug in
journal_file_append_entry_internal(), and makes use of the new function
to simplify the code in other places.
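The guarded wrapper presumably looks something like this (a sketch; the actual
helper added by the patch may differ):

    #include <assert.h>
    #include <stddef.h>
    #include <string.h>

    /* Like memcpy(), but well-defined for n == 0 even if src is NULL: */
    static inline void *memcpy_safe(void *dst, const void *src, size_t n) {
            if (n == 0)
                    return dst;
            assert(src);
            return memcpy(dst, src, n);
    }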
Remove the check that triggers rotation of the journal file when the arriving log entry has a monotonic timestamp smaller than the previous log entry. This check caused unnecessary rotations when journal-remote was receiving from multiple senders, where monotonicity cannot be guaranteed. Also, it does not offer any useful functionality for systemd-journald.
This primarily contains some minor coding style fixups for 7a24f3bf2f and earlier changes. Specifically:
* Don't log at log levels above LOG_DEBUG from "library" code like journal-file.c
* Don't negate errno values before passing them to log_debug_errno(), as the call can handle this fine anyway
* Cast some calls we knowingly ignore the return values of to (void)
* Don't clobber function call-by-ref return values on failure
* Don't mix function calls and variable declarations in one line
There's also one more relevant change: when failing to enqueue a journal change fs event, we'll run it immediately.
compress_blob took src, src_size, dst and *dst_size, but dst_size
wasn't used as an input parameter with the size of dst, but only as an
output parameter. dst was implicitly assumed to be at least src_size-1.
This code wasn't *wrong*, because the only real caller in
journal-file.c got it right. But it was misleading, and the tests in
test-compress.c got it wrong, and worked only because the output
buffer happened to be the same size as the input buffer. So add a separate
dst_allocated_size parameter to make it explicit what the size of the
buffer is, and to allow the tests to proceed with different output buffer
sizes.
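The clarified contract then reads roughly (parameter names approximate):

    #include <stddef.h>
    #include <stdint.h>

    /* dst_allocated_size is an explicit input: the capacity of dst.
     * *dst_size stays an output: the number of compressed bytes. */
    int compress_blob(const void *src, uint64_t src_size,
                      void *dst, size_t dst_allocated_size,
                      size_t *dst_size);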
For low-end embedded systems 4 MiB for each journal file is a lot of
memory. Journald will use at least 512 KiB even if JOURNAL_FILE_SIZE_MIN is
set to less than that, so just use 512 KiB.
When reading stuff, we should only return EIO when an actual read error
occurred, not when we don't like the data for whatever reason.
We already return ENODATA for all other kinds of file truncation, hence
do the same for the most obvious kind, so that callers know what ENODATA
means.
There are more than enough calls doing string manipulations to deserve
their own file, hence do something about it.
This patch also sorts the #include blocks of all files that needed to be
updated, according to the sorting suggestions from CODING_STYLE. Since
pretty much every file needs our string manipulation functions this
effectively means that most files have sorted #include blocks now.
Also touches a few unrelated include files.
Let's make sure we handle compression errors properly, and don't
mistake an error for success.
Also, let's actually compress things if lz4 is enabled.
Fixes #1662.
Implement a maximum limit on number of journal files to keep around.
Enforcing a limit is useful here, since our performance when viewing pays a
heavy penalty for each additional journal file that has to be interleaved.
This setting is now turned on by default, and set to 100.
Also, actually implement what 348ced9097
promised: use whatever we find on disk at startup as lower bound on how
much disk space we can use. That commit introduced some provisions to
implement this, but actually never did.
This also adds "journalctl --vacuum-files=" to vacuum files on disk by
their number explicitly.
off_t is a really weird type, as it is usually 64bit these days (at least
in sane programs), but could theoretically be 32bit. We don't support
32bit off_t builds though, but still constantly deal with safely
converting from off_t to other types and back, for no benefit.
Hence, never use the type anymore. Always use uint64_t instead. This has
various benefits, including that we can expose these values directly as
D-Bus properties, and also that the values parse the same in all cases.
When a new journal file is created we write the header first, then sync
and only then create the data and field hash tables in them. That means
to other processes it might appear that the files have a valid header
but not data and field hash tables. Our reader code should be able to
deal with this.
With this change we'll no longer map the two hash tables right away after
opening a file for reading (because that would of course fail if
the objects are missing), but delay this until the first time we access
them. On top of that, when we want to look something up in the hash
tables and we notice they aren't initialized yet, we consider them
empty.
This improves handling of some journal files reported in #487.
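The lazy-mapping pattern is roughly the following (a sketch; map_hash_table()
is a hypothetical stand-in for the real mmap helper):

    /* Map the data hash table on first access only: */
    static int journal_file_map_data_hash_table(JournalFile *f) {
            if (f->data_hash_table)
                    return 0; /* already mapped */

            if (f->header->data_hash_table_size == 0)
                    return -ENODATA; /* not written yet; callers treat
                                      * the table as empty */

            return map_hash_table(f,
                                  le64toh(f->header->data_hash_table_offset),
                                  le64toh(f->header->data_hash_table_size),
                                  &f->data_hash_table);
    }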
This way users have the freedom to set or unset the FS_NOCOW_FL flag on
their journal files by setting it on the journal directory. Since our
default tmpfiles configuration now sets this flag on the directory the
flag is set by default on new files, however people can opt-out of this
by masking the tmpfiles file for it.
like:
src/shared/install.c: In function ‘unit_file_lookup_state’:
src/shared/install.c:1861:16: warning: ‘r’ may be used uninitialized in
this function [-Wmaybe-uninitialized]
return r < 0 ? r : state;
^
src/shared/install.c:1796:13: note: ‘r’ was declared here
int r;
^
Logs constantly show
systemd-journald[395]: Failed to set file attributes: Inappropriate ioctl for device
This is because ext4 does not support FS_NOCOW_FL.
[zj: fold into one conditional as suggested on the ML and
fix (preexisting) r/errno confusion in error message.]
Commit 668c965af "journal: skipping of exhausted journal files is bad if
direction changed" fixed a correctness issue, but it also significantly
limited the cases where the optimization that skips exhausted journal
files could apply.
As a result, some journalctl queries are much slower in v219 than in v218.
(e.g. queries where a "--since" cutoff should have quickly eliminated
older journal files from consideration, but didn't.)
If find_location_with_matches() finds no entry already in the initial
iteration, the journal file's location is not updated. This is fine,
except that:
- We must update at least f->last_direction. The optimization relies on
it. Let's separate that from journal_file_save_location() and update
it immediately after the direction checks.
- The optimization was conditional on "f->current_offset > 0", but it
would always be 0 in this scenario. This check is unnecessary for the
optimization.
This patch removes includes that are not used. The removals were found with
include-what-you-use, which checks if any of the symbols from a header are
in use.
Leave it to the compiler to figure out whether it shall inline stuff or
not.
The only place where using static inline is OK is in header files, really.
btrfs' COW logic results in heavily fragmented journal files, which is
detrimental for performance. Hence, turn off COW for journal files as we
create them.
Turning off COW comes at the cost of data integrity guarantees, but this
should be acceptable, given that we do our own checksumming, and
generally have a pretty conservative write pattern.
Also see discussion on linux-btrfs:
http://www.spinics.net/lists/linux-btrfs/msg41001.html
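Setting the flag on a freshly created journal file boils down to the
FS_IOC_SETFLAGS ioctl (a minimal sketch, error handling reduced):

    #include <errno.h>
    #include <linux/fs.h>
    #include <sys/ioctl.h>

    static int fd_set_nocow(int fd) {
            unsigned flags;

            if (ioctl(fd, FS_IOC_GETFLAGS, &flags) < 0)
                    return -errno;

            flags |= FS_NOCOW_FL;
            if (ioctl(fd, FS_IOC_SETFLAGS, &flags) < 0)
                    return -errno; /* e.g. ENOTTY on ext4, which doesn't
                                    * support FS_NOCOW_FL */

            return 0;
    }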
Our write pattern is quite awful for CoW file systems (btrfs...), as we
keep updating file parts in the beginning of the file. This results in
fragmented journal files. Hence: when rotating files, defragment them,
since at that point we know that no further write accesses will be made.
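On btrfs, defragmenting the rotated file is a single ioctl (sketch):

    #include <errno.h>
    #include <linux/btrfs.h>
    #include <sys/ioctl.h>

    static int fd_defrag(int fd) {
            /* Ask btrfs to defragment the now-archived file: */
            if (ioctl(fd, BTRFS_IOC_DEFRAG, NULL) < 0)
                    return -errno;
            return 0;
    }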
Since the file headers might be replaced by zeroed pages now due to SIGBUS
handling, we should make sure we don't end up dividing by zero, as we don't
check values read from journal file headers for changes.
Even though we use fallocate() it appears that file systems like btrfs
will trigger SIGBUS in certain low-disk-space situations. We should
handle that, hence catch the signal, add it to a list of invalidated
pages, and replace the page with an empty memory area. After each write
check if SIGBUS was triggered, and consider the write invalid if it was.
This should make journald a lot more robust with file systems where
fallocate() is not reliable, for example all CoW file systems
(btrfs...), where changing written data can fail with disk full errors.
https://bugzilla.redhat.com/show_bug.cgi?id=1045810
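Heavily reduced, the handler side of the scheme looks like this
(sigbus_remember() is a hypothetical stand-in for the invalidated-page list;
the real code would cache the page size, since sysconf() is not
async-signal-safe):

    #include <signal.h>
    #include <stdint.h>
    #include <sys/mman.h>
    #include <unistd.h>

    extern void sigbus_remember(void *page); /* hypothetical */

    static void sigbus_handler(int sig, siginfo_t *si, void *arg) {
            size_t ps = (size_t) sysconf(_SC_PAGESIZE);
            void *page = (void*) ((uintptr_t) si->si_addr & ~(ps - 1));

            /* Remember which page was invalidated, so the next write
             * can be declared failed... */
            sigbus_remember(page);

            /* ...and map an empty page over it, so that execution can
             * continue: */
            (void) mmap(page, ps, PROT_READ|PROT_WRITE,
                        MAP_PRIVATE|MAP_ANONYMOUS|MAP_FIXED, -1, 0);
    }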
In next_beyond_location() when we find a candidate entry in a journal
file, save its location information in struct JournalFile.
The purpose of remembering the locations of candidate entries is to be
able to save work in the next iteration. This patch does only the
remembering part.
LOCATION_SEEK means the location identifies a candidate entry.
When a winner is picked from among candidates, it becomes
LOCATION_DISCRETE.
LOCATION_TAIL here signifies we've iterated the file to the end (or the
beginning in the case of reversed direction).
try_context() is such a hot path that the hashmap lookup is expensive.
The number of contexts is small - it is the number of object types.
Using a hashmap is overkill. A plain array will do.
Before:
$ time ./journalctl --since=2014-06-01 --until=2014-07-01 > /dev/null
real 0m9.445s
user 0m9.228s
sys 0m0.213s
After:
$ time ./journalctl --since=2014-06-01 --until=2014-07-01 > /dev/null
real 0m5.438s
user 0m5.266s
sys 0m0.170s
Note that numbers 0 and -1 are both replaced with OBJECT_UNUSED,
because they are treated the same everywhere (e.g. type_to_context()
translates them both to 0).
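The replacement is essentially a plain array indexed by (folded) object type,
along these lines (a fragment; _OBJECT_TYPE_MAX follows journal-def.h):

    /* 0 and -1 both land in slot 0 (OBJECT_UNUSED): */
    static unsigned type_to_context(int type) {
            return type > 0 && type < _OBJECT_TYPE_MAX ? (unsigned) type : 0;
    }

    /* ...and the hot lookup becomes an array access instead of a
     * hashmap lookup: */
    Context *c = m->contexts[type_to_context(type)];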
The only user is sd_journal_enumerate_unique() and, as explained in
the previous commit (fed67c38e3 "journal: map objects to context set by
caller, not by actual object type"), the use of them there is now
superfluous. Let's remove them.
This reverts major parts of commits:
ae97089d49 journal: fix access to munmapped memory in
sd_journal_enumerate_unique
06cc69d44c sd-journal: fix sd_journal_enumerate_unique skipping values
Tested with an "--enable-debug" build and "journalctl --list-boots".
It gives the expected number of results. Additionally, if I then revert
the previous commit ("journal: map objects to context set by caller, not
to actual object type"), it crashes with SIGSEGV, as expected.
When the caller of journal_file_move_to_object() specifies type==0,
the object header is at first mapped in context 0. Then after the header
is checked, the whole object is mapped in a context determined by
the actual object type (which is not even range-checked using
type_to_context()). This looks wrong. It should map in the
caller-specified context.
An old comment in sd_journal_enumerate_unique() supports this view:
/* We do not use the type context here, but 0 instead,
* so that we can look at this data object at the same
* time as one on another file */
Clearly the expectation was that the data object will remain mapped
in context 0 without being pushed away by mapping other objects in
context OBJECT_DATA.
I suspect that this was the real bug that got fixed by ae97089d49
"journal: fix access to munmapped memory in sd_journal_enumerate_unique".
In other words, journal_file_object_keep/release are superfluous after
applying this patch.
With DIRECTION_UP (i.e. navigating backwards) in generic_array_bisect() when the
needle was found as the last item in the array, it wasn't actually processed as
a match, resulting in entries being missed.
https://bugs.freedesktop.org/show_bug.cgi?id=86855
If the format string contains %m, clearly errno must have a meaningful
value, so we might as well use log_*_errno to have ERRNO= logged.
Using:
find . -name '*.[ch]' | xargs sed -r -i -e \
's/log_(debug|info|notice|warning|error|emergency)\((".*%m.*")/log_\1_errno(errno, \2/'
Plus some whitespace, linewrap, and indent adjustments.
The order of entries may matter here. Oldest entries are evicted first
when the cache is full.
(Though I don't see anything to rejuvenate entries on cache hits.)
sd_journal_enumerate_unique() will lock its mmap window, by calling
mmap_cache_get with keep_always=true, to prevent it from being released.
This call may return windows that are wider, but compatible with the
parameters provided to it.
This can result in a mismatch where the window to be released cannot
properly be selected, because we have more than one window matching the
parameters of mmap_cache_release. Therefore, introduce a release_cookie
to be used when releasing the window.
https://bugs.freedesktop.org/show_bug.cgi?id=79380
It is redundant to store 'hash' and 'compare' function pointers in
struct Hashmap separately. The functions always comprise a pair.
Store a single pointer to struct hash_ops instead.
systemd keeps hundreds of hashmaps, so this saves a little bit of
memory.
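i.e. roughly this (a sketch; the actual typedefs live in hashmap.h):

    typedef unsigned long (*hash_func_t)(const void *p);
    typedef int (*compare_func_t)(const void *a, const void *b);

    struct hash_ops {
            hash_func_t hash;
            compare_func_t compare;
    };

    /* One shared static pair per key type replaces two separate
     * pointers stored in every Hashmap instance: */
    extern unsigned long string_hash_func(const void *p);
    extern int string_compare_func(const void *a, const void *b);

    static const struct hash_ops string_hash_ops = {
            .hash = string_hash_func,
            .compare = string_compare_func,
    };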
If a file was opened for writing, and then closed immediately without
actually writing any entries, on subsequent opening, it would be
considered "corrupted". This should be totally fine, and even in
read mode, an empty file can become non-empty later on.
Add liblz4 as an optional dependency when requested with --enable-lz4,
and use it in preference to liblzma for journal blob and coredump
compression. To retain backwards compatibility, XZ is used to
decompress old blobs.
Things will function correctly only with lz4-119.
Based on the benchmarks found on the web, lz4 seems to be the best
choice for "quick" compressors atm.
For pkg-config status, see http://code.google.com/p/lz4/issues/detail?id=135.
safe_close() automatically becomes a NOP when a negative fd is passed,
and returns -1 unconditionally. This makes it easy to write lines like
this:
fd = safe_close(fd);
Which will close an fd if it is open, and reset the fd variable
correctly.
By making use of this new scheme we can drop more than 200 lines of code
that were required to test for non-negative fds or to reset the closed fd
variable afterwards.
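A minimal sketch of the helper as described:

    #include <unistd.h>

    static inline int safe_close(int fd) {
            if (fd >= 0)
                    (void) close(fd); /* NOP for negative fds */
            return -1;                /* always -1, ready for reassignment */
    }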
With a corrupted file, we can get in a situation where two entries
in the entry array point to the same object. Then journal_file_next_entry
will find the first one using generic_array_bisect, and try to move to
the second one, but since the address is the same, generic_array_get will
return the first one. journal_file_next_entry ends up in an infinite loop.
https://bugzilla.redhat.com/show_bug.cgi?id=1047039
gcc (4.8.2, arm) does not understand that journal_file_append_field()
will always set 'fo' when it returns 0, so this warning is bogus.
Anyway, fix it by initializing fo = NULL.
In trying to track down a stupid linker bug, I noticed a bunch of
memset() calls that should be using memzero() to make it more "obvious"
that the arguments are correct (i.e. 0 is not the length, but the data to
set). So fix up all current calls to memset(foo, 0, length) to
memzero(foo, length).
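The wrapper itself is trivial, but it makes the argument order unambiguous
(sketch):

    #include <string.h>

    /* Zero l bytes at x; unlike raw memset() there is no way to confuse
     * the value and length arguments: */
    #define memzero(x, l) (memset((x), 0, (l)))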
sd_j_e_u needs to keep a reference to an object while comparing it
with possibly duplicate objects in other files. Because the size of
mmap cache is limited, with enough files and object to compare to,
at some point the object being compared would be munmapped, resulting
in a segmentation fault.
Fix this issue by turning keep_always into a reference count that can
be increased and decreased. Other callers which set keep_always=true
are unmodified: their references are never released but are ignored
when the whole file is closed, which happens at some point. keep_always
is increased in sd_j_e_u and later on released.
While all the libc implementations I know return NULL when memchr's size
parameter is 0, without accessing any memory, passing NULL to memchr is
still invalid:
C11 7.24.1p2: Where an argument declared as "size_t n" specifies the length
of the array for a function, n can have the value zero on a call to that
function. Unless explicitly stated otherwise in the description of a
particular function in this subclause, pointer arguments on such a call
shall still have valid values, as described in 7.1.4. On such a call, a
function that locates a character finds no occurrence, a function that
compares two character sequences returns zero, and a function that copies
characters copies zero characters.
see http://llvm.org/bugs/show_bug.cgi?id=18247
let's just do a single fallocate() as far as possible, and not
distinguish between allocated space and file size.
This way we can save a syscall for each append, which brings quite some
benefit.
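i.e. grow both the file size and the on-disk allocation with one call, roughly
(sketch):

    #include <fcntl.h>
    #include <stdint.h>

    static int file_grow(int fd, uint64_t new_size) {
            /* posix_fallocate() returns positive errno values directly: */
            int r = posix_fallocate(fd, 0, (off_t) new_size);
            return r == 0 ? 0 : -r;
    }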
This extends 62678ded 'efi: never call qsort on potentially
NULL arrays' to all other places where qsort is used and it
is not obvious that the count is non-zero.
Bump the minimal size of the journal so that we can be sure creating the
journal file will always succeed. Previously the minimum size was
smaller than an empty journal file...
I'm assuming that it's fine if a _const_ or _pure_ function
calls assert. It is assumed that the assert won't trigger,
and even if it does, it can only trigger on the first call
with a given set of parameters, and we don't care if the
compiler moves the order of calls.
code in src/shared/macro.h only defined MAX/MIN in case they were not
defined previously. However, the MAX/MIN macros implemented in glibc
are not of the "safe" kind but are defined as:
    #define MIN(a,b) (((a)<(b))?(a):(b))
    #define MAX(a,b) (((a)>(b))?(a):(b))
Avoid nasty side effects by using our own versions instead.
Also fix the warnings derived from this change.
[zj: - modify MAX3 macro to fix warning about _a shadowing _a,
- do bootchart/svg.c too,
- remove unused MIN3.]
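The "safe" variants evaluate each argument exactly once, along these lines
(a sketch using GCC statement expressions, as in macro.h):

    #define MAX(a, b)                       \
            __extension__ ({                \
                    __typeof__(a) _a = (a); \
                    __typeof__(b) _b = (b); \
                    _a > _b ? _a : _b;      \
            })

    /* e.g. MAX(x++, y) now increments x exactly once, while the glibc
     * version above would increment it twice whenever x > y. */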
Add an option to force journal sync with fsync. The default sync timeout is
5 minutes; the interval is configured via the SyncIntervalSec option in
journald.conf. Synced journal files will be marked as OFFLINE.
A manual sync can be performed by sending SIGUSR1.
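Configuration sketch (journald.conf):

    [Journal]
    SyncIntervalSec=5m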
This introduces a new data threshold setting for sd_journal objects
which controls the maximum size of objects to decompress. This
relieves the library from having to decompress full data objects even
if a client program is only interested in the initial part of them.
This speeds up "systemd-coredumpctl" drastically when invoked without
parameters.
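From client code the threshold is set per sd_journal object, e.g. (error
handling elided):

    #include <systemd/sd-journal.h>

    static void example(void) {
            sd_journal *j;

            if (sd_journal_open(&j, 0) < 0)
                    return;

            /* Only the first 64 bytes of each data object are needed: */
            (void) sd_journal_set_data_threshold(j, 64);
            /* ... sd_journal_get_data() now avoids decompressing full
             * objects beyond (roughly) that size ... */

            sd_journal_close(j);
    }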
When traversing entry array chains for a bisection or for retrieving an
item by index we previously always started at the beginning of the
chain. Since we tend to look at the same chains repeatedly, let's cache
where we have been the last time, and maybe we can skip ahead with this
the next time.
This turns most bisections and index lookups from O(log(n)*log(n)) into
O(log(n)). More importantly however, we seek around on disk much less,
which is good to reduce buffer cache and seek times on rotational disks.
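The cache entry essentially remembers where the previous traversal ended
(a sketch; the real struct in journal-file.c may carry more fields):

    #include <stdint.h>

    typedef struct ChainCacheItem {
            uint64_t first; /* offset of the chain's head array: the key */
            uint64_t array; /* offset of the array we last visited */
            uint64_t begin; /* index of the first item in that array */
            uint64_t total; /* number of items in the chain before it */
    } ChainCacheItem;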
The new 'unique' API allows listing all unique field values that a field
specified by a field name can take in all entries of the journal. This
allows answering queries such as "What units logged to the journal?",
"What hosts have logged into the journal?", "Which boot IDs have logged
into the journal?".
Ultimately this allows implementation of tools similar to lastlog based
on journal data.
Note that listing these field values will not work for journal files
created with older journald, as the field values are not indexed in
older files.
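Client-side, the new API is used like this (per the sd-journal API; error
handling elided):

    #include <stdio.h>
    #include <systemd/sd-journal.h>

    int main(void) {
            sd_journal *j;
            const void *data;
            size_t length;

            if (sd_journal_open(&j, 0) < 0)
                    return 1;

            /* "What units logged to the journal?" */
            if (sd_journal_query_unique(j, "_SYSTEMD_UNIT") >= 0)
                    SD_JOURNAL_FOREACH_UNIQUE(j, data, length)
                            printf("%.*s\n", (int) length, (const char*) data);

            sd_journal_close(j);
            return 0;
    }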