We have the problem that many early boot or late shutdown issues are harder
to solve than they could be because we have no logs. When journald is not
running, messages are redirected to /dev/kmsg. It is also the time when many
things happen in rapid succession, so we tend to hit the kernel printk
ratelimit fairly reliably. The end result is that we get no logs from the time
when they would be most useful. Thus let's disable the kernel's ratelimit.
Once the system is up and running, the ratelimit is not a problem: during
normal runtime things log to journald rather than /dev/kmsg, so the ratelimit
neither gets in the way nor does anything useful. Hence, there doesn't seem to
be much point in trying to restore it after boot is finished and journald is up
and running.
See kernel's commit 750afe7babd117daabebf4855da18e4418ea845e for the
description of the kernel interface. Our setting has lower precedence than
explicit configuration on the kernel command line.
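A minimal sketch of what flipping that knob can look like, writing "on" to the
kernel.printk_devkmsg sysctl described in the kernel commit above (error
handling simplified, function name illustrative):

#include <fcntl.h>
#include <unistd.h>

/* Disable ratelimiting of /dev/kmsg writers. If printk.devkmsg= was given
 * on the kernel command line, the kernel rejects this write, which is how
 * the lower precedence comes about. */
static int disable_kmsg_ratelimit(void) {
        int fd = open("/proc/sys/kernel/printk_devkmsg", O_WRONLY|O_CLOEXEC);
        if (fd < 0)
                return -1;

        if (write(fd, "on\n", 3) != 3) {
                close(fd);
                return -1;
        }

        return close(fd);
}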
Traditionally, user logins had a $PATH in which /bin was before /sbin, while
root logins had a $PATH with /sbin first. This allows the tricks that
consolehelper is doing to work. But even if we ignore consolehelper, having the
path in this order might have been used by admins for other purposes, and
keeping the order in user sessions will make the adoption of systemd user
sessions a bit easier.
Fixes #733.
https://bugzilla.redhat.com/show_bug.cgi?id=1744059
OOM handling in manager_default_environment wasn't really correct.
Now the (theoretical) malloc failure in strv_new() is handled.
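A minimal sketch of the corrected pattern, using the usual strv_new() and
log_oom() helpers; the surrounding names (DEFAULT_PATH, m->environment) are
illustrative, not the exact code:

        char **e = strv_new("PATH=" DEFAULT_PATH);
        if (!e)
                return log_oom();       /* logs the error and returns -ENOMEM */

        m->environment = e;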
Please note that the $PATH change has no effect on:
- systems with merged /bin-/sbin (e.g. arch)
- when there are no binaries that differ between the two locations.
E.g. on my F30 laptop there is exactly one program that is affected:
/usr/bin/setup -> consolehelper.
There is less and less stuff that relies on consolehelper, but there's still
some.
So for "clean" systems this makes no difference, but helps with legacy setups.
$ dnf repoquery --releasever=31 --qf %{name} --whatrequires usermode
anaconda-live
audit-viewer
beesu
chkrootkit
driftnet
drobo-utils-gui
hddtemp
mate-system-log
mock
pure-ftpd
setuptool
subscription-manager
system-config-httpd
system-config-rootpassword
system-switch-java
system-switch-mail
usermode-gtk
vpnc-consoleuser
wifi-radar
xawtv
When we would iterate over the lookup paths for each unit, making the list as
short as possible was important for performance. With the current cache, it
doesn't matter much. Two classes of paths were being removed:
- paths which don't exist in the filesystem
- paths which symlink to a path earlier in the search list
Both of those points cause problems with the caching code:
- if a user creates a directory that didn't exist before and puts units there,
now we will notice the new mtime and properly load the unit. When the path
was removed from the list, we wouldn't.
- we now properly detect whether a unit path is on the path or not.
Before, if e.g. /lib/systemd/system and /usr/lib/systemd/system were both on
the path, and /lib was a symlink to /usr/lib, the second directory would be
pruned from the path. Then, the code would think that a symlink
/etc/systemd/system/foo.service→/lib/systemd/system/foo.service is an alias,
but /etc/systemd/system/foo.service→/usr/lib/systemd/system/foo.service would
be considered a link (in the systemctl link sense).
Removing the pruning has a slight negative performance impact in case of
usr-merge systems which have systemd compiled with non-usr-merge paths.
Non-usr-merge systems are deprecated, and this impact should be very small, so
I think it's OK. If it turns out to be an issue, the loop in the function that
builds the cache could be improved to skip over "duplicate" directories with
the same logic that the pruning used before. I didn't want to add this,
because it complicates the code to improve a corner case.
Fixes #13272.
v2:
- do not watch mtime of transient and generated dirs
We'd reload the map after every transient unit we created, which we don't
need to do, since we create those units ourselves and know their fragment
path.
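Roughly how the mtime check works, as a sketch (the struct and names are
illustrative, not the actual code):

#include <stdbool.h>
#include <stddef.h>
#include <time.h>
#include <sys/stat.h>

/* The cache remembers the mtime of every lookup directory as observed when
 * the map was built; if any of them changed, or a previously missing
 * directory now exists, the map is rebuilt. */
typedef struct CachedDir {
        const char *path;
        struct timespec mtime;          /* zero if the dir didn't exist */
} CachedDir;

static bool cache_is_current(const CachedDir *dirs, size_t n) {
        for (size_t i = 0; i < n; i++) {
                struct stat st;
                struct timespec now = {};

                if (stat(dirs[i].path, &st) >= 0)
                        now = st.st_mtim;

                if (now.tv_sec != dirs[i].mtime.tv_sec ||
                    now.tv_nsec != dirs[i].mtime.tv_nsec)
                        return false;   /* something changed, rebuild the map */
        }

        return true;
}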
This reworks how we load units from disk. Instead of chasing symlinks every
time we are asked to load a unit by name, we slurp all symlinks from disk
and build two hashmaps:
1. from unit name to either alias target, or fragment on disk
(if an alias, we put just the target name in the hashmap, if a fragment
we put an absolute path, so we can distinguish both).
2. from a unit name to all aliases
Reading all this data can be pretty costly (40 ms on my machine), so we keep it
around for reuse.
The advantage is that we can reliably know what all the aliases of a given unit
are. This means we can reliably load dropins under all names. This fixes #11972.
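A toy illustration of map (1) and how the two kinds of values are told apart;
the table and helper names are made up for the example, the real code uses
systemd's Hashmap:

#include <stddef.h>
#include <string.h>

/* Values are either an absolute fragment path or a plain unit name (an alias
 * target); the leading '/' is what distinguishes them. */
static const char *const name_map[][2] = {
        { "dbus.service",       "/usr/lib/systemd/system/dbus.service" },
        { "messagebus.service", "dbus.service" },        /* alias */
};

static const char *map_get(const char *name) {
        for (size_t i = 0; i < sizeof(name_map) / sizeof(name_map[0]); i++)
                if (strcmp(name_map[i][0], name) == 0)
                        return name_map[i][1];
        return NULL;
}

/* Follow alias targets until an absolute path (= fragment on disk) is hit. */
static const char *resolve_fragment(const char *name) {
        const char *v;

        while ((v = map_get(name))) {
                if (v[0] == '/')
                        return v;
                name = v;
        }
        return NULL;
}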
Jobs are added to the run queue in random order. This happens because most
jobs are added by iterating over the transaction or dependency hash maps.
As a result, jobs that can be executed at the same time are started in a
different order each time.
On small embedded devices this can cause a measurable jitter for the point
in time when a job starts (~100ms jitter for 10 units that are started in
random order).
This results in a similar jitter for the boot time. This is undesirable in
general and makes optimizing the boot time a lot harder.
Also, jobs that should have a higher priority because the unit has a higher
CPU weight might get executed later than others.
Fix this by turning the job run_queue into a Prioq and sort by the
following criteria (use the next if the values are equal):
- CPU weight
- nice level
- unit type
- unit name
The last one is just there for deterministic sorting to avoid any jitter.
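A sketch of what such a comparison function can look like; the Job fields
shown are simplified stand-ins for the real unit properties:

#include <stdint.h>
#include <string.h>

typedef struct Job {
        uint64_t cpu_weight;    /* unit's effective CPU weight */
        int nice;               /* unit's nice level */
        int unit_type;          /* UnitType-style enum value */
        const char *unit_name;
} Job;

/* Smaller return value = dispatched earlier from the Prioq. */
static int job_compare(const void *a, const void *b) {
        const Job *x = a, *y = b;

        /* Higher CPU weight first */
        if (x->cpu_weight != y->cpu_weight)
                return x->cpu_weight > y->cpu_weight ? -1 : 1;

        /* Lower nice value (= higher scheduling priority) first */
        if (x->nice != y->nice)
                return x->nice < y->nice ? -1 : 1;

        /* Then by unit type */
        if (x->unit_type != y->unit_type)
                return x->unit_type < y->unit_type ? -1 : 1;

        /* Finally by name, purely to make the order deterministic */
        return strcmp(x->unit_name, y->unit_name);
}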
$ systemd-analyze dump | head -3
Timestamp firmware: (null)
Timestamp loader: (null)
Timestamp kernel: Mon 2019-07-01 17:21:02 CEST
Since this is a debugging interface, it is OK to change the output format.
The user can infer what "Timestamp firmware: 123.456ms" means.
In this mode we are not supposed to "interact with the environment", so loading
all units and printing warnings about syntax errors and /var/run usage seems
inappropriate.
This adds a new per-service OOMPolicy= (along with a global
DefaultOOMPolicy=) that controls what to do if a process of the service
is killed by the kernel's OOM killer. It has three different values:
"continue" (old behaviour), "stop" (terminate the service), "kill" (let
the kernel kill all the service's processes).
On top of that, track OOM killer events per unit: generate a per-unit
structured, recognizable log message when we see an OOM killer event,
and put the service in a failure state if an OOM killer event was seen
and the selected policy was not "continue". A new "result" is defined
for this case: "oom-kill".
All of this relies on new cgroupv2 kernel functionality: the
"memory.events" notification interface and the "memory.oom.group"
attribute (which makes the kernel kill all cgroup processes
automatically).
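For reference, a sketch of how an OOM kill can be detected from the cgroupv2
interface mentioned above; the path handling is simplified, and systemd gets
notified of changes rather than polling the file:

#include <stdio.h>
#include <string.h>

/* Read the "oom_kill" counter from a cgroup's memory.events file. A value
 * that increased since the last check means the kernel's OOM killer killed
 * a process in this cgroup. */
static int read_oom_kill(const char *cgroup_path, unsigned long long *ret) {
        char path[512], key[64];
        unsigned long long value;
        FILE *f;

        snprintf(path, sizeof(path), "%s/memory.events", cgroup_path);
        f = fopen(path, "re");
        if (!f)
                return -1;

        while (fscanf(f, "%63s %llu", key, &value) == 2)
                if (strcmp(key, "oom_kill") == 0) {
                        *ret = value;
                        fclose(f);
                        return 0;
                }

        fclose(f);
        return -1;
}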
So far the priorities for cgroup empty event handling were pretty weird.
The raw events (on cgroupsv2 from inotify, on cgroupsv1 from the agent
dgram socket) were scheduled at a lower priority than the cgroup empty
queue dispatcher. Let's swap that and ensure that we can coalesce events
more aggressively: let's process the raw events at higher priority than
the cgroup empty event (which remains at the same prio).
Let's unify the two similar code paths to watch /run/systemd/journal.
The code in manager.c is similar, but it uses mkdir_p_label(), and unifying
that would be too much trouble, so let's just adjust the error messages to
be the same.
CID #1400224.
Some PIDs can remain in the watched list even though their processes have
long since exited. This can easily happen, for example, if the main process of
a forking service manages to spawn a child before the control process exits.
However when a pid is about to be mapped to a unit by calling unit_watch_pid(),
the caller usually knows if the pid should belong to this unit exclusively: if
we just forked() off a child, then we can be sure that its PID is otherwise
unused. In this case we take the opportunity to remove any stale PIDs from
the watched process list.
If we learnt about a PID in any other way (for example via a PID file, via
searching, MAINPID= and so on), then we can't assume anything.
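A toy illustration of the exclusive hint; watch_pid() here is a printf
stand-in for the real PID-watching call, which updates the manager's
PID-to-unit maps:

#include <stdbool.h>
#include <stdio.h>
#include <sys/types.h>
#include <unistd.h>

static void watch_pid(const char *unit, pid_t pid, bool exclusive) {
        printf("watch %s: pid=%d exclusive=%s\n",
               unit, (int) pid, exclusive ? "yes" : "no");
}

int main(void) {
        pid_t pid = fork();
        if (pid == 0)
                _exit(0);       /* child: nothing to do in this sketch */
        if (pid < 0)
                return 1;

        /* We forked this PID ourselves, so it cannot belong to any other
         * unit; the exclusive hint lets stale entries with the same PID be
         * evicted from the watch list. A PID learnt via a PID file, MAINPID=
         * or by searching gives no such guarantee. */
        watch_pid("foo.service", pid, true);
        return 0;
}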
When the system manager is started for the first time or after switching root,
udev's device tag data does not exist yet.
So, let's not honor the enumeration results.
Fixes #11997.
This adds a new bitfield to `execute_directories()` which allows configuring
whether to ignore non-zero exit statuses of the binaries run and
whether to allow parallel execution of commands.
In case errors are not ignored, the exit status of the failed script
will now be returned for error reporting purposes or other future use.
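The bitfield could look roughly like this (a sketch; the exact names in the
tree may differ):

typedef enum ExecDirFlags {
        EXEC_DIR_NONE           = 0,
        EXEC_DIR_PARALLEL       = 1 << 0,  /* run the binaries in parallel */
        EXEC_DIR_IGNORE_ERRORS  = 1 << 1,  /* a non-zero exit status doesn't fail the whole run */
} ExecDirFlags;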
Memory management is borked for this, and moreover this is unnecessary
since f0831ed2a0, i.e. since coldplug() and catchup() are two different
concepts: the former restoring the state from before a reload, the
latter then adjusting it again to the actual status in effect after the
reload.
Fixes: #10716
Mostly reverts: #8803
PATH_MAX is supposed to include the terminating NUL byte. But we already
check that there is no NUL byte in the specified path. Hence the maximum
length we can expect is PATH_MAX - 1.
This doesn't change much, but makes this use of PATH_MAX consistent with the
rest of the codebase.
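In other words, the check boils down to something like this (sketch, function
name illustrative):

#include <errno.h>
#include <limits.h>
#include <string.h>

/* PATH_MAX includes the terminating NUL, and embedded NUL bytes are already
 * rejected elsewhere, so the longest valid string is PATH_MAX - 1 chars. */
static int check_path_length(const char *p) {
        if (strlen(p) > PATH_MAX - 1)
                return -ENAMETOOLONG;
        return 0;
}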
add new "systemd-run-generator" for running arbitrary commands from the kernel command line as system services using the "systemd.run=" kernel command line switch
This adds SuccessActionExitStatus= and FailureActionExitStatus= that may
be used to configure the exit status to propagate when
SuccessAction=exit or FailureAction=exit is used.
When not specified, let's also propagate the exit status of the main
process we fork off for the unit.
For PID 1 we adjust the umask to 0, but generators should not run that
way, given that they might be implemented as shell scripts and such.
Let's hence explicitly adjust the umask for them.
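A sketch of the idea, with 0022 as an illustrative value for the generator
umask (the value actually chosen may differ):

#include <sys/stat.h>
#include <unistd.h>

/* In the child forked off to run a generator, reset the umask before
 * exec'ing, so a (possibly script-based) generator doesn't inherit PID 1's
 * umask of 0. */
static void exec_generator(const char *path, char *const argv[]) {
        umask(0022);
        execv(path, argv);
        _exit(127);     /* only reached if execv() failed */
}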
We already do this for unit generators. Let's do this for env
generators, too.
Otherwise we keep collecting stuff from env generators, and we really
shouldn't.
This was working properly on reexec but not on reload, as for reexec we
would always start fresh, but for reload we would reuse the Manager object
and hence its default environment set.
Fixes: #10671