Systemd

Author	SHA1	Message	Date
Zbigniew Jędrzejewski-Szmek	9432f882a5	pid1: order .automount units after local-fs-pre.target From the bug: "According to the documentation of systemd.automount if the automount point is automagically created if it doesn't exist yet. This of course means the filesystem underneath has to be writable, which for / means not only does -.mount need to be started but also systemd-remount-fs.service has to be run, which isn't guaranteed by the default automount dependencies. For .mount units there is an automatic default After= dependency on local-fs-pre.target, would probably make sense to do the same for automount units to avoid it failing on the corner-case where it has to create directory." Fixes #13306.	2019-10-28 22:44:32 +09:00
Philip Withnall	9ed7de605d	scope: Support RuntimeMaxSec= directive in scope units Just as `RuntimeMaxSec=` is supported for service units, add support for it to scope units. This will gracefully kill a scope after the timeout expires from the moment the scope enters the running state. This could be used for time-limited login sessions, for example. Signed-off-by: Philip Withnall <withnall@endlessm.com> Fixes: #12035	2019-10-28 09:44:31 +01:00
Zbigniew Jędrzejewski-Szmek	e9cfc71222	Merge pull request #13635 from fbuihuu/no-aliases-with-enable man: alias names can't be used with enable command	2019-10-28 09:23:08 +01:00
Yu Watanabe	f2106b1789	Merge pull request #13836 from systemd/assert-cleanups-and-constification Assert cleanups and constification	2019-10-25 13:36:00 +09:00
Zbigniew Jędrzejewski-Szmek	a5648b8094	basic/fs-util: change CHASE_OPEN flag into a separate output parameter chase_symlinks() would return negative on error, and either a non-negative status or a non-negative fd when CHASE_OPEN was given. This made the interface quite complicated, because depending on the flags used, we would get two different "types" of return object. Coverity was always confused by this, and flagged every use of chase_symlinks() without CHASE_OPEN as a resource leak (because it would think that an fd is returned). This patch uses a separate output parameter, so there is no confusion. (I think it is OK to have functions which return either an error or an fd. It's only returning either an fd or a non-fd that is confusing.)	2019-10-24 22:44:24 +09:00
Zbigniew Jędrzejewski-Szmek	0e7f5ad9d3	Move PLYMOUTH_SOCKET define to def.h and nuke plymouth-util.h Let's not have a file with a single define.	2019-10-24 11:48:08 +02:00
Chris Down	959daf9bfc	Merge pull request #13743 from anitazha/dropin_all_the_things core: support top level drop-ins through -.service.d for service units	2019-10-16 23:10:05 -04:00
Yu Watanabe	7f66ff56eb	Merge pull request #13784 from keszybz/constify-unit-pointers Constify unit pointers	2019-10-17 09:41:36 +09:00
Anita Zhang	d727acb650	Merge pull request #13754 from claudiozz/master Allow restart for oneshot units	2019-10-16 14:21:59 -07:00
Claudio Zumbo	10e72727ee	Allow restart for oneshot units Picked up from https://github.com/systemd/systemd/pull/7474 , so coauthored by @robermorales.	2019-10-16 09:44:20 -07:00
Zbigniew Jędrzejewski-Szmek	abc9fa1cf1	core/load-fragment: remove unnecessary initialization manager_load_unit() better set it on success, and unit_set_slice() asserts that the argument is not NULL, so initializing it to NULL is not useful.	2019-10-16 16:33:54 +02:00
Zbigniew Jędrzejewski-Szmek	47538b7686	core/load-fragment: constify Unit* arguments where possible This makes it easy to tell that the function only uses the Unit* for reporting, and only makes changes to the other argument (which most likely also points at the same Unit structure) for modifications.	2019-10-16 16:32:45 +02:00
Zbigniew Jędrzejewski-Szmek	a2262bcafa	core: mark unit__printf() functions as taking a const Unit They should never modify the unit argument, let's make this clear. Also see `303ee60151`.	2019-10-16 16:21:56 +02:00
Anita Zhang	d272467882	shared/dropin: support -.service.d/ top level drop-in for service units Closes #12830	2019-10-15 11:14:54 -07:00
Zbigniew Jędrzejewski-Szmek	2cea199ec1	core: pass around pointer, not struct Since this is a static function, the compiler is likely to optimize it away anyway, but let's do the normal thing here.	2019-10-11 13:46:05 +02:00
Zbigniew Jędrzejewski-Szmek	75193d4128	core: adjust load functions for other unit types to be more like service No functional change, just adjusting code to follow the same pattern everywhere. In particular, never call _verify() on an already loaded unit, but return early from the caller instead. This makes the code a bit easier to follow.	2019-10-11 13:46:05 +02:00
Zbigniew Jędrzejewski-Szmek	c3784a7d78	core: simplify unit_load() a bit Now all unit types define .load. But even if it wasn't defined, we'd need to call unit_load_fragment_and_dropin() anyway, so this code would not have worked correctly. Also, unit_load_fragment_and_dropin() either returns -ENOENT or changes UNIT_STUB to UNIT_LOADED, so we don't need to repeat this here.	2019-10-11 11:25:04 +02:00
Zbigniew Jędrzejewski-Szmek	e0cfed4c59	core/service: use common implementation of unit_load_fragment_and_dropin() There is a slight functional change when load_state == UNIT_MERGED. Before, we would not call unit_load_dropin(), but now we do. I'm not sure if this causes an actual difference in behaviour, but since all other unit types do this, I think it's better to do the same thing here too.	2019-10-11 11:25:04 +02:00
Zbigniew Jędrzejewski-Szmek	c362077087	core: turn unit_load_fragment_and_dropin_optional() into a flag unit_load_fragment_and_dropin() and unit_load_fragment_and_dropin_optional() are really the same, with one minor difference in behaviour. Let's drop the second function. "_optional" in the name suggests that it's the "dropin" part that is optional. (Which it is, but in this case, we mean the fragment to be optional.) I think the new version with a flag is easier to understand.	2019-10-11 10:45:33 +02:00
Anita Zhang	e23d911664	core: disallow using '-.service' as a service name -.service.d will become a special top level drop in so don't let it be a usable service name (otherwise the interaction gets complicated).	2019-10-07 12:02:12 -07:00
Franck Bui	27c3112dcb	fs-util: introduce inotify_add_watch_and_warn() helper The default message for ENOSPC is very misleading: it says that the disk is filled, but in fact the inotify watch limit is the problem. So let's introduce and use a wrapper that simply calls inotify_add_watch(2) and which fixes the error message up in case ENOSPC is returned.	2019-10-05 08:08:20 +02:00
Zbigniew Jędrzejewski-Szmek	3509e678f8	Merge pull request #13690 from cdown/cgroup_rework cgroup: Add support to check systemd-internal cgroup limits against the kernel	2019-10-03 22:09:56 +02:00
Franck Bui	a5cede8c24	pid1: restore the original environment passed by the kernel when switching to a new system manager PID1 may modify the environment passed by the kernel when it starts running. Commit `9d48671c62` unset $HOME for example. In case PID1 is going to switch to a new root and execute a new system manager which is not systemd, we should restore the original environment as the new manager might expect some variables to be set by default (more specifically $HOME).	2019-10-03 22:08:13 +02:00
Chris Down	bc0623df16	cgroup: analyze: Report memory configurations that deviate from systemd This is the most basic consumer of the new systemd-vs-kernel checker, both acting as a reasonable standalone exerciser of the code, and also as a way for easy inspection of deviations from systemd internal state.	2019-10-03 15:06:25 +01:00
Chris Down	6dfb92823f	cgroup: analyze: Match standard dump format We're the only ones left using = as the delimiter, which looks really weird in `systemd-analyze dump`. Use `: ` like everyone else.	2019-10-03 15:06:25 +01:00
Chris Down	74b5fb272f	cgroup: Allow checking systemd-internal limits against the kernel We currently don't have any mitigations against another privileged user on the system messing with the cgroup hierarchy, bringing the system out of line with what we've set in systemd. We also don't have any real way to surface this to the user (we do have logs, but you have to know to look in the first place). There are a few possible solutions: 1. Maintaining our own cgroup tree with the new fsopen API and having a read-only copy for everyone else. However, there are some complications on this front, and this may be infeasible in some environments. I'd rate this as a longer term effort that's tangential to this patch. 2. Actively checking for changes with {fa,i}notify and changing them back afterwards to match our configuration again. This is also possible, but it's also good to have a way to do passive monitoring of the situation without taking hard action. Also, currently daemons like senpai do actually need to modify the tree behind systemd's back (although hopefully this should be more integrated soon). This patch implements another option, where one can, on demand, monitor deviations in cgroup memory configuration from systemd's internal state. Currently the only consumer is `systemd-analyze dump`, but the interface is generic enough that it can also be exposed elsewhere later (for example, over D-Bus). Currently only memory limit style properties are supported, but later I also plan to expand this out to other properties that systemd should have ultimate control over.	2019-10-03 15:06:25 +01:00
Mike Kazantsev	fc103b3e34	cgroup: fix typo in BPF firewall support warning message	2019-10-03 15:48:57 +02:00
Zbigniew Jędrzejewski-Szmek	86e94d95d0	Merge pull request #13246 from keszybz/add-SystemdOptions-efi-variable Add efi variable to augment /proc/cmdline	2019-10-03 12:19:44 +02:00
Zbigniew Jędrzejewski-Szmek	6e2d361d53	Merge pull request #13696 from keszybz/keep-dhcp-on-restart Add a way to differentiate restart from stop and keep dhcp config on restart	2019-10-03 11:25:12 +02:00
Franck Bui	c0000de87d	pid1: fix DefaultTasksMax initialization Otherwise DefaultTasksMax is always set to "infinity". This was broken by `fb39af4ce4`.	2019-10-03 11:24:27 +02:00
Dan Streetman	8084dcb9d7	src/core/automount: use DirectoryMode when calling mkdir -p mkdir -p is called both when setting up the autofs mount, as well as after being notified that the real mount unit should be called. However the first mkdir -p is hardcoded with 0555, while the second uses the value specified to DirectoryMode in the automount unit; the second mkdir -p is only needed when called from coldplug, so under normal operation the dirs are incorrectly created with mode 0555. This replaces the hardcoded 0555 mode with the value of DirectoryMode. Closes #13683.	2019-10-02 16:11:02 +02:00
Zbigniew Jędrzejewski-Szmek	4ab1670f3d	core: rework how logging level is calculated for kill operations Setting the log level based on the signal made sense when signals that were used were fixed. Since we allow signals to be configured, it doesn't make sense to log at notice level about e.g. a restart or stop operation just because the signal used is different. This avoids messages like: six.service: Killing process 210356 (sleep) with signal SIGINT.	2019-10-02 14:01:40 +02:00
Zbigniew Jędrzejewski-Szmek	a232ebcc2c	core: add support for RestartKillSignal= to override signal used for restart jobs v2: - if RestartKillSignal= is not specified, fall back to KillSignal=. This is necessary to preserve backwards compatibility (and keep KillSignal= generally useful).	2019-10-02 14:01:25 +02:00
Chris Down	2bfd08ce38	Merge pull request #13691 from mrc0mmand/coverity-fixes Coverity fixes for unchecked return values	2019-10-02 10:42:53 +01:00
Zbigniew Jędrzejewski-Szmek	28a2dfe801	core: add helper function to check job status Since job.h includes unit.h, and unit.h includes job.h, imports need to be adjusted to make sure unit.h is included first if the helper is used.	2019-10-01 15:05:27 +02:00
Zbigniew Jędrzejewski-Szmek	fa036b6114	core: remove unused prototypes	2019-10-01 14:25:10 +02:00
Zbigniew Jędrzejewski-Szmek	c436a4981e	core: minor formatting adjustment	2019-10-01 14:13:35 +02:00
Frantisek Sumsal	54756dce57	execute: explicitly ignore fd_wait_for_event()'s return value Fixes CID#1402316	2019-10-01 10:25:36 +02:00
Chris Down	184e989d7d	cgroup: Mark memory protections as explicitly set in transient units A later version of the DefaultMemory{Low,Min} patch changed these to require explicitly setting memory_foo_set, but we only set that in load-fragment, not dbus-cgroup. Without these, we may fall back to either DefaultMemoryFoo or CGROUP_LIMIT_MIN when we really shouldn't.	2019-09-30 22:27:21 +01:00
Chris Down	64fe532e90	cgroup: Respect DefaultMemoryMin when setting memory.min This is an oversight from https://github.com/systemd/systemd/pull/12332. Sadly the tests didn't catch it since it requires a real cgroup hierarchy to see, and it wasn't seen in prod since we're only currently using DefaultMemoryLow, not DefaultMemoryMin. :-(	2019-09-30 18:41:21 +01:00
Chris Down	7c9d2b7993	cgroup: Check ancestor memory min for unified memory config Otherwise we might not enable it when we should, ie. DefaultMemoryMin is set in a parent, but not MemoryMin in the current unit.	2019-09-30 18:24:26 +01:00
Michael Olbrich	28e68bb235	Handle d_type == DT_UNKNOWN correctly As documented in the man-page, readdir() may return a directory entry with d_type == DT_UNKNOWN. This must be handled for regular filesystems. dirent_ensure_type() is available to set d_type if necessary. Use it in some more places. Without this systemd will fail to boot correctly with nfsroot and some other filesystems. Closes #13609	2019-09-30 13:29:59 +01:00
Filipe Brandenburger	28b77ab246	log: Add missing "%" in "%m" log format strings These were clearly intended to be "%m" to display the human readable version of the error stored in errno.	2019-09-25 09:28:26 +02:00
Franck Bui	2268367471	shared/install: failing with -ELOOP can be due to the use of an alias in install_error() -ELOOP can happen also when enabling an alias name (which is admittedly useless since the unit it belongs to was already enabled) so let's mention this possibility when reporting the corresponding error.	2019-09-24 19:05:06 +02:00
Pavel Hrdina	047f5d63d7	cgroup: introduce support for cgroup v2 CPUSET controller Introduce support for configuring cpus and mems for processes using cgroup v2 CPUSET controller. This allows users to limit which cpus and memory NUMA nodes can be used by processes to better utilize system resources. The cgroup v2 interfaces to control it are cpuset.cpus and cpuset.mems where the requested configuration is written. However, it doesn't mean that the requested configuration will be actually used as parent cgroup may limit the cpus or mems as well. In order to reflect the real configuration cgroup v2 provides read-only files cpuset.cpus.effective and cpuset.mems.effective which are exported to users as well.	2019-09-24 15:16:07 +02:00
Zbigniew Jędrzejewski-Szmek	6123dfaa72	pid1: disable printk ratelimit in early boot We have the problem that many early boot or late shutdown issues are harder to solve than they could be because we have no logs. When journald is not running, messages are redirected to /dev/kmsg. It is also the time when many things happen in a rapid succession, so we tend to hit the kernel printk ratelimit fairly reliably. The end result is that we get no logs from the time where they would be most useful. Thus let's disable the kernel's ratelimit. Once the system is up and running, the ratelimit is not a problem. But during normal runtime, things also log to journald, and not to /dev/kmsg, so the ratelimit is not useful. Hence, there doesn't seem to be much point in trying to restore the ratelimit after boot is finished and journald is up and running. See kernel's commit 750afe7babd117daabebf4855da18e4418ea845e for the description of the kernel interface. Our setting has lower precedence than explicit configuration on the kernel command line.	2019-09-20 16:05:53 +02:00
Zbigniew Jędrzejewski-Szmek	5ac1530eca	tree-wide: say "ratelimit" not "rate_limit" "ratelimit" is a real word, so we don't need to use the other form anywhere. We had both forms in various places, let's standardize on the shorter and more correct one.	2019-09-20 16:05:53 +02:00
Zbigniew Jędrzejewski-Szmek	7bf081a1e5	pid1: rename start_limit to start_ratelimit This way it is clearer what the type is. We also have auto_stop_ratelimit adjacent, and it feels ugly to have a different suffix for those two.	2019-09-20 16:05:53 +02:00
Zbigniew Jędrzejewski-Szmek	8c227e7f2b	Drop RATELIMIT macros Using plain structure initialization is both shorter _and_ clearer. We get type safety for free.	2019-09-20 16:05:53 +02:00
Zbigniew Jędrzejewski-Szmek	90b059b608	pid1: do not warn if /run/systemd/relabel-extra.d/ doesn't exist After all, that is the expected state.	2019-09-19 18:01:40 +02:00
Anita Zhang	898fc00e79	core: add ExecXYZEx= bus hook ups to all exec command properties The "Ex" variant was originally only added for ExecStartXYZ= but it makes sense to have feature parity for the rest of the exec command properties as well (e.g. ExecReload=, ExecStop=, etc).	2019-09-17 15:48:44 +00:00
ypf791	b49e14d5f3	core: coldplug possible nop_job	2019-09-17 13:46:21 +00:00
Michal Sekletar	8fca6944c2	path: stop watching path specs once we triggered the target unit We start watching them again once we get a notification that triggered unit entered inactive or failed state. Fixes: #10503	2019-09-17 10:11:32 +02:00
Maciej Stanczew	6327aa9f6c	core: Fix setting StatusUnitFormat from config files	2019-09-17 15:21:21 +09:00
Zbigniew Jędrzejewski-Szmek	0bb2f0f10e	util-lib: split shared/efivars into basic/efivars and shared/efi-loader I want to use efivars.[ch] in proc-cmdline.c, but most of the efivars stuff is not needed in basic/. Move the file from shared/ to basic/, but then move back most of the higher-level functions to the new shared/efi-loader.c file.	2019-09-16 18:08:53 +02:00
Zbigniew Jędrzejewski-Szmek	fdb3decaa7	util-lib: move some functions from basic/cgroup-util to shared/cgroup-setup This way less stuff needs to be in basic. Initially, I wanted to move all the parts of cgroup-utils.[ch] that depend on efivars.[ch] to shared, because efivars.[ch] is in shared/. Later on, I decided to split efivars.[ch], so the move done in this patch is not necessary anymore. Nevertheless, it is still valid on its own. If at some point we want to expose libbasic, it is better to not have stuff that belongs in libshared there.	2019-09-16 18:08:00 +02:00
Zbigniew Jędrzejewski-Szmek	d4d99bc6e4	basic/cgroup-util: let cgroup_unified_flush() return the detected hierarchy This avoids the use of the global variable. Also rename cgroup_unified_update() to cgroup_unified_cached() and cgroup_unified_flush() to cgroup_unified() to better reflect their new roles.	2019-09-16 18:06:20 +02:00
Franck Bui	5a1c1b534f	core: restore initialization of u->source_mtime During the rework of unit file loading, commit `e8630e6952` dropped the initialization of u->source_mtime. This had the bad side effect that generated units always needed daemon reloading.	2019-09-16 15:53:52 +02:00
Yu Watanabe	f39fc2d88b	Merge pull request #13354 from keszybz/two-refactoring-patches Two or more refactoring patches	2019-09-16 21:24:13 +09:00
Zbigniew Jędrzejewski-Szmek	36b12282e1	basic/conf-files: make conf_files_list() take just a single directory This function had two users (apart from tests), and both only used one argument. And it seems likely that if we need to pass more directories, either the _nulstr() or the _strv() form would be used. Let's simplify the code.	2019-09-16 09:15:05 +02:00
Zbigniew Jędrzejewski-Szmek	48da02ec6f	core/mount-setup: use conf_files_list_strv() for relabel-extra.d/	2019-09-16 09:15:05 +02:00
Benjamin Gilbert	71de68476c	mount-setup: relabel items mentioned directly in relabel-extra.d relabel_extra() relabels the descendants of directories listed in relabel-extra.d, but doesn't relabel the files or directories explicitly named there. This makes it impossible to use relabel-extra.d to relabel the root of a filesystem. Fix by relabeling the named items too.	2019-09-16 09:04:22 +02:00
Zbigniew Jędrzejewski-Szmek	de5ae832f2	Merge pull request #13439 from yuwata/core-support-systemctl-clean-more core: support systemctl clean more	2019-09-13 16:15:02 +02:00
Zbigniew Jędrzejewski-Szmek	74e6a78221	Merge pull request #13440 from keszybz/failing-condtion-check Revert "core: check start limit on condition checks too"	2019-09-03 10:04:05 +02:00
Dimitri John Ledkov	8fa0de653b	Generate stable machine-id and DHCP client ID on POWER KVM.	2019-08-31 10:57:16 +02:00
Zbigniew Jędrzejewski-Szmek	6b4f7fb08c	Merge pull request #13385 from yuwata/core-remove-private-directories-13355 core: also remove private directories by systemctl clean	2019-08-31 09:28:39 +02:00
Michael Biebl	07125d24ee	Drop dbus activation stub service This fixes the following problem: > At the very end of the boot, just after the first user logs in > (usually using sddm / X) I get the following messages in my logs: > Nov 18 07:02:33 samd dbus-daemon[2879]: [session uid=1000 pid=2877] Activated service 'org.freedesktop.systemd1' failed: Process org.freedesktop.systemd1 exited with status 1 > Nov 18 07:02:33 samd dbus-daemon[2879]: [session uid=1000 pid=2877] Activated service 'org.freedesktop.systemd1' failed: Process org.freedesktop.systemd1 exited with status 1 These messages are caused by the "stub" service files that systemd installs. It installed them because early versions of systemd activation required them to exist. Since dbus 1.11.0, a dbus-daemon that is run with --systemd-activation automatically assumes that o.fd.systemd1 is an activatable service. As a result, with a new enough dbus version, /usr/share/dbus-1/services/org.freedesktop.systemd1.service and /usr/share/dbus-1/system-services/org.freedesktop.systemd1.service should become unnecessary, and they can be removed. dbus 1.11.0 was released 2015-12-02. Bug-Debian: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=914015	2019-08-30 18:26:43 +02:00
Zbigniew Jędrzejewski-Szmek	5af6aa58aa	Revert "core: check start limit on condition checks too" This reverts commit `2de9b9793b`. This check causes regressions, in particular our own units fail. Apparently, it is enough for the unit to be referenced enough times: $ journalctl -b -u systemd-ask-password-console.path Aug 30 12:08:14 krowka systemd[1]: Condition check resulted in Dispatch Password Requests to Console Directory Watch being skipped. Aug 30 12:08:33 krowka systemd[1]: Condition check resulted in Dispatch Password Requests to Console Directory Watch being skipped. Aug 30 12:08:33 krowka systemd[1]: Condition check resulted in Dispatch Password Requests to Console Directory Watch being skipped. Aug 30 12:08:33 krowka systemd[1]: Condition check resulted in Dispatch Password Requests to Console Directory Watch being skipped. Aug 30 12:08:33 krowka systemd[1]: Condition check resulted in Dispatch Password Requests to Console Directory Watch being skipped. Aug 30 12:08:33 krowka systemd[1]: systemd-ask-password-console.path: Start request repeated too quickly. Aug 30 12:08:33 krowka systemd[1]: Failed to start Dispatch Password Requests to Console Directory Watch. $ journalctl -b -u systemd-firstboot.service -- Logs begin at Sun 2019-04-21 12:39:21 CEST, end at Fri 2019-08-30 12:23:06 CEST. -- Aug 30 12:08:33 krowka systemd[1]: Condition check resulted in First Boot Wizard being skipped. Aug 30 12:08:33 krowka systemd[1]: Condition check resulted in First Boot Wizard being skipped. Aug 30 12:08:33 krowka systemd[1]: Condition check resulted in First Boot Wizard being skipped. Aug 30 12:08:33 krowka systemd[1]: Condition check resulted in First Boot Wizard being skipped. Aug 30 12:08:33 krowka systemd[1]: systemd-firstboot.service: Start request repeated too quickly. Aug 30 12:08:33 krowka systemd[1]: Failed to start First Boot Wizard. And the same for other units. Fixes #13434. https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=935829	2019-08-30 18:21:05 +02:00
Zbigniew Jędrzejewski-Szmek	3a5a08bbb4	Merge pull request #13384 from yuwata/core-runtime-directory-preserve core: make RuntimeDirectoryPreserve= work with non-service units	2019-08-30 13:00:57 +02:00
Yu Watanabe	4b259b3c63	Merge pull request #13244 from keszybz/allow-dots-in-usernames Allow dots in usernames	2019-08-29 00:03:19 +09:00
Yu Watanabe	a8b689b7d0	core/swap: support "systemctl clean" for swap units	2019-08-28 23:09:54 +09:00
Yu Watanabe	12213aed12	core: move timeout_clean_usec from Service to ExecContext	2019-08-28 23:09:54 +09:00
Yu Watanabe	17e9d53d87	core/mount: support "systemctl clean" for mount units	2019-08-28 23:09:54 +09:00
Yu Watanabe	c968d76a38	core/socket: support "systemctl clean" for socket units	2019-08-28 23:09:54 +09:00
Yu Watanabe	810ef3180e	core: introduce unit_fork_and_watch_rm_rf()	2019-08-28 23:09:54 +09:00
Yu Watanabe	7f622a19d9	core: also remove private directories by systemctl clean Fixes #13355.	2019-08-28 23:09:44 +09:00
Zbigniew Jędrzejewski-Szmek	db11487d10	manager: put bin before sbin for user instances Traditionally, user logins had a $PATH in which /bin was before /sbin, while root logins had a $PATH with /sbin first. This allows the tricks that consolehelper is doing to work. But even if we ignore consolehelper, having the path in this order might have been used by admins for other purposes, and keeping the order in user sessions will make the adoption of systemd user sessions a bit easier. Fixes #733. https://bugzilla.redhat.com/show_bug.cgi?id=1744059 OOM handling in manager_default_environment wasn't really correct. Now the (theoretical) malloc failure in strv_new() is handled. Please note that this has no effect on: - systems with merged /bin-/sbin (e.g. arch) - when there are no binaries that differ between the two locations. E.g. on my F30 laptop there is exactly one program that is affected: /usr/bin/setup -> consolehelper. There is less and less stuff that relies on consolehelper, but there's still some. So for "clean" systems this makes no difference, but helps with legacy setups. $ dnf repoquery --releasever=31 --qf %{name} --whatrequires usermode anaconda-live audit-viewer beesu chkrootkit driftnet drobo-utils-gui hddtemp mate-system-log mock pure-ftpd setuptool subscription-manager system-config-httpd system-config-rootpassword system-switch-java system-switch-mail usermode-gtk vpnc-consoleuser wifi-radar xawtv	2019-08-27 18:24:44 +02:00
Zbigniew Jędrzejewski-Szmek	581fef8d56	core: stop removing non-existent and duplicate lookup paths When we would iterate over the lookup paths for each unit, making the list as short as possible was important for performance. With the current cache, it doesn't matter much. Two classes of paths were being removed: - paths which don't exist in the filesystem - paths which symlink to a path earlier in the search list Both of those points cause problems with the caching code: - if a user creates a directory that didn't exist before and puts units there, now we will notice the new mtime and properly load the unit. When the path was removed from list, we wouldn't. - we now properly detect whether a unit path is on the path or not. Before, if e.g. /lib/systemd/system, /usr/lib/systemd/systemd were both on the path, and /lib was a symlink to /usr/lib, the second directory would be pruned from the path. Then, the code would think that a symlink /etc/systemd/system/foo.service→/lib/systemd/system/foo.service is an alias, but /etc/systemd/system/foo.service→/usr/lib/systemd/system/foo.service would be considered a link (in the systemctl link sense). Removing the pruning has a slight negative performance impact in case of usr-merge systems which have systemd compiled with non-usr-merge paths. Non-usr-merge systems are deprecated, and this impact should be very small, so I think it's OK. If it turns out to be an issue, the loop in function that builds the cache could be improved to skip over "duplicate" directories with same logic that the cache pruning did before. I didn't want to add this, because it complicates the code to improve a corner case. Fixes #13272.	2019-08-27 18:12:20 +02:00
Yu Watanabe	494d0247f9	core: introduce exec_directory_is_private() helper function Also, this follows up `40cd2ecc26`.	2019-08-25 16:27:42 +09:00
Yu Watanabe	52a12341f9	core: make RuntimeDirectoryPreserve= work with non-service units	2019-08-23 00:08:16 +09:00
Yu Watanabe	95939aed21	core: introduce unit_destroy_runtime_directory() Currently `unit_will_restart()` can return true only when the unit is service. Hence, should not change anything.	2019-08-22 23:50:52 +09:00
Anita Zhang	23f8fbb303	core: TAKE_PTR in timer_add_one_calendar_spec Introduced in `d00a52c` Fixes #13373	2019-08-22 11:02:56 +02:00
Zbigniew Jędrzejewski-Szmek	5cc2cd1cd8	pid1: always log successful process termination quietly Fixes #13372.	2019-08-22 09:09:45 +02:00
Zbigniew Jędrzejewski-Szmek	4dba44a5a5	pid1: after creating transient drop-ins, put file in path cache The alternative would be to recreate the cache, but dropins can be created very often for transient settings, so updating the cache seems like a much faster option. Fixes #13287.	2019-08-21 15:35:21 +02:00
Lennart Poettering	ea7584329b	manager: simplify manager_get_confirm_spawn() a bit Let's use our usual way of storing error codes. Let's remove a redundant temporary variable we never change	2019-08-20 17:34:19 +02:00
Lennart Poettering	4a8daee72f	load-fragment: use path_join() where appropriate	2019-08-20 17:32:34 +02:00
Kai Krakow	2dbc45aea7	cgroup: Also set io.bfq.weight Current kernels with BFQ scheduler do not yet set their IO weight through "io.weight" but through "io.bfq.weight" (using a slightly different interface supporting only default weights, not per-device weights). This commit enables "IOWeight=" to do just that. This patch may be dropped at some time later. Github-Link: https://github.com/systemd/systemd/issues/7057 Signed-off-by: Kai Krakow <kai@kaishome.de>	2019-08-20 11:50:59 +02:00
Zbigniew Jędrzejewski-Szmek	ae480f0b09	shared/user-util: allow usernames with dots in specific fields People do have usernames with dots, and it makes them very unhappy that systemd doesn't like that. It seems that there is no actual problem with allowing dots in the username. In particular chown declares ":" as the official separator, and internally in systemd we never rely on "." as the separator between user and group (nor do we call chown directly). Using dots in the name is probably not a very good idea, but we don't need to care. Debian tools (adduser) do not allow users with dots to be created. This patch allows existing names with dots to be used in User, Group, SupplementaryGroups, SocketUser, SocketGroup fields, both in unit files and on the command line. DynamicUsers and sysusers still follow the strict policy. user@.service and tmpfiles already allowed arbitrary user names, and this remains unchanged. Fixes #12754.	2019-08-19 21:19:13 +02:00
Zbigniew Jędrzejewski-Szmek	d2a236929b	core: remove one {}	2019-08-19 21:04:57 +02:00
Mattias Jernberg	a5a8776ae5	core: Avoid race when starting dbus services In high load scenarios it is possible for services to be started before the NameOwnerChanged signal is properly installed. Emulate a callback by also queuing a GetNameOwner when the match is installed. Fixes: #12956	2019-08-14 16:12:31 +02:00
Zbigniew Jędrzejewski-Szmek	4c071d7f2a	meson: create (empty) /etc/systemd/system during installation We explicitly create /etc/systemd/user and other parts of the basic directory tree. I think we should create /etc/systemd/system too. (The alternative would be to not create those other directories too, but I think it's nice to have the basic directory structure in place after installation.) https://bugzilla.redhat.com/show_bug.cgi?id=1737362	2019-08-06 03:11:09 +09:00
Zbigniew Jędrzejewski-Szmek	a4fc96c823	pid1: replace asprintf() with strjoin() It's nicer. And coverity doesn't need to complain about unchecked return value (CID#1401780).	2019-08-03 17:46:56 +02:00
Lennart Poettering	735a8b6d38	job: fix coverity issue Fixes coverity issue 1403550	2019-07-31 09:45:03 +02:00
Lennart Poettering	5756bff6f1	Merge pull request #13119 from keszybz/unit-loading-2 Rework unit loading to take into account all aliases	2019-07-30 17:55:37 +02:00
Zbigniew Jędrzejewski-Szmek	91e0ee5f16	pid1: drop unit caches only based on mtime v2: - do not watch mtime of transient and generated dirs We'd reload the map after every transient unit we created, which we don't need to do, since we create those units ourselves and know their fragment path.	2019-07-30 14:01:46 +02:00
Zbigniew Jędrzejewski-Szmek	e8630e6952	pid1: use a cache for all unit aliases This reworks how we load units from disk. Instead of chasing symlinks every time we are asked to load a unit by name, we slurp all symlinks from disk and build two hashmaps: 1. from unit name to either alias target, or fragment on disk (if an alias, we put just the target name in the hashmap, if a fragment we put an absolute path, so we can distinguish both). 2. from a unit name to all aliases Reading all this data can be pretty costly (40 ms) on my machine, so we keep it around for reuse. The advantage is that we can reliably know what all the aliases of a given unit are. This means we can reliably load dropins under all names. This fixes #11972.	2019-07-30 14:01:46 +02:00
Lennart Poettering	e04ed6db6b	exit-status: rename EXIT_STATUS_GLIBC → EXIT_STATUS_LIBC After all these two exit codes are defined by ISO C as part of the C library, and it's not the GNU implementation that defines them.	2019-07-29 19:05:25 +02:00
Lennart Poettering	1d7458fbb1	Merge pull request #13207 from keszybz/symbolic-exit-code-names Symbolic exit code names	2019-07-29 18:58:06 +02:00
Zbigniew Jędrzejewski-Szmek	37109b856a	pid1: use LOG_DEBUG/INFO/NOTICE for unit resource consumption message We now log at LOG_INFO for any unit. Let's vary the log level a bit, so that for normal short-lived units (less than 1 sec CPU), we only log if debugging is enabled.	2019-07-29 18:50:31 +02:00
Zbigniew Jędrzejewski-Szmek	e7b9f4d9fa	pid1: fix message about triggers missing services systemd[1]: systemd-tmpfiles-clean.timer: Refusing to start, unit systemd-tmpfiles-clean.timer to trigger not loaded.	2019-07-29 15:54:53 +02:00
Zbigniew Jędrzejewski-Szmek	2e2ed88062	pid1,systemctl: allow symbolic exit code names	2019-07-29 15:54:53 +02:00
Zbigniew Jędrzejewski-Szmek	62b21e2e89	shared/bus-util: fix dbus serialization of {RestartPrevent,RestartForce,Success}ExitStatus We were passing 1/4th of the size in bytes as argument. So depending on the size of the array, either we'd only transfer a subset of values, or we'd get an alignment error.	2019-07-29 15:54:53 +02:00
Zbigniew Jędrzejewski-Szmek	23d5dd1687	shared/exit-status: use Bitmap instead of Sets I opted to embed the Bitmap structure directly in the ExitStatusSet. This means that memory usage is a bit higher for units which don't define this setting: Service changes: /* size: 2720, cachelines: 43, members: 73 */ /* sum members: 2680, holes: 9, sum holes: 39 */ /* sum bitfield members: 7 bits, bit holes: 1, sum bit holes: 1 bits */ /* last cacheline: 32 bytes */ /* size: 2816, cachelines: 44, members: 73 */ /* sum members: 2776, holes: 9, sum holes: 39 */ /* sum bitfield members: 7 bits, bit holes: 1, sum bit holes: 1 bits */ But this way the code is simpler and we do less pointer chasing.	2019-07-29 15:54:53 +02:00
Zbigniew Jędrzejewski-Szmek	e1714f0250	shared/exit-status: turn status level into a bitmask, add "test" The "test" doesn't really test much automatically, but it is still useful to look at the mappings.	2019-07-29 15:54:45 +02:00
Philip Withnall	7508f7f273	scope: Refactor timer handling on coldplug Factor it out into a helper function which is a bit easier to expand in future. This introduces no functional changes. Signed-off-by: Philip Withnall <withnall@endlessm.com>	2019-07-29 12:13:52 +01:00
Philip Withnall	ef71cc7787	dbus-scope: Factor out common UNIT(s) cast This introduces no functional changes. Signed-off-by: Philip Withnall <withnall@endlessm.com>	2019-07-29 12:13:51 +01:00
Lennart Poettering	c18ecf0375	core: take random seed from boot loader and credit it to kernel entropy pool	2019-07-25 18:16:46 +02:00
Lennart Poettering	0a2eef1ee1	core: try to reopen /dev/kmsg again right after mounting /dev I was debugging stuff during early boot, and was confused that I never found the logs for it in kmsg. The reason for that was that /proc is generally not mounted the first time we do log_open() and hence log_set_target(LOG_TARGET_KMSG) we do when running as PID 1 had no effect. A lot later during start-up we call log_open() again where this is fixed (after the point where we close all remaining fds still open), but in the meantime no logs ever got written to kmsg. This patch fixes that.	2019-07-24 19:56:51 +02:00
Lennart Poettering	544ad34257	Merge pull request #13118 from bluca/shutdown_watchdog_kexec core: add KExecWatchdogSec and rename ShutdownWatchdogSec to RebootWatchdogSec	2019-07-24 11:11:03 +02:00
Lennart Poettering	623f20fb41	core: add spdx header to all-units.h The specific header file is probably not copyrightable anyway, since it's so trivial, but let's still add the SPDX header line so that a systematic check for the line doesn't spit out this header needlessly.	2019-07-24 05:06:21 +09:00
Luca Boccassi	65224c1d0e	core: rename ShutdownWatchdogSec to RebootWatchdogSec This option is only used on reboot, not on other types of shutdown modes, so it is misleading. Keep the old name working for backward compatibility, but remove it from the documentation.	2019-07-23 20:29:03 +01:00
Luca Boccassi	acafd7d8a6	core: add KExecWatchdogSec option Rather than always enabling the shutdown WD on kexec, which might be dangerous in case the kernel driver and/or the hardware implementation does not reset the wd on kexec, add a new timer, disabled by default, to let users optionally enable the shutdown WD on kexec separately from the runtime and reboot ones. Advise in the documentation to also use the runtime WD in conjunction with it. Fixes: `a637d0f9ec` ("core: set shutdown watchdog on kexec too")	2019-07-23 20:29:03 +01:00
Zbigniew Jędrzejewski-Szmek	a505166845	Merge pull request #13096 from keszybz/unit-loading Preparatory work for the unit loading rework	2019-07-19 21:47:10 +02:00
Zbigniew Jędrzejewski-Szmek	5cfa33e0bc	Create src/shared/unit-file.[ch] for unit-file related ops So far we put such functions in install.[ch], but that is tied too closely to enable/disable. Let's start moving things to a place with a better name.	2019-07-19 16:51:14 +02:00
Zbigniew Jędrzejewski-Szmek	96cf3ec966	pid1: get rid of unit_supported() helper Another case where "open code" is easier to read than the helper.	2019-07-19 16:51:14 +02:00
Zbigniew Jędrzejewski-Szmek	f4c43a8115	pid1: do not say "(null)" if no disabled controllers It looks like we made a mistake. The list is just empty, that's all.	2019-07-19 16:51:14 +02:00
Zbigniew Jędrzejewski-Szmek	8d5e593146	pid1: simplify timestamp buffer declaration	2019-07-19 16:51:14 +02:00
Zbigniew Jędrzejewski-Szmek	217b7b33cc	pid1: order jobs that execute processes with lower priority We can meaningfully compare jobs for units which have cpu weight or nice set. But non-exec units don't have those set. Starting non-exec jobs first allows us to get them out of the queue quickly, and consider more jobs for starting. If we have service A, and socket B, and service C which is after socket B, and we want to start both A and C, and C has higher cpu weight, if we get B out of the way first, we'll know that we can start both A and C, and we'll start C first. Also invert the comparisons using CMP() so they are always done left vs. right, and negate when returning instead. Follow-up for `da8e178296`.	2019-07-19 14:38:52 +09:00
Luca Boccassi	a637d0f9ec	core: set shutdown watchdog on kexec too At the moment the shutdown watchdog is set only when rebooting. The set of "things that can go wrong" is not too far off when kexec'ing and in fact we have a use case where it would be useful - moving to a new kernel image.	2019-07-18 22:31:43 +02:00
Lennart Poettering	9ddaa3e459	mount: rename update_parameters_proc_self_mount_info() → update_parameters_proc_self_mountinfo() let's name the call like the file in /proc is actually called.	2019-07-18 17:03:11 +02:00
Lennart Poettering	bcce581d65	swap: scan /proc/swaps before processing waitid() results Similar to the previous commit, but for /proc/swaps, where the same logic and rationale applies.	2019-07-18 17:03:11 +02:00
Lennart Poettering	350804867d	mount: rescan /proc/self/mountinfo before processing waitid() results (The interesting bits about the what and why are in a comment in the patch, please have a look there instead of looking here in the commit msg). Fixes: #10872	2019-07-18 17:03:11 +02:00
Lennart Poettering	fcd8e119c2	mount: simplify /proc/self/mountinfo handler Our IO handler is only installed for one fd, hence there's no reason to conditionalize on it again. Also, split out the draining into a helper function of its own.	2019-07-18 17:03:10 +02:00
Michael Olbrich	da8e178296	job: make the run queue order deterministic Jobs are added to the run queue in random order. This happens because most jobs are added by iterating over the transaction or dependency hash maps. As a result, jobs that can be executed at the same time are started in a different order each time. On small embedded devices this can cause a measurable jitter for the point in time when a job starts (~100ms jitter for 10 units that are started in random order). This results in a similar jitter for the boot time. This is undesirable in general and makes optimizing the boot time a lot harder. Also, jobs that should have a higher priority because the unit has a higher CPU weight might get executed later than others. Fix this by turning the job run_queue into a Prioq and sort by the following criteria (use the next if the values are equal): - CPU weight - nice level - unit type - unit name The last one is just there for deterministic sorting to avoid any jitter.	2019-07-18 10:28:39 +02:00
Lennart Poettering	d611cfa748	core: never propagate reload failure to service result Fixes: #11238	2019-07-18 10:14:02 +09:00
Zbigniew Jędrzejewski-Szmek	043fdc4010	pid1: kill unit_file_find_dropin_paths() helper It had two users, but it is just a very thin wrapper around unit_file_find_dropin_paths(), so using it seems more complicated than directly invoking unit_file_find_dropin_paths() twice.	2019-07-17 14:27:23 +02:00
Anita Zhang	31cd5f63ce	core: ExecCondition= for services Closes #10596	2019-07-17 11:35:02 +02:00
Franck Bui	a9fd4cd120	pid1: make sure to restore correct default values for some rlimits Commit `fb39af4ce4` forgot to restore the default rlimit values (RLIMIT_NOFILE and RLIMIT_MEMLOCK) while PID1 is reloading. This patch extracts the code in charge of initializing the default values for those rlimits in order to create dedicated functions, which take care of their initialization. These functions are then called in parse_configuration() so we make sure that the default values for these rlimits get restored every time PID1 is reloading its configuration.	2019-07-17 06:24:27 +09:00
Zbigniew Jędrzejewski-Szmek	3151b668c2	Merge pull request #13076 from keszybz/pr/13062 Timer formatting fixes	2019-07-16 20:02:26 +02:00
Zbigniew Jędrzejewski-Szmek	cd87f6340f	pid1: split out another helper func for two similar code paths	2019-07-16 14:29:04 +02:00
Zbigniew Jędrzejewski-Szmek	d00a52c737	pid1: split out helper func for two similar code paths	2019-07-16 14:29:04 +02:00
Lennart Poettering	f200a3564c	Merge pull request #13063 from keszybz/cgroup-path-fixes Cgroup path fixes	2019-07-16 11:53:31 +02:00
Yu Watanabe	8cec0a5c32	tree-wide: drop duplicated blank lines ``` $ for i in */*.[ch] */*/*.[ch]; do sed -e '/^$/ {N; s/\n$//g}' -i $i; done $ git checkout HEAD -- basic/linux shared/linux ```	2019-07-15 18:41:27 +02:00
Zbigniew Jędrzejewski-Szmek	95b21cff0e	Apply empty_to_root() in three more spots for safety	2019-07-15 18:39:26 +02:00
Zbigniew Jędrzejewski-Szmek	624e4fcffa	pid1: fix GetUnitProcesses This effectively reverts one chunk of `657ee2d82b`. For a while I couldn't figure out why 'systemctl status -- -.slice' fails to list any processes...	2019-07-15 18:39:26 +02:00
Yu Watanabe	f7f196adfa	Merge pull request #13037 from poettering/shutdown-log-fixes Shutdown log fixes	2019-07-13 23:00:12 +09:00
Lennart Poettering	56e8419aa8	main: use sysctl_writef() where appropriate	2019-07-13 11:05:07 +02:00
Lennart Poettering	60cd367649	killall: bump log message about unkilled processes to LOG_WARNING By raising this, we can raise the kernel kmsg log level safely, and still see these messages.	2019-07-13 11:05:07 +02:00
Lennart Poettering	2caa38e99f	tree-wide: some more [static] related fixes let's add [static] where it was missing so far Drop [static] on parameters that can be NULL. Add an assert() around parameters that have [static] and can't be NULL hence. Add some "const" where it was forgotten.	2019-07-12 16:40:10 +02:00
Lennart Poettering	27dd6e1b12	Merge pull request #13022 from keszybz/coverity-cleanups Coverity cleanups	2019-07-12 07:37:44 +02:00
Lennart Poettering	b910cc72c0	tree-wide: get rid of strappend() It's a special case of strjoin(), so no need to keep both. In particular as typing strjoin() is even shorter than strappend().	2019-07-12 14:31:12 +09:00
Lennart Poettering	66855de739	tree-wide: make use of errno_or_else() everywhere	2019-07-11 23:20:31 +02:00
Lennart Poettering	2e8e1a1ab6	Merge pull request #12461 from Werkov/fix-job-ordering Refactor job ordering implementation (and fix cycle detection)	2019-07-11 16:43:58 +02:00
Lennart Poettering	345f322185	core: expose per-service cleaning properties on the bus, too	2019-07-11 12:18:51 +02:00
Lennart Poettering	4d3bac5645	core: expose new clean operation on the bus This adds CanClean() and Clean() as new methods on the Unit object that initiate the cleaning operation.	2019-07-11 12:18:51 +02:00
Lennart Poettering	6b7b2ed96b	core: add type of resource string table	2019-07-11 12:18:51 +02:00
Lennart Poettering	89f6fe7b30	core: hook up timer unit type with clean operation timer units maintain state on disk (the persistent touch file), hence let's expose cleaning it up generically with the new cleaning operation for units. This is a much simpler implementation than for the service unit type: instead of forking out a worker process we just remove the touch file directly. That should be OK since we only need to remove a single (empty) file, instead of a recursive user-controlled directory tree. Fixes: #4930	2019-07-11 12:18:51 +02:00
Lennart Poettering	4c2f584230	core: hook up service unit type with the new clean operation The implementation is pretty straightforward: when we get a request to clean some type of resources we fork off a process doing that, and while it is running we are in the "cleaning" state.	2019-07-11 12:18:51 +02:00
Lennart Poettering	380dc8b0a2	core: add generic "clean" operation to units This adds basic infrastructure to implement a "clean" operation for unit types. This "clean" operation is supposed to remove on-disk resources of units, and is supposed to be used in a later commit to clean our RuntimeDirectory=, StateDirectory= and so on of service units. Later commits will open this up to the bus, and hook up service units with this. This also adds a new generic ActiveState called UNIT_MAINTENANCE. It's supposed to cover all kinds of "maintenance" state of units. Specifically, this is supposed to cover the "cleaning" operations later added for service units which might take a bit of time. This high-level, generic, abstract state is called UNIT_MAINTENANCE instead of the more specific "UNIT_CLEANING", since I think this should be kept open for different operations possibly later on that could be nicely subsumed under this (for example, maybe a recursive chown()ing operation could be covered by this, and similar).	2019-07-11 12:18:51 +02:00
Zbigniew Jędrzejewski-Szmek	0121d1f28d	pid1: shorten dump output a bit by not repeating unit id twice Most units have just one name, but we'd print it twice: -> Unit systemd-sysctl.service: ... Name: systemd-sysctl.service Let's only print the "main" name once, and call the other names Aliases.	2019-07-11 11:19:19 +02:00
Lennart Poettering	261e7d9270	Merge pull request #12755 from keszybz/short-identifiers Allow using unit names in status messages	2019-07-11 00:00:51 +02:00
Lennart Poettering	08945b59d1	Merge pull request #12926 from keszybz/urlify-logs Urlify CONFIG_FILE and improve SYSTEMD_LOG_LOCATION	2019-07-11 00:00:34 +02:00
Lennart Poettering	ba40f0399e	Merge pull request #12939 from yuwata/lgtm-fixes make LGTM quiet	2019-07-10 14:57:14 +02:00
Zbigniew Jędrzejewski-Szmek	2a8f53c67b	Use unit->id instead of description in messages v2: - rename unit_identifier to unit_status_string	2019-07-10 13:35:26 +02:00
Zbigniew Jędrzejewski-Szmek	36cf45078c	Add config and kernel commandline option to use short identifiers No functional change, just docs and configuration and parsing. v2: - change ShortIdentifiers=yes|no to StatusUnitFormat=name|description.	2019-07-10 13:35:26 +02:00
Zbigniew Jędrzejewski-Szmek	c1d95b713a	pid1: tiny simplification v2: - use empty_to_root()	2019-07-10 13:35:17 +02:00
Zbigniew Jędrzejewski-Szmek	334c0979f3	pid1: fix serialization/deserialization of commands with spaces Fixes #12258. This is enough to reproduce: $ systemd-run bash -c 'sleep 10' && systemctl daemon-reload would result in Current command vanished from the unit file. We would serialize as: ExecStart 0 /usr/bin/bash /usr/bin/bash -c sleep 10000 which of course can't work. Now we serialize as ExecStart 0 /usr/bin/bash "/usr/bin/bash" "-c" "sleep 10".	2019-07-09 01:25:35 +02:00
Zbigniew Jędrzejewski-Szmek	3454129571	pid1: use monotonic timestamp in dump if realtime is not available $ systemd-analyze dump | head -3 Timestamp firmware: (null) Timestamp loader: (null) Timestamp kernel: Mon 2019-07-01 17:21:02 CEST Since this is a debugging interface, it is OK to change the output format. The user can infer what "Timestamp firmware: 123.456ms" means.	2019-07-04 22:52:25 +02:00
Yu Watanabe	4bbccb02ea	tree-wide: introduce strerror_safe()	2019-07-05 02:43:56 +09:00
Zbigniew Jędrzejewski-Szmek	1f65fd4926	basic/time-util: add helper function to check if timestamp is set No functional change.	2019-07-04 19:12:47 +02:00
Zbigniew Jędrzejewski-Szmek	62c6bbbc09	tree-wide: use PROJECT_FILE instead of __FILE__ This replaces the internal uses of __FILE__ with the new macro.	2019-07-04 10:36:00 +02:00
Zbigniew Jędrzejewski-Szmek	4ec8514142	Rename EXTRACT_QUOTES to EXTRACT_UNQUOTE Whenever I see EXTRACT_QUOTES, I'm always confused whether it means to leave the quotes in or to take them out. Let's say "unquote", like we say "cunescape".	2019-06-28 11:35:05 +02:00
Zbigniew Jędrzejewski-Szmek	cae90de3d3	Reindent some things for readability	2019-06-28 11:19:24 +02:00
Yu Watanabe	22800b473e	Merge pull request #12889 from keszybz/analyze-condition Add systemd-analyze condition	2019-06-28 02:37:20 +09:00
Zbigniew Jędrzejewski-Szmek	9266f31e61	core: skip whitespace after "|" and "!" in the condition parser We'd skip any whitespace immediately after "=", but then we'd treat whitespace that is between "|" or "!" and the value as significant. This is rather confusing, let's ignore it too.	2019-06-27 10:54:37 +02:00
Zbigniew Jędrzejewski-Szmek	edfea9fe0d	analyze: add 'condition' verb We didn't have a straightforward way to parse and evaluate those strings. Prompted by #12881.	2019-06-27 10:54:37 +02:00
Michal Koutný	dfd79eca55	core: Check transaction against execution cycles When we are validating a transaction, we take into account declared ordering between job units. However, since JOB_STOP goes always first regardless of the ordering constraint between respective units, we may detect some false cycles in the transaction which would not prevent the execution though. Use the same logic in transaction checking as we use for job execution.	2019-06-26 23:16:31 +02:00
Zbigniew Jędrzejewski-Szmek	b1d5246d29	core: do not enumerate units in MANAGER_TEST_RUN_MINIMAL mode In this mode we are not supposed to "interact with the environment", so loading all units and printing warnings about syntax errors and /var/run usage seems inappropriate.	2019-06-26 16:25:36 +02:00
Zbigniew Jędrzejewski-Szmek	48f48b8c7c	core: move assert before actual use of the variable No point in using u->id first, and doing assert(u) later. -std=c89 strikes again.	2019-06-26 16:24:48 +02:00
Michal Koutný	e602f15282	core: Extract job ordering logic The job ordering logic is spread across multiple places in the code, making it hard to maintain and also a bit hard to understand. The actual execution order of two jobs always depends on their types and the ordering constraint between their units. Extract this logic to a new function job_compare. The second change is simplification of the order evaluation, JOB_STOP takes always precedence (as documented), unless two units are both stopping, then the ordering constraint is taken into account.	2019-06-26 00:00:43 +02:00
Michal Sekletar	33fe9e3fd0	execute: drop SYNTHETIC_ERRNO because error code was received from the apply_numa_policy()	2019-06-25 21:52:28 +02:00
Joerg Behrmann	fa97f63067	core: factor root_directory application out of apply_working_directory Fixes: #12498	2019-06-25 22:53:33 +09:00
Frantisek Sumsal	a07a7324ad	core: move config_parse_* functions to a shared module Apart from making the code a little bit more clean, it should allow us to write a fuzzer around the config-parsing functions in the future	2019-06-25 22:35:02 +09:00
Lennart Poettering	2193f17c08	core: mention why we do migration for everything but ConfigurationDirectory=	2019-06-25 10:47:46 +02:00
Lennart Poettering	cf52c45d6b	core: log when we convert from DynamicUser=1 to =0 or vice versa	2019-06-25 10:47:46 +02:00
Lennart Poettering	c7e42ceb7a	Merge pull request #12869 from poettering/dynamic-user-re-migrate DynamicUser=1 state directory back migration	2019-06-25 10:06:03 +02:00
Kai Lüke	fab347489f	bpf-firewall: custom BPF programs through IP(Ingress|Egress)FilterPath= Takes a single /sys/fs/bpf/pinned_prog string as argument, but may be specified multiple times. An empty assignment resets all previous filters. Closes https://github.com/systemd/systemd/issues/10227	2019-06-25 09:56:16 +02:00
Lennart Poettering	05b2ace147	Merge pull request #12870 from yuwata/tree-wide-further-path-join-cleanups tree-wide: further path_join() and path_joina() cleanups	2019-06-25 09:27:01 +02:00
Yu Watanabe	270384b2d4	tree-wide: replace strjoina() with prefix_roota()	2019-06-25 01:31:26 +09:00
Michal Sekletar	b070c7c0e1	core: introduce NUMAPolicy and NUMAMask options Make it possible to set NUMA allocation policy for manager. Manager's policy is by default inherited by all forked off processes. However, it is possible to override the policy on per-service basis. Currently we support these policies: default, prefer, bind, interleave, local. See man 2 set_mempolicy for details on each policy. The overall NUMA policy actually consists of two parts: the policy itself and a bitmask representing the NUMA nodes where the policy is effective. Node mask can be specified using related option, NUMAMask. Default mask can be overwritten on per-service level.	2019-06-24 16:58:54 +02:00
Lennart Poettering	5c6d40d132	core: migrate service directories back from private if needed Fixes: #12131	2019-06-24 16:20:34 +02:00
Lennart Poettering	3f5b15084e	core: add missing space to DynamicUser=1 directory comment (also line break again)	2019-06-24 16:20:34 +02:00
Lennart Poettering	cd69e88ba3	doc: make clear that --system and --user only make sense with --test Fixes: #12843	2019-06-24 14:51:52 +02:00
Yu Watanabe	6abdec98f3	tree-wide: use _cleanup_ attribute and strv_consume() + TAKE_PTR()	2019-06-24 14:57:58 +09:00
Lennart Poettering	cee97d5768	Merge pull request #12836 from yuwata/tree-wide-replace-strjoin tree-wide: replace strjoin() with path_join()	2019-06-22 20:02:46 +02:00
Anita Zhang	4c1567f29a	bpf-firewall: optimization for IPAddressXYZ="any" (and unprivileged users) This is a workaround to make IPAddressDeny=any/IPAddressAllow=any work for non-root users that have CAP_NET_ADMIN. "any" was chosen since all or nothing network access is one of the most common use cases for isolation. Allocating BPF LPM TRIE maps requires CAP_SYS_ADMIN while BPF_PROG_TYPE_CGROUP_SKB only needs CAP_NET_ADMIN. In the case of IPAddressXYZ="any" we can just consistently return false/true to avoid allocating the map and limit the user to having CAP_NET_ADMIN.	2019-06-22 19:56:06 +02:00
Lennart Poettering	c6134d3e2f	path-util: get rid of prefix_root() prefix_root() is equivalent to path_join() in almost all ways, hence let's remove it. There are subtle differences though: prefix_root() will try to shorten multiple "/" before and after the prefix. path_join() doesn't do that. This means prefix_root() might return a string shorter than both its inputs combined, while path_join() never does that. I like the path_join() semantics better, hence I think dropping prefix_root() is totally OK. In the end the strings generated by both functions should always be identical in terms of path_equal() if not streq(). This leaves prefix_roota() in place. Ideally we'd have path_joina(), but I don't think we can reasonably implement that as a macro. Or maybe we can? (If so, sounds like something for a later PR.) Also add in a few missing OOM checks	2019-06-21 08:42:55 +09:00
Lennart Poettering	1e59b5455e	bpf: use more TAKE_FD()	2019-06-21 03:28:24 +09:00
Yu Watanabe	657ee2d82b	tree-wide: replace strjoin() with path_join()	2019-06-21 03:26:16 +09:00
Donald Buczek	0219b3524f	cgroup: Continue unit reset if cgroup is busy When part of the cgroup hierarchy cannot be deleted (e.g. because there are still processes in it), do not exit unit_prune_cgroup early, but continue so that u->cgroup_realized is reset. Log the known case of non-empty cgroups at debug level and other errors at warning level. Fixes https://github.com/systemd/systemd/issues/12386	2019-06-20 10:16:53 +02:00
Lennart Poettering	6e2f789484	core: set fs.file-max sysctl to LONG_MAX rather than ULONG_MAX Since kernel 5.2 the kernel thankfully returns proper errors when we write a value out of range to the sysctl. Which however breaks writing ULONG_MAX to request the maximum value. Hence let's write the new maximum value instead, LONG_MAX. /cc @brauner Fixes: #12803	2019-06-17 15:48:11 +02:00
Philip Withnall	226a08f28f	service: Fix typo in warning message The directive is `RuntimeMaxSec=`, not `MaxRuntimeSec=`. Signed-off-by: Philip Withnall <withnall@endlessm.com>	2019-06-12 10:39:51 +01:00
Chris Down	c710d3b430	cgroup: Prevent theoretical nullptr deref in unit mask calculation	2019-06-07 06:33:53 +01:00
Chris Down	eab5049520	Merge pull request #11778 from anitazha/rfe_11654_dbus core: add ExecStartXYZEx= with dbus support for executable prefixes	2019-06-05 10:02:00 +01:00
Zbigniew Jędrzejewski-Szmek	f140ed02f7	Silence warning about BPF firewall in containers We'd get a warning on every nspawn invocation: dev-hugepages.mount: unit configures an IP firewall, but the local system does not support BPF/cgroup firewalling. (This warning is only shown for the first unit using IP firewalling.) Before the previous commit, I'd generally get a warning about systemd-udev.service, even though that service is not started in containers. But there are still many other units which declare a firewall, which is currently unsupported in containers. Let's stop warning about this. The warning is still emitted e.g. if legacy cgroups are used. This is something that can be configured, so it makes more sense to emit the warning.	2019-06-04 17:22:37 +02:00
Zbigniew Jędrzejewski-Szmek	84d2744bc5	Move warning about unsupported BPF firewall right before the firewall would be created There's no need to warn about the firewall when parsing, because the unit might not be started at all. Let's warn only when we're actually preparing to start the firewall. This changes behaviour: - the warning is printed just once for all unit types, and not once for normal units and once for transient units. - on repeat warnings, the message is not printed at all. There's already detailed debug info from bpf_firewall_compile(), so we don't need to repeat ourselves. - when we are not root, let's say precisely that, not "lack of necessary privileges" and "the local system does not support BPF/cgroup firewalling". Fixes #12673.	2019-06-04 17:22:37 +02:00
Michal Sekletar	e7fca352ba	execute: dump CPUAffinity as a range string instead of a list of CPUs We do this already when printing the property in systemctl so be consistent and do the same for systemd-analyze dump.	2019-06-03 15:21:52 +02:00
Michal Sekletar	75e40119a4	dbus-execute: make transfer of CPUAffinity endian safe (#12711) We store the affinity mask in the native endian. However, over D-Bus we must transfer the mask in little endian byte order. This is the second part of `c367f996f5`.	2019-05-31 15:23:23 +02:00
Anita Zhang	b3d593673c	core: add ExecStartXYZEx= with dbus support for executable prefixes Closes #11654	2019-05-30 20:41:42 -07:00
Michal Sekletar	3f09629c22	Merge pull request #12628 from keszybz/dbus-execute Rework cpu affinity parsing	2019-05-30 12:32:53 +02:00
Michal Sekletar	c367f996f5	shared/cpu-set-util: make transfer of cpu_set_t over bus endian safe	2019-05-29 16:12:23 +02:00
Zbigniew Jędrzejewski-Szmek	fb39af4ce4	pid1: when reloading configuration, forget old settings If we had a configuration setting from a configuration file, and it was removed, we'd still remember the old value, because there was no mechanism to "reset" everything, just to assign new values. Note that the effect of this is limited. For settings that have an "ongoing" effect, like systemd.confirm_spawn, the new value is simply used. But some settings can only be set at start. In particular, CPUAffinity= will be updated if set to a new value, but if CPUAffinity= is fully removed, it will not be reset, simply because we don't know what to reset it to. We might have inherited a setting, or we might have set it ourselves. In principle we could remember the "original" value that was set when we were executed, and propagate this over reloads and reexecs, but that would be a lot of work for little gain. So this corner case of removal of CPUAffinity= is not handled fully, and a reboot is needed to execute the change. As a work-around, a full mask of CPUAffinity=0-8191 can be specified.	2019-05-29 10:29:28 +02:00
Zbigniew Jędrzejewski-Szmek	470a5e6dce	pid1: don't reset setting from /proc/cmdline upon restart We have settings which may be set in the configuration file, and also in /proc/cmdline (for pid1). The settings in /proc/cmdline have higher priority of course. When a reload was done, we'd reload just the configuration file, losing the overrides. So read /proc/cmdline again during reload. Also, when initially reading the configuration file when program starts, don't treat any errors as fatal. The configuration done in there doesn't seem important enough to refuse boot.	2019-05-29 10:29:28 +02:00
Zbigniew Jędrzejewski-Szmek	61fbbac1d5	pid1: parse CPUAffinity= in incremental fashion This makes the handling of this option match what we do in unit files. I think consistency is important here. (As it happens, it is the only option in system.conf that is "non-atomic", i.e. where there's a list of things which can be split over multiple assignments. All other options are single-valued, so there's no issue of how to handle multiple assignments.)	2019-05-29 10:29:28 +02:00
Zbigniew Jędrzejewski-Szmek	0985c7c4e2	Rework cpu affinity parsing The CPU_SET_S api is pretty bad. In particular, it has a parameter for the size of the array, but operations which take two (CPU_EQUAL_S) or even three arrays (CPU_{AND,OR,XOR}_S) still take just one size. This means that all arrays must be of the same size, or buffer overruns will occur. This is exactly what our code would do, if it received an array of unexpected size over the network. ("Unexpected" here means anything different from what cpu_set_malloc() detects as the "right" size.) Let's rework this, and store the size in bytes of the allocated storage area. The code will now parse any number up to 8191, independently of what the current kernel supports. This matches the kernel maximum setting for any architecture, to make things more portable. Fixes #12605.	2019-05-29 10:20:42 +02:00
Lennart Poettering	1802d5f2cf	terminal-util: reset access mode in vt_restore(), too Only changing ownership back to root is not enough; we also need to change the access mode, otherwise the user might have set 666 first, and thus allow everyone access before and after the chown().	2019-05-24 15:07:55 +02:00
Lennart Poettering	4b3b5bc71b	tree-wide: port various places over to use chmod_and_chown() Doing this properly is hard, hence let's unify the code.	2019-05-24 15:07:55 +02:00
Lennart Poettering	ccc16c7842	core: prefer SCMP_ACT_KILL_PROCESS for SystemCallFilter= behaviour If we have it, use it. It makes a ton more sense. Fixes: #11967	2019-05-24 10:48:28 +02:00
Lennart Poettering	05332e243c	Merge pull request #12590 from keszybz/unicode-cmdlines Use unicode for cmdline printing	2019-05-24 10:41:30 +02:00
Lennart Poettering	93d70b6cf2	Merge pull request #12631 from keszybz/doc-and-error-message-tweaks Doc and error message tweaks	2019-05-22 19:00:10 +02:00
Zbigniew Jędrzejewski-Szmek	7cc5ef5f18	pid1: improve message when setting up namespace fails I covered the most obvious paths: those where there's a clear problem with a path specified by the user. Prints something like this (at error level): May 21 20:00:01.040418 systemd[125871]: bad-workdir.service: Failed to set up mount namespacing: /run/systemd/unit-root/etc/tomcat9/Catalina: No such file or directory May 21 20:00:01.040456 systemd[125871]: bad-workdir.service: Failed at step NAMESPACE spawning /bin/true: No such file or directory Fixes #10972.	2019-05-22 16:28:02 +02:00
Zbigniew Jędrzejewski-Szmek	9d48671c62	core: unset HOME=/ that the kernel gives us Partially fixes #12389. %h would return "/" in a machine, but "/root" in a container. Let's fix this by resetting $HOME to the expected value.	2019-05-22 16:28:02 +02:00
Zbigniew Jędrzejewski-Szmek	09c1dceef1	basic/process-util: convert bool arg to flags In preparation for the next commit…	2019-05-22 10:15:49 +02:00
Zbigniew Jędrzejewski-Szmek	bc28751ed2	Rework cmdline printing to use unicode The functions to retrieve and print process cmdlines were based on the assumption that they contain printable ASCII, and everything else should be filtered out. That assumption doesn't hold in today's world, where people are free to use unicode everywhere. This replaces the custom cmdline reading code with a more generic approach using utf8_escape_non_printable_full(). For kernel threads, truncation is done on the parenthesized name, so we'll get "[worker]", "[worker…]", …, "[w…]", "[…", "…" as we reduce the number of available columns. This implementation is most likely slower for very long cmdlines, but I don't think this is very important. The common case is to have short commandlines, and should print those properly. Absurdly long cmdlines are the exception, which needs to be handled correctly and safely, but speed is not too important. Fixes #12532. v2: - use size_t for the number of columns. This change propagates into various other functions that call get_process_cmdline(), increasing the size of the patch, but the changes are rather trivial.	2019-05-22 10:08:17 +02:00
Lennart Poettering	3aa317943c	Merge pull request #12626 from keszybz/oompolicy-check Make the check if oom-killer fired more robust	2019-05-21 18:29:01 +02:00
Zbigniew Jędrzejewski-Szmek	a832893f9c	shared/cpu-set-util: move the part to print cpu-set into a separate function Also avoid unnecessary asprintf() when we can write to the output area directly.	2019-05-21 08:44:03 +02:00
Zbigniew Jędrzejewski-Szmek	bd0abfaea1	core/dbus-execute: remove unnecessary initialization	2019-05-21 08:42:28 +02:00
Zbigniew Jędrzejewski-Szmek	2ba6ae6b2b	core: do an extra check if oom was triggered when handling sigchild Should fix #12425.	2019-05-20 16:37:06 +02:00
Zbigniew Jędrzejewski-Szmek	569554d9e5	core/service: drop {}	2019-05-20 16:37:06 +02:00
Topi Miettinen	0a51b45ce4	small fixes: make get_process_state() static and fix typo	2019-05-20 16:23:22 +02:00
David Tardon	525b95f10e	timer: simplify computation of unit activation time	2019-05-18 16:58:27 +02:00
Michael Biebl	dadc7f2e43	meson: stop creating .wants directories for {multi-user,getty}.target (#12569) Since preset is supposed to be used to enable the services, there is no need to pre-create those directories either. Follow-up for #12164	2019-05-17 08:02:45 +02:00
Zbigniew Jędrzejewski-Szmek	1d3fe304fd	Use sd_event_source_disable_unref()	2019-05-10 16:55:37 +02:00
Chris Down	22bf131be2	cgroup: Support 0-value for memory protection directives These make sense to be explicitly set at 0 (which has a different effect than the default, since it can affect processing of `DefaultMemoryXXX`). Without this, it's not easily possible to relinquish memory protection for a subtree, which is not great.	2019-05-08 12:06:32 +01:00
Chris Down	7e7223b3d5	cgroup: Readd some plumbing for DefaultMemoryMin Somehow these got lost in the previous PR, rendering DefaultMemoryMin not very useful.	2019-05-08 12:06:32 +01:00
Lennart Poettering	adb7b782f8	Merge pull request #12218 from keszybz/use-libmount-more Use libmount more	2019-04-30 19:44:17 +02:00
Lennart Poettering	0892f3f999	Merge pull request #12420 from mrc0mmand/coccinelle-tweaks Coccinelle improvements	2019-04-30 11:37:19 +02:00
Frantisek Sumsal	ed0cb34682	tree-wide: code improvements suggested by Coccinelle	2019-04-30 09:39:07 +02:00
Ben Boeckel	5238e95759	codespell: fix spelling errors	2019-04-29 16:47:18 +02:00
Lennart Poettering	d8974757c4	Merge pull request #12407 from keszybz/two-unrelated-cleanups Two unrelated cleanups	2019-04-26 23:43:27 +02:00
Lennart Poettering	85318688cc	chown-recursive: also check mode before we bypass	2019-04-26 08:31:08 +02:00
Zbigniew Jędrzejewski-Szmek	c5b7ae0edb	Merge pull request #12074 from poettering/io-acct expose IO stats on the bus and in "systemctl status" and "systemd-run --wait"	2019-04-25 11:59:37 +02:00
Zbigniew Jędrzejewski-Szmek	c5322608a5	core: adjust unit_get_ancestor_memory_{low,min}() to work with units which don't have a CGroupContext Coverity doesn't like the fact that unit_get_cgroup_context() returns NULL for unit types that don't have a CGroupContext. We don't expect to call those functions with such unit types, so this isn't an immediate problem, but we can make things more robust by handling this case. CID #1400683, #1400684.	2019-04-25 11:13:02 +02:00
Zbigniew Jędrzejewski-Szmek	b6411f716c	Merge pull request #12332 from cdown/default_min cgroup: Add support for propagation of memory.min	2019-04-25 11:06:45 +02:00
Jan Klötzke	99b43caf26	core: immediately trigger watchdog action on WATCHDOG=trigger A service might be able to detect errors by itself that may require the system to take the same action as if the service locked up. Add a WATCHDOG=trigger state change notification to sd_notify() to let the service manager know about the self-detected misery and instantly trigger the configured watchdog behaviour.	2019-04-24 10:17:10 +02:00
Zbigniew Jędrzejewski-Szmek	e2857b3d87	Add helper function for mnt_table_parse_{stream,mtab} This wraps a few common steps. It is defined as inline function instead of in a .c file to avoid having a .c file. With a .c file, we would have three choices: - either link it into libshared, but then libshared would have to be linked to libmount. - or compile the .c file into each target separately. This has the disadvantage that configuration of every target has to be updated and stuff will be compiled multiple times anyway, which is not too different from keeping this in the header file. - or create a new convenience library just for this. This also has the disadvantage that every target would have to be updated, and a separate library for a 10 line function seems overkill. By keeping everything in a header file, we compile this a few times, but otherwise it's the least painful option. The compiler can optimize most of the function away, because it knows if 'source' is set or not.	2019-04-23 23:29:29 +02:00
Zbigniew Jędrzejewski-Szmek	13dcfe4661	shared/mount-util: convert to libmount It seems better to use just a single parsing algorithm for /proc/self/mountinfo. Also, unify the naming of variables in all places that use mnt_table_next_fs(). It makes it easier to compare the different call sites.	2019-04-23 23:29:29 +02:00
Anita Zhang	25cc30c4c8	core: support DisableControllers= for transient units	2019-04-22 11:52:08 -07:00
Chris Down	7ad5439e06	unit: Add DefaultMemoryMin	2019-04-16 18:45:04 +01:00
Chris Down	6264b85e92	cgroup: Create UNIT_DEFINE_ANCESTOR_MEMORY_LOOKUP This is in preparation for creating unit_get_ancestor_memory_min.	2019-04-16 18:39:51 +01:00
Yu Watanabe	dcab85be18	core: do not show TimeoutStopSec= in dump message if it is not set	2019-04-14 20:47:13 +09:00
Yu Watanabe	9c79f0e0a0	core: add assertion in two inline functions	2019-04-14 20:46:24 +09:00
Yu Watanabe	3bf0cb65f5	core: use BUS_DEFINE_PROPERTY_GET() macro at more places	2019-04-14 20:45:31 +09:00
Yu Watanabe	54c1a6ab8c	core: change type of Service::timeout_abort_set to bool Follow-up for `dc653bf487` (#11211).	2019-04-14 20:13:47 +09:00
Jan Klötzke	dc653bf487	service: handle abort stops with dedicated timeout When shooting down a service with SIGABRT the user might want to have a much longer stop timeout than on regular stops/shutdowns. Especially in the face of short stop timeouts the time might not be sufficient to write huge core dumps before the service is killed. This commit adds a dedicated (Default)TimeoutAbortSec= timer that is used when stopping a service via SIGABRT. In all other cases the existing TimeoutStopSec= is used. The timer value is unset by default to skip the special handling and use TimeoutStopSec= for state 'stop-watchdog' to keep the old behaviour. If the service is in state 'stop-watchdog' and the service should be stopped explicitly we still go to 'stop-sigterm' and re-apply the usual TimeoutStopSec= timeout.	2019-04-12 17:32:52 +02:00
Chris Down	c52db42b78	cgroup: Implement default propagation of MemoryLow with DefaultMemoryLow In cgroup v2 we have protection tunables -- currently MemoryLow and MemoryMin (there will be more in future for other resources, too). The design of these protection tunables requires not only intermediate cgroups to propagate protections, but also the units at the leaf of that resource's operation to accept it (by setting MemoryLow or MemoryMin). This makes sense from a low-level API design perspective, but it's a good idea to also have a higher-level abstraction that can, by default, propagate these resources to children recursively. In this patch, this happens by having descendants set memory.low to N if their ancestor has DefaultMemoryLow=N -- assuming they don't set a separate MemoryLow value. Any affected unit can opt out of this propagation by manually setting `MemoryLow` to some value in its unit configuration. A unit can also stop further propagation by setting `DefaultMemoryLow=` with no argument. This removes further propagation in the subtree, but has no effect on the unit itself (for that, use `MemoryLow=0`). Our use case in production is simplifying the configuration of machines which heavily rely on memory protection tunables, but currently require tweaking a huge number of unit files to make that a reality. This directive makes that significantly less fragile, and decreases the risk of misconfiguration. After this patch is merged, I will implement DefaultMemoryMin= using the same principles.	2019-04-12 17:23:58 +02:00
Lennart Poettering	bc40a20ebe	core: include IO data in per-unit resource log msg	2019-04-12 14:25:44 +02:00
Lennart Poettering	fbe14fc9a7	croup: expose IO accounting data per unit This was the last kind of accounting still not exposed for each unit. Let's fix that. Note that this is a relatively simplistic approach: we don't expose per-device stats, but sum them all up, much like cgtop does. This kind of metric is probably the most interesting for most usecases, and covers the "systemctl status" output best. If we want per-device stats one day we can of course always add that eventually.	2019-04-12 14:25:44 +02:00
Lennart Poettering	83f18c91d0	core: use string_table_lookup() at more places	2019-04-12 14:25:44 +02:00
Lennart Poettering	9b2559a13e	core: add new call unit_reset_accounting() It's a simple wrapper for resetting both IP and CPU accounting in one go. This will become particularly useful when we also need this to reset IO accounting (to be added in a later commit).	2019-04-12 14:25:44 +02:00
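
Commit 99b43caf26 above adds a WATCHDOG=trigger state change notification to sd_notify(). As a rough illustration of how a service might send it, here is a minimal Python sketch that writes the state string as a datagram to the socket named in $NOTIFY_SOCKET, which is how the notification protocol is commonly spoken without libsystemd. The helper names `notify` and `report_self_detected_failure` are hypothetical and not part of systemd; only the `WATCHDOG=trigger` string comes from the commit message.

```python
import os
import socket


def notify(state: str) -> bool:
    """Send one sd_notify-style state string to the service manager.

    Returns False when NOTIFY_SOCKET is unset, i.e. when the process was
    not started by a manager that listens for notifications.
    """
    path = os.environ.get("NOTIFY_SOCKET")
    if not path:
        return False
    # A leading "@" denotes a Linux abstract-namespace socket, which is
    # addressed with a leading NUL byte instead.
    if path.startswith("@"):
        addr = b"\0" + path[1:].encode()
    else:
        addr = path
    with socket.socket(socket.AF_UNIX, socket.SOCK_DGRAM) as sock:
        sock.sendto(state.encode("utf-8"), addr)
    return True


def report_self_detected_failure() -> bool:
    # Hypothetical helper: ask the manager to run the configured
    # watchdog behaviour right away, as described in the commit message.
    return notify("WATCHDOG=trigger")
```

A service would call `report_self_detected_failure()` from its own error path. Whether anything happens then depends on the watchdog configuration of the unit, which this sketch does not cover.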

5304 commits