Systemd

Author	SHA1	Message	Date
Lennart Poettering	30874dda3a	dev-setup: generalize logic we use to create "inaccessible" device nodes Let's generalize this, so that we can use this in nspawn later on, which is pretty useful as we need to be able to mask files from the inner child of nspawn too, where the host's /run/systemd/inaccessible directory is not visible anymore. Moreover, if nspawn can create these nodes on its own before the payload this means the payload can run with fewer privileges.	2018-11-29 20:21:40 +01:00
Lennart Poettering	d5aecba6e0	cgroup: use device_path_parse_major_minor() also for block device paths Not only when we populate the "devices" cgroup controller we need major/minor numbers, but for the io/blkio one it's the same, hence let's use the same logic for both.	2018-11-29 20:21:39 +01:00
Lennart Poettering	846b3bd61e	stat-util: add new APIs device_path_make_{major_minor|canonical}() and device_path_parse_major_minor() device_path_make_{major_minor|canonical}() generate device node paths given a mode_t and a dev_t. We have similar code all over the place, let's unify this in one place. The former will generate a "/dev/char/" or "/dev/block" path, and never go to disk. The latter then goes to disk and resolves that path to the actual path of the device node. device_path_parse_major_minor() reverses device_path_make_major_minor(), also without going to disk. We have similar code doing something like this at various places, let's unify this in a single set of functions. This also allows us to teach them special tricks, for example handling of the /run/systemd/inaccessible/{blk|chr} device nodes, which we use for masking device nodes, and which do not exist in /dev/char/* and /dev/block/*	2018-11-29 20:21:39 +01:00
Lennart Poettering	8e8b5d2e6d	cgroups: beef up DeviceAllow= syntax a bit Previously we'd allow pattern expressions such as "char-input" to match all input devices. Internally, this would look up the right major to test in /proc/devices. With this commit the syntax is slightly extended: - "char-*" can be used to match any kind of character device, and similarly "block-*". This expression would work previously already, but instead of actually installing a wildcard match it would install many individual matches for everything listed in /proc/devices. - "char-<MAJOR>" with "<MAJOR>" being a numerical parameter works now too. This allows clients to install whitelist items by specifying the major directly. The main reason to add these is to provide limited compat support for clients that for some reason contain whitelists with major/minor numbers (such as OCI containers).	2018-11-29 20:21:39 +01:00
Lennart Poettering	74c48bf5a8	core: add special handling for devices cgroup allow lists for /dev/block/* and /dev/char/* device nodes This adds some code to handle /dev/block/* and /dev/char/* device node paths specially: instead of actually stat()ing them we'll just parse the major/minor numbers from the name. This is a useful 'hack' to allow clients to install whitelists for devices that don't actually have to exist. Also, let's similarly handle /run/systemd/inaccessible/{blk|chr}. This allows us to simplify our built-in default whitelist to not require an "ignore_enoent" mode for these nodes. In general we should be careful with hardcoding major/minor numbers, but in this case this should be safe.	2018-11-29 20:03:56 +01:00
Zbigniew Jędrzejewski-Szmek	8b4e51a60e	Merge pull request #10797 from poettering/run-generator add new "systemd-run-generator" for running arbitrary commands from the kernel command line as system services using the "systemd.run=" kernel command line switch	2018-11-28 22:40:55 +01:00
Yu Watanabe	50ae773f85	Merge pull request #10970 from yuwata/from-name-return-negative-errno util: make *_from_name() return negative errno on error	2018-11-29 03:18:03 +09:00
Yu Watanabe	acf4d15893	util: make *_from_name() return negative errno on error	2018-11-28 20:20:50 +09:00
Lennart Poettering	b4525804a1	core: USB function properties do not change dynamically, don't claim so This reduces our PropertiesChanged signals a bit in size as we don't keep blasting out properties that cannot change anyway all the time.	2018-11-28 10:29:51 +01:00
Lennart Poettering	4917894417	Merge pull request #10944 from poettering/redirect-file-fix StandardOutput=file: fixes	2018-11-27 13:18:26 +01:00
Zbigniew Jędrzejewski-Szmek	6fa158f55c	Merge pull request #10902 from poettering/highlight-status Highlight status	2018-11-27 12:53:43 +01:00
Lennart Poettering	41fc585a7a	core: be more careful when inheriting stdout fds to stderr We need to compare the fd name/file name if we inherit an fd from stdout to stderr. Let's do that. Fixes: #10875	2018-11-27 10:06:51 +01:00
Lennart Poettering	1704fba92f	dbus-execute: generate the correct transient unit setting	2018-11-27 10:06:50 +01:00
Lennart Poettering	dbe6c4b657	dbus-execute: fix indentation	2018-11-27 10:06:50 +01:00
Lennart Poettering	922ce049d1	core: drop references to 'StandardOutputFileToCreate' This property never existed, let's drop any reference to it.	2018-11-27 10:06:50 +01:00
Lennart Poettering	7af67e9a8b	core: allow to set exit status when using SuccessAction=/FailureAction=exit in units This adds SuccessActionExitStatus= and FailureActionExitStatus= that may be used to configure the exit status to propagate when SuccessAction=exit or FailureAction=exit is used. When not specified let's also propagate the exit status of the main process we fork off for the unit.	2018-11-27 09:44:40 +01:00
Lennart Poettering	78f93209fc	core: when Delegate=yes is set for a unit, run ExecStartPre= and friends in a subcgroup of the unit Otherwise we might conflict with the "no-processes-in-inner-cgroup" rule of cgroupsv2. Consider nspawn starting up and initializing its cgroup hierarchy with "supervisor/" and "payload/" as subcgroups, with itself moved into the former and the payload into the latter. Now, if an ExecStartPre= is run right after, it cannot be placed in the main cgroup, because that is now an inner cgroup with populated children. Hence, let's run these helpers in another sub-cgroup .control/ below it. This is somewhat ugly since it weakens the clear separation of ownership, but given that this is an explicit contract and double opt-in, it should be acceptable. Fixes: #10482	2018-11-26 18:43:23 +01:00
Lennart Poettering	5b262f74e4	unit: tweak status output a bit Let's highlight the unit description string in the status updates, to separate them a bit more from the English sentence they are part of, and thus make the different casing less surprising.	2018-11-26 18:24:12 +01:00
Lennart Poettering	ccfc08d4bc	show-status: use free_and_replace() where we can	2018-11-26 18:24:12 +01:00
Lennart Poettering	a885727a64	show-status: fold two bool flags function arguments into a flags parameter	2018-11-26 18:24:12 +01:00
Yu Watanabe	938dbb292a	Merge pull request #10901 from poettering/startswith-list add new STARTSWITH_SET() macro	2018-11-26 22:40:51 +09:00
Lennart Poettering	9630d4dd68	Merge pull request #10894 from poettering/root-cgroup-fix A multitude of cgroup fixes	2018-11-26 14:13:01 +01:00
Lennart Poettering	da9fc98ded	tree-wide: port more code over to PATH_STARTSWITH_SET()	2018-11-26 14:08:46 +01:00
Lennart Poettering	49fe5c0996	tree-wide: port various places over to STARTSWITH_SET()	2018-11-26 14:08:46 +01:00
Lennart Poettering	b8b6f32104	cgroup: when we unload a unit, also update all its parent's members mask This way we can correctly ensure that when a unit that requires some controller goes away, we propagate the removal of it all the way up, so that the controller is turned off in all the parents too.	2018-11-23 13:41:37 +01:00
Lennart Poettering	5af8805872	cgroup: drastically simplify caching of cgroups members mask Previously we tried to be smart: when a new unit appeared and it only added controllers to the cgroup mask we'd update the cached members mask in all parents by ORing in the controller flags in their cached values. Unfortunately this was quite broken, as we missed some conditions when this cache had to be reset (for example, when a unit got unloaded), moreover the optimization doesn't work when a controller is removed anyway (as in that case there's no other way for the parent than to iterate through all children to check whether any other, remaining child unit still needs it). Hence, let's simplify the logic substantially: instead of updating the cache on the right events (which we didn't get right), let's simply invalidate the cache, and generate it lazily when we encounter it later. This should actually result in better behaviour as we don't have to calculate the new members mask for a whole subtree whenever we have the suspicion something changed, but can delay it to the point where we actually need the members mask. This allows us to simplify things quite a bit, which is good, since validating this cache for correctness is hard enough. Fixes: #9512	2018-11-23 13:41:37 +01:00
Lennart Poettering	8a0d538815	cgroup: extend comment on what unit_release_cgroup() is for	2018-11-23 13:41:37 +01:00
Lennart Poettering	1fd3a10c38	cgroup: extend reasons when we realize the enable mask After creating a cgroup we need to initialize its "cgroup.subtree_control" file with the controllers its children want to use. Currently we do so whenever the mkdir() on the cgroup succeeded, i.e. when we know the cgroup is "fresh". Let's update the condition slightly that we also do so when internally we assume a cgroup doesn't exist yet, even if it already does (maybe left-over from a previous run). This shouldn't change anything IRL but make things a bit more robust.	2018-11-23 13:41:37 +01:00
Lennart Poettering	d5095dcd30	cgroup: tighten call that detects whether we need to realize a unit's cgroup a bit, and comment why	2018-11-23 13:41:37 +01:00
Lennart Poettering	5a62e5e2ac	cgroup: document what the various masks variables are used for	2018-11-23 13:41:37 +01:00
Lennart Poettering	27c4ed790a	cgroup: simplify check whether it makes sense to realize a cgroup	2018-11-23 13:41:37 +01:00
Lennart Poettering	e00068e71f	cgroup: in unit_invalidate_cgroup() actually modify invalidation mask Previously this would manipulate the realization mask for invalidating the realization. This is a bit ugly though as the realization mask's primary purpose is to reflect in which hierarchies a cgroup currently exists, and it's probably a good idea to keep that in sync with realities. We nowadays have an explicit field for invalidating cgroup controller information, the "cgroup_invalidated_mask", let's use this one instead. The effect is pretty much the same, as the main consumer of these masks (unit_has_mask_realize()) checks both anyway.	2018-11-23 13:41:37 +01:00
Lennart Poettering	27adcc9737	cgroup: be more careful with which controllers we can enable/disable on a cgroup This changes cg_enable_everywhere() to return which controllers are enabled for the specified cgroup. This information is then used to correctly track the enablement mask currently in effect for a unit. Moreover, when we try to turn off a controller, and this works, then this indicates that the parent unit might successfully turn it off now, too, as our unit might have kept it busy. So far, when realizing cgroups, i.e. when syncing up the kernel representation of relevant cgroups with our own idea we would strictly work from the root to the leaves. This is generally a good approach, as when controllers are enabled this has to happen in root-to-leaves order. However, when controllers are disabled this has to happen in the opposite order: in leaves-to-root order (this is because controllers can only be enabled in a child if it is already enabled in the parent, and if it shall be disabled in the parent then it has to be disabled in the child first, otherwise it is considered busy when it is attempted to remove it in the parent). To make things complicated when invalidating a unit's cgroup membership systemd can actually turn off some controllers previously turned on at the very same time as it turns on other controllers previously turned off. In such a case we have to work up leaves-to-root and root-to-leaves right after each other. With this patch this is implemented: we still generally operate root-to-leaves, but as soon as we notice we successfully turned off a controller previously turned on for a cgroup we'll re-enqueue the cgroup realization for all parents of a unit, thus implementing leaves-to-root where necessary.	2018-11-23 13:41:37 +01:00
Zbigniew Jędrzejewski-Szmek	e5e0a79623	pid1,sd-device: use PATH_STARTSWITH_SET more	2018-11-23 13:37:47 +01:00
Lennart Poettering	26a17ca280	cgroup: add explanatory comment	2018-11-23 12:24:37 +01:00
Lennart Poettering	442ce7759c	cgroup: units that aren't loaded properly should not result in cgroup controllers being pulled in This shouldn't make much difference in real life, but is a bit cleaner.	2018-11-23 12:24:37 +01:00
Lennart Poettering	0adf88b68c	cgroup: dump delegation mask too	2018-11-23 12:24:37 +01:00
Lennart Poettering	1649244588	cgroup: make unit_get_needs_bpf_firewall() static too	2018-11-23 12:24:37 +01:00
Lennart Poettering	53aea74a60	cgroup: make some functions static	2018-11-23 12:24:37 +01:00
Lennart Poettering	52fecf20b9	cgroup: fine tune when to apply cgroup attributes to the root cgroup Let's tweak when precisely to apply cgroup attributes on the root cgroup. With this we now follow the following rules: 1. On cgroupsv2 we never apply any regular cgroup attributes to the host root, since the attributes generally do not exist there. 2. On cgroupsv1 we do not apply any "weight" or "shares" style attributes to the host root cgroup, since they don't make much sense on the top level where there's only one group, hence no need to compare weights against each other. The other attributes are applied to the host root cgroup however. 3. In any case we don't apply attributes to the root of container environments (and --user roots), under the assumption that this is managed by the manager further up. (Note that on cgroupsv2 this is even enforced by the kernel) 4. BPF pseudo-attributes are applied in all cases (since we can have as many of them as we want)	2018-11-23 12:24:37 +01:00
Lennart Poettering	589a5f7a38	cgroup: append \n to static strings we write to cgroup attributes This is a bit cleaner since when we format numeric limits we append it. And this way write_string_file() doesn't have to append it.	2018-11-23 12:24:37 +01:00
Lennart Poettering	28cfdc5aeb	cgroup: tighten manager_owns_host_root_cgroup() a bit This tightening is not strictly necessary (as the m->cgroup_root check further down does the same), but let's make this explicit.	2018-11-23 12:24:37 +01:00
Lennart Poettering	611c4f8afb	cgroup: rename {manager_owns|unit_has}_root_cgroup() → .._host_root_cgroup() Let's emphasize that this function checks for the host root cgroup, i.e. returns false for the root cgroup when we run in a container where CLONE_NEWCGROUP is used. There has been some confusion around this already, for example cgroup_context_apply() uses the function incorrectly (which we'll fix in a later commit). Just some refactoring, no change in behaviour.	2018-11-23 12:24:37 +01:00
Lennart Poettering	293d32df39	cgroup: add a common routine for writing to attributes, and logging about it We can use this at quite a few places, and this allows us to shorten our code quite a bit.	2018-11-23 12:24:37 +01:00
Lennart Poettering	39b9fefb2e	cgroup: add a new macro for determining log level for cgroup attr write failures For now, let's use it only at one place, but a follow-up commit will make more use of it.	2018-11-23 12:24:37 +01:00
Lennart Poettering	2c74e12bb3	cgroup: ignore EPERM for a couple of more attribute writes	2018-11-23 12:24:37 +01:00
Lennart Poettering	8c83840772	cgroup: add comment explaining why we ignore EINVAL at two places These are just copies from further down.	2018-11-23 12:24:37 +01:00
Lennart Poettering	73fe5314bf	cgroup: suffix settings with "=" in log messages where appropriate	2018-11-23 12:24:37 +01:00
Lennart Poettering	a0c339ed4b	cgroup: only install cgroup release agent when we own the root cgroup If we run in a container we shouldn't patch around this, and most likely we can't anyway, and there's not much point in complaining about this. Hence let's strictly say: the agent is private property of the host's system instance, nothing else.	2018-11-23 12:24:37 +01:00
Lennart Poettering	de8a711a58	cgroup: use structured initialization	2018-11-23 12:24:37 +01:00

4532 commits