Systemd

Author	SHA1	Message	Date
Yu Watanabe	93bab28895	tree-wide: use typesafe_qsort()	2018-09-19 08:02:52 +09:00
Franck Bui	03d0f4b58e	nspawn: always use mode 555 for /sys When a network namespace is needed, /sys is mounted as tmpfs (see commit `d8fc6a000f` for details). But in this case mode 755 was used as initial permissions for /sys whereas the default mode for sysfs is 555. In practice using 755 doesn't have any impact because /sys is mounted read-only too but for consistency, let's use the correct mode. Fixes: #10050	2018-09-11 00:34:00 +02:00
Luke Shumaker	677a72cd3e	nspawn: mount_sysfs(): Unconditionally mkdir /sys/fs/cgroup Currently, mount_sysfs() only creates /sys/fs/cgroup if cg_ns_supported(). The comment explains that we need to "Create mountpoint for cgroups. Otherwise we are not allowed since we remount /sys read-only."; that is: that we need to do it now, rather than later. However, the comment doesn't do anything to explain why we only need to do this if cg_ns_supported(); shouldn't we _always_ need to do it? The answer is that if !use_cgns, then this was already done by the outer child, so mount_sysfs() only needs to do it if use_cgns. Now, mount_sysfs() doesn't know whether use_cgns, but !cg_ns_supported() implies !use_cgns, so we can optimize" the case where we _know_ !use_cgns, and deal with a no-op mkdir_p() in the false-positive where cgns_supported() but !use_cgns. But is it really much of an optimization? We're potentially spending an access(2) (cg_ns_supported() could be cached from a previous call) to potentially save an lstat(2) and mkdir(2); and all of them are on virtual fileystems, so they should all be pretty cheap. So, simplify and drop the conditional. It's a dubious optimization that requires more text to explain than it's worth.	2018-07-20 12:12:03 -04:00
Luke Shumaker	0402948206	nspawn: Move cgroup mount stuff from nspawn-mount.c to nspawn-cgroup.c	2018-07-20 12:12:02 -04:00
Luke Shumaker	2fa017f169	nspawn: Simplify tmpfs_patch_options() usage, and trickle that up One of the things that tmpfs_patch_options does is take an (optional) UID, and insert "uid=${UID},gid=${UID}" into the options string. So we need a uid_t argument, and a way of telling if we should use it. Fortunately, that is built in to the uid_t type by having UID_INVALID as a possible value. So this is really a feature that requires one argument. Yet, it is somehow taking 4! That is absurd. Simplify it to only take one argument, and have that trickle all the way up to mount_all()'s usage. Now, in may of the uses, the argument becomes uid_shift == 0 ? UID_INVALID : uid_shift because it used to treat uid_shift=0 as invalid unless the patch_ids flag was also set. This keeps the behavior the same. Note that in all cases where it is invoked, if !use_userns (sometimes called !userns), then uid_shift is 0; we don't have to add any checks for that. That said, I'm pretty sure that "uid=0" and not setting "uid=" are the same, but Christian Brauner seemed to not think so when implementing the cgns support. https://github.com/systemd/systemd/pull/3589	2018-07-20 12:12:02 -04:00
Luke Shumaker	9c0fad5fb5	nspawn: Simplify mkdir_userns() usage, and trickle that up One of the things that mkdir_userns{,_p}() does is take an (optional) UID, and chown the directory to that. So we need a uid_t argument, and a way of telling if we should use that uid_t argument. Fortunately, that is built in to the uid_t type by having UID_INVALID as a possible value. However, currently mkdir_userns() also takes a MountSettingsMask and checks a couple of bits in it to decide if it should perform the chown. Drop the mask argument, and instead have the caller pass UID_INVALID if it shouldn't chown.	2018-07-20 12:12:02 -04:00
Lennart Poettering	0c69794138	tree-wide: remove Lennart's copyright lines These lines are generally out-of-date, incomplete and unnecessary. With SPDX and git repository much more accurate and fine grained information about licensing and authorship is available, hence let's drop the per-file copyright notice. Of course, removing copyright lines of others is problematic, hence this commit only removes my own lines and leaves all others untouched. It might be nicer if sooner or later those could go away too, making git the only and accurate source of authorship information.	2018-06-14 10:20:20 +02:00
Lennart Poettering	818bf54632	tree-wide: drop 'This file is part of systemd' blurb This part of the copyright blurb stems from the GPL use recommendations: https://www.gnu.org/licenses/gpl-howto.en.html The concept appears to originate in times where version control was per file, instead of per tree, and was a way to glue the files together. Ultimately, we nowadays don't live in that world anymore, and this information is entirely useless anyway, as people are very welcome to copy these files into any projects they like, and they shouldn't have to change bits that are part of our copyright header for that. hence, let's just get rid of this old cruft, and shorten our codebase a bit.	2018-06-14 10:20:20 +02:00
Lennart Poettering	d4b653c589	nspawn: lock down a few things in /proc by default This tightens security on /proc: a couple of files exposed there are now made inaccessible. These files might potentially leak kernel internals or expose non-virtualized concepts, hence lock them down by default. Moreover, a couple of dirs in /proc that expose stuff also exposed in /sys are now marked read-only, similar to how we handle /sys. The list is taken from what docker/runc based container managers generally apply, but slightly extended.	2018-05-03 17:45:42 +02:00
Lennart Poettering	10af01a5ff	nspawn: use free_and_replace() at more places	2018-05-03 17:19:46 +02:00
Lennart Poettering	88614c8a28	nspawn: size_t more stuff A follow-up for #8840	2018-05-03 17:19:46 +02:00
Zbigniew Jędrzejewski-Szmek	11a1589223	tree-wide: drop license boilerplate Files which are installed as-is (any .service and other unit files, .conf files, .policy files, etc), are left as is. My assumption is that SPDX identifiers are not yet that well known, so it's better to retain the extended header to avoid any doubt. I also kept any copyright lines. We can probably remove them, but it'd nice to obtain explicit acks from all involved authors before doing that.	2018-04-06 18:58:55 +02:00
Yu Watanabe	1cc6c93a95	tree-wide: use TAKE_PTR() and TAKE_FD() macros	2018-04-05 14:26:26 +09:00
Lennart Poettering	ae2a15bc14	macro: introduce TAKE_PTR() macro This macro will read a pointer of any type, return it, and set the pointer to NULL. This is useful as an explicit concept of passing ownership of a memory area between pointers. This takes inspiration from Rust: https://doc.rust-lang.org/std/option/enum.Option.html#method.take and was suggested by Alan Jenkins (@sourcejedi). It drops ~160 lines of code from our codebase, which makes me like it. Also, I think it clarifies passing of ownership, and thus helps readability a bit (at least for the initiated who know the new macro)	2018-03-22 20:21:42 +01:00
Zbigniew Jędrzejewski-Szmek	aa484f3561	tree-wide: use reallocarray instead of our home-grown realloc_multiply (#8279 ) There isn't much difference, but in general we prefer to use the standard functions. glibc provides reallocarray since version 2.26. I moved explicit_bzero is configure test to the bottom, so that the two stdlib functions are at the bottom.	2018-02-26 21:20:00 +01:00
Yu Watanabe	72d967df3e	nspawn: remove unnecessary mount option parsing logic	2018-02-21 09:06:55 +09:00
Yu Watanabe	30ffb010ff	nspawn: fix indentation	2018-02-21 09:05:33 +09:00
Zbigniew Jędrzejewski-Szmek	dae8b82eb9	Add mkdir_errno_wrapper() and use instead of mkdir() in various places We'd pass pointers to mkdir and mkdir_label to call in various places. mkdir returns the error in errno while mkdir_label returns the error directly.	2017-12-16 13:28:22 +01:00
Zbigniew Jędrzejewski-Szmek	40fd52f28d	util-lib: rename path_check_fstype to path_is_fs_type	2017-11-30 20:43:25 +01:00
Daniel Lockyer	87e4e28dcf	Replace empty ternary with helper method	2017-11-24 09:31:08 +00:00
Lennart Poettering	6925a0de4e	cgroup-util: move Set* allocation into cg_kernel_controllers() Previously, callers had to do this on their own. Let's make the call do that instead, making the caller code a bit shorter.	2017-11-21 11:54:08 +01:00
Lennart Poettering	bf516294c8	nspawn: minor optimization no need to prepare the target path if we quite the loop anyway one step later.	2017-11-21 11:54:08 +01:00
Lennart Poettering	d7c9693a3e	nspawn-mount: rework get_controllers() a bit Let's rename get_controllers() → get_process_controllers(), in order to underline the difference to cg_kernel_controllers(). After all, one returns the controllers available to the process, the other the controllers enabled in the kernel at all). Let's also update the code to use read_line() and set_put_strdup() to shorten the code a bit, and make it more robust.	2017-11-21 11:54:08 +01:00
Lennart Poettering	ea9053c5f8	nspawn: rework mount_systemd_cgroup_writable() a bit We shouldn't call alloca() as part of function calls, that's not really defined in C. Hence, let's first do our stack allocations, and then invoke functions. Also, some coding style fixes, and minor shuffling around. No functional changes.	2017-11-21 11:54:08 +01:00
Zbigniew Jędrzejewski-Szmek	53e1b68390	Add SPDX license identifiers to source files under the LGPL This follows what the kernel is doing, c.f. https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=5fd54ace4721fc5ce2bb5aef6318fcf17f421460.	2017-11-19 19:08:15 +01:00
Lauri Tirkkonen	4f13e53428	nspawn: EROFS for chowning mount points is not fatal (#7122 ) This fixes --read-only with --private-users. mkdir_userns_p may return -EROFS if either mkdir or lchown fails; lchown failing is fine as the mount point will just be overmounted, and if mkdir fails then the following mount() will also fail (with ENOENT).	2017-10-24 19:40:50 +02:00
Zbigniew Jędrzejewski-Szmek	349cc4a507	build-sys: use #if Y instead of #ifdef Y everywhere The advantage is that is the name is mispellt, cpp will warn us. $ git grep -Ee "conf.set$'(HAVE\|ENABLE)_" -l\|xargs sed -r -i "s/conf.set\('(HAVE\|ENABLE)_/conf.set10('\1_/" $ git grep -Ee '#ifn?def (HAVE\|ENABLE)' -l\|xargs sed -r -i 's/#ifdef (HAVE\|ENABLE)/#if \1/; s/#ifndef (HAVE\|ENABLE)/#if ! \1/;' $ git grep -Ee 'if.defined\(HAVE' -l\|xargs sed -i -r 's/defined\((HAVE_[A-Z0-9_])$/\1/g' $ git grep -Ee 'if.defined$ENABLE' -l\|xargs sed -i -r 's/defined\((ENABLE_[A-Z0-9_])$/\1/g' + manual changes to meson.build squash! build-sys: use #if Y instead of #ifdef Y everywhere v2: - fix incorrect setting of HAVE_LIBIDN2	2017-10-04 12:09:29 +02:00
Zbigniew Jędrzejewski-Szmek	b167945935	nspawn: do not mount /sys/fs/kdbus	2017-07-23 12:03:00 -04:00
tomty89	e8a94ce83e	nspawn: add nosuid and nodev to /tmp mount (#6004 ) When automatic /tmp mount was introduced to nspawn in v219, it was done without having the nosuid and nodev mount options, which was the same case as systemd's default tmp.mount unit back then. nosuid and nodev was added to tmp.mount(.m4) in v231 for security reasons. matching the nspawn /tmp mount entry against that. Ref.: `2f9df7c96a` `bbb99c30d0`	2017-05-23 09:41:36 +02:00
Zbigniew Jędrzejewski-Szmek	78e4f19ebc	Merge pull request #5444 from poettering/cgroups-revert-no-error Revert "core: simplify cg_[all_]unified()" and more.	2017-02-24 18:48:57 -05:00
AsciiWolf	13e785f7a0	Fix missing space in comments (#5439 )	2017-02-24 18:14:02 +01:00
Lennart Poettering	b4cccbc13a	cgroup: change cg_unified() to possibly return errors again We use our cgroup APIs in various contexts, including from our libraries sd-login, sd-bus. As we don#t control those environments we can't rely that the unified cgroup setup logic succeeds, and hence really shouldn't assert on it. This more or less reverts `415fc41cea`.	2017-02-24 17:52:58 +01:00
Tejun Heo	2977724b09	core: make hybrid cgroup unified mode keep compat /sys/fs/cgroup/systemd hierarchy Currently the hybrid mode mounts cgroup v2 on /sys/fs/cgroup instead of the v1 name=systemd hierarchy. While this works fine for systemd itself, it breaks tools which expect cgroup v1 hierarchy on /sys/fs/cgroup/systemd. This patch updates the hybrid mode so that it mounts v2 hierarchy on /sys/fs/cgroup/unified and keeps v1 "name=systemd" hierarchy on /sys/fs/cgroup/systemd for compatibility. systemd itself doesn't depend on the "name=systemd" hierarchy at all. All operations take place on the v2 hierarchy as before but the v1 hierarchy is kept in sync so that any tools which expect it to be there can keep doing so. This allows systemd to take advantage of cgroup v2 process management without requiring other tools to be aware of the hybrid mode. The hybrid mode is implemented by mapping the special systemd controller to /sys/fs/cgroup/unified and making the basic cgroup utility operations - cg_attach(), cg_create(), cg_rmdir() and cg_trim() - also operate on the /sys/fs/cgroup/systemd hierarchy whenever the cgroup2 hierarchy is updated. While a bit messy, this will allow dropping complications from using cgroup v1 for process management a lot sooner than otherwise possible which should make it a net gain in terms of maintainability. v2: Fixed !cgns breakage reported by @evverx and renamed the unified mount point to /sys/fs/cgroup/unified as suggested by @brauner. v3: chown the compat hierarchy too on delegation. Suggested by @evverx. v4: [zj] - drop the change to default, full "legacy" is still the default.	2017-02-20 12:28:35 -05:00
Tejun Heo	415fc41cea	core: simplify cg_[all_]unified() cg_[all_]unified() test whether a specific controller or all controllers are on the unified hierarchy. While what's being asked is a simple binary question, the callers must assume that the functions may fail any time, which unnecessarily complicates their usages. This complication is unnecessary. Internally, the test result is cached anyway and there are only a few places where the test actually needs to be performed. This patch simplifies cg_[all_]unified(). * cg_[all_]unified() are updated to return bool. If the result can't be decided, assertion failure is triggered. Error handlings from their callers are dropped. * cg_unified_flush() is updated to calculate the new result synchrnously and return whether it succeeded or not. Places which need to flush the test result are updated to test for failure. This ensures that all the following cg_[all_]unified() tests succeed. * Places which expected possible cg_[all_]unified() failures are updated to call and test cg_unified_flush() before calling cg_[all_]unified(). This includes functions used while setting up mounts during boot and manager_setup_cgroup().	2017-02-18 17:51:13 -05:00
Philip Withnall	b53ede699c	nspawn: Add support for sysroot pivoting (#5258 ) Add a new --pivot-root argument to systemd-nspawn, which specifies a directory to pivot to / inside the container; while the original / is pivoted to another specified directory (if provided). This adds support for booting container images which may contain several bootable sysroots, as is common with OSTree disk images. When these disk images are booted on real hardware, ostree-prepare-root is run in conjunction with sysroot.mount in the initramfs to achieve the same results.	2017-02-08 16:54:31 +01:00
Lennart Poettering	a4c35b6b4d	nspawn: split out VolatileMode definitions This moves the VolatileMode enum and its helper functions to src/shared/. This is useful to then reuse them to implement systemd.volatile= in a later commit.	2016-12-20 20:00:08 +01:00
Evgeny Vereshchagin	c9fd987279	nspawn: don't hide --bind=/tmp/* mounts (#4824 ) Fixes #4789	2016-12-05 18:14:05 +01:00
Lennart Poettering	cb638b5e96	util-lib: rename CHASE_NON_EXISTING → CHASE_NONEXISTENT As suggested by @keszybz	2016-12-01 12:49:55 +01:00
Lennart Poettering	ec57bd426a	nspawn: improve log messages When complaining about the inability to resolve a path, show the full path, not just the relative one. As suggested by @keszybz.	2016-12-01 12:41:18 +01:00
Lennart Poettering	c7a4890ce4	nspawn: optionally, automatically allocated --bind=/--overlay source from /var/tmp This extends the --bind= and --overlay= syntax so that an empty string as source/upper directory is taken as request to automatically allocate a temporary directory below /var/tmp, whose lifetime is bound to the nspawn runtime. In combination with the "+" path extension this permits a switch "--overlay=+/var::/var" in order to use the container's shipped /var, combine it with a writable temporary directory and mount it to the runtime /var of the container.	2016-12-01 12:41:18 +01:00
Lennart Poettering	86c0dd4a71	nspawn: permit prefixing of source paths in --bind= and --overlay= with "+" If a source path is prefixed with "+" it is taken relative to the container's root directory instead of the host. This permits easily establishing bind and overlay mounts based on data from the container rather than the host. This also reworks custom_mounts_prepare(), and turns it into two functions: one custom_mount_check_all() that remains in nspawn.c but purely verifies the validity of the custom mounts configured. And one called custom_mount_prepare_all() that actually does the preparation step, sorts the custom mounts, resolves relative paths, and allocates temporary directories as necessary.	2016-12-01 12:41:18 +01:00
Lennart Poettering	ad85779a50	nspawn: split out overlayfs argument parsing into a function of its own Add overlay_mount_parse() similar in style to tmpfs_mount_parse() and bind_mount_parse().	2016-12-01 00:25:51 +01:00
Lennart Poettering	48cbe5f80b	nspawn: use -ENOMEM instead of log_oom() in one case The function is of the "library" kind and doesn't log ENOMEM in all other cases, hence fix the one outlier.	2016-12-01 00:25:51 +01:00
Lennart Poettering	8ce48cf0f8	nspawn: use the new CHASE_NON_EXISTING flag when resolving mount points This restores the ability to implicitly create files/directories to mount specified mount points on.	2016-12-01 00:25:51 +01:00
Lennart Poettering	c4f4fce79e	fs-util: add flags parameter to chase_symlinks() Let's remove chase_symlinks_prefix() and instead introduce a flags parameter to chase_symlinks(), with a flag CHASE_PREFIX_ROOT that exposes the behaviour of chase_symlinks_prefix().	2016-12-01 00:25:51 +01:00
Lennart Poettering	68cf43c315	nspawn: use chase_symlinks() on all paths specified via --tmpfs=, --bind= and so on Fixes: #2860	2016-12-01 00:25:51 +01:00
Lennart Poettering	4da92e5857	nspawn: coding style: don't mix variable declarations and function calls	2016-12-01 00:25:51 +01:00
Lennart Poettering	5639193139	nspawn: use realloc_multiply() where it makes sense	2016-12-01 00:25:51 +01:00
Lennart Poettering	e187369587	tree-wide: stop using canonicalize_file_name(), use chase_symlinks() instead Let's use chase_symlinks() everywhere, and stop using GNU canonicalize_file_name() everywhere. For most cases this should not change behaviour, however increase exposure of our function to get better tested. Most importantly in a few cases (most notably nspawn) it can take the correct root directory into account when chasing symlinks.	2016-12-01 00:25:51 +01:00
Lennart Poettering	acbbf69b71	nspawn: don't require chown() if userns is not on Fixes: #4711	2016-11-22 13:35:24 +01:00

1 2

89 commits