Systemd

Author	SHA1	Message	Date
Yu Watanabe	62b1e758d3	nspawn: adjust path to static resolv.conf to support split usr Fixes #7302.	2017-11-25 21:11:07 +09:00
Lennart Poettering	d381c8a6bf	nspawn: hash the machine name, when looking for a suitable UID base (#7437 ) When "-U" is used we look for a UID range we can use for our container. We start with the UID the tree is already assigned to, and if that didn't work we'd pick random ranges so far. With this change we'll first try to hash a suitable range from the container name, and use that if it works, in order to make UID assignments more likely to be stable. This follows a similar logic PID 1 follows when using DynamicUser=1.	2017-11-24 20:57:19 +01:00
Lennart Poettering	a8027a18f1	Merge pull request #7442 from poettering/scope-fixes some fixes to the scope unit type	2017-11-24 17:15:09 +01:00
Lennart Poettering	f170504825	Merge pull request #7453 from neosilky/coccinelle-fixes Applied fixes from Coccinelle	2017-11-24 13:29:48 +01:00
Daniel Lockyer	f9ecfd3bbe	Replace free and reassignment with free_and_replace	2017-11-24 10:33:41 +00:00
Daniel Lockyer	87e4e28dcf	Replace empty ternary with helper method	2017-11-24 09:31:08 +00:00
Lennart Poettering	abdb9b08f6	nspawn: make use of the RequestStop logic of scope units Since time began, scope units had a concept of "Controllers", a bus peer that would be notified when somebody requested a unit to stop. None of our code used that facility so far, let's change that. This way, nspawn can print a nice message when somebody invokes "systemctl stop" on the container's scope unit, and then react with the right action to shut it down.	2017-11-23 21:47:48 +01:00
Zbigniew Jędrzejewski-Szmek	ffb70e4424	Merge pull request #7381 from poettering/cgroup-unified-delegate-rework Fix delegation in the unified hierarchy + more cgroup work	2017-11-22 07:42:08 +01:00
Lennart Poettering	6925a0de4e	cgroup-util: move Set* allocation into cg_kernel_controllers() Previously, callers had to do this on their own. Let's make the call do that instead, making the caller code a bit shorter.	2017-11-21 11:54:08 +01:00
Lennart Poettering	bf516294c8	nspawn: minor optimization no need to prepare the target path if we quit the loop anyway one step later.	2017-11-21 11:54:08 +01:00
Lennart Poettering	d7c9693a3e	nspawn-mount: rework get_controllers() a bit Let's rename get_controllers() → get_process_controllers(), in order to underline the difference to cg_kernel_controllers(). After all, one returns the controllers available to the process, the other the controllers enabled in the kernel at all. Let's also update the code to use read_line() and set_put_strdup() to shorten the code a bit, and make it more robust.	2017-11-21 11:54:08 +01:00
Lennart Poettering	ea9053c5f8	nspawn: rework mount_systemd_cgroup_writable() a bit We shouldn't call alloca() as part of function calls, that's not really defined in C. Hence, let's first do our stack allocations, and then invoke functions. Also, some coding style fixes, and minor shuffling around. No functional changes.	2017-11-21 11:54:08 +01:00
Shawn Landden	4831981d89	tree-wide: adjust fall through comments so that gcc is happy Distcc removes comments, making the comment silencing not work. I know there was a decision against a macro in commit `ec251fe7d5`	2017-11-20 13:06:25 -08:00
Zbigniew Jędrzejewski-Szmek	3a726fcd08	Add license headers and SPDX identifiers to meson.build files So far I avoided adding license headers to meson files, but they are pretty big and important and should carry license headers like everything else. I added my own copyright, even though other people modified those files too. But this is mostly symbolic, so I hope that's OK.	2017-11-19 19:08:15 +01:00
Zbigniew Jędrzejewski-Szmek	53e1b68390	Add SPDX license identifiers to source files under the LGPL This follows what the kernel is doing, c.f. https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=5fd54ace4721fc5ce2bb5aef6318fcf17f421460.	2017-11-19 19:08:15 +01:00
Lennart Poettering	3603efdea5	nspawn: make recursive chown()ing logic safe for being aborted in the middle We currently use the ownership of the top-level directory as a hint whether we need to descend into the whole tree to chown() it recursively or not. This is problematic with the previous chown()ing algorithm, as when descending into the tree we'd first chown() and then descend further down, which meant that the top-level directory would be chowned first, and an aborted recursive chowning would appear on the next invocation as successful, even though it was not. Let's reshuffle things a bit, to make the re-chown()ing safe regarding interruptions: a) We chown() the dir we are looking at last, and descend into all its children first. That way we know that if the top-level dir is properly owned everything inside of it is properly owned too. b) Before starting a chown()ing operation, we mark the top-level directory as owned by a special "busy" UID range, which we can use to recognize whether a tree was fully chowned: if it is marked as busy, it's definitely not fully chowned, as the busy ownership will only be fixed as final step of the chowning. Fixes: #6292	2017-11-17 11:12:33 +01:00
Lennart Poettering	14f8ccc755	nspawn: add missing #pragma once to header file	2017-11-17 11:12:33 +01:00
Lennart Poettering	0986658d51	Merge pull request #6866 from sourcejedi/set-linger2 logind: fix `loginctl enable-linger`	2017-11-15 11:15:15 +01:00
Lennart Poettering	bcde742e78	conf-parser: turn three bool function params into a flags fields This makes things more readable and fixes some issues with incorrect flag propagation between the various flavours of config_parse().	2017-11-13 10:24:03 +01:00
Lennart Poettering	759aaedc5c	dissect: when we invoke dissection on a loop device with partscan help the user This adds some simple detection logic for cases where dissection is invoked on an externally created loop device, and partitions have been detected on it, but partition scanning so far was off. If this is detected we now print a brief message indicating what the issue is, instead of failing with a useless EINVAL message the kernel passed to us.	2017-10-26 17:54:56 +02:00
Lennart Poettering	eb38edce88	machine-image: add partial discovery of block devices as images This adds some basic discovery of block device images for nspawn and friends. Note that this doesn't add searching for block devices using udev, but instead expects users to symlink relevant block devices into /var/lib/machines. Discovery is hence done exactly like for dir/subvol/raw file images, except that what is found may be a (symlink to) a block device. For now, we do not support cloning these images, but removal, renaming and read-only flags are supported to the point where that makes sense. Fixes: #6990	2017-10-26 17:54:56 +02:00
Lauri Tirkkonen	4f13e53428	nspawn: EROFS for chowning mount points is not fatal (#7122 ) This fixes --read-only with --private-users. mkdir_userns_p may return -EROFS if either mkdir or lchown fails; lchown failing is fine as the mount point will just be overmounted, and if mkdir fails then the following mount() will also fail (with ENOENT).	2017-10-24 19:40:50 +02:00
myrkr	1898e5f9a3	nspawn: Fix calculation of capabilities for configuration file (#7087 ) The current code shifting an integer 1 failed for capabilities like CAP_MAC_ADMIN (numerical value 33). This caused issues when specifying them in the nspawn configuration file. Using an uint64_t 1 instead. The similar code for processing the --capability command line option was already correctly working.	2017-10-24 09:56:40 +02:00
Alan Jenkins	8d9c2bca41	nspawn: comment to acknowledge lying about "user session"	2017-10-18 09:47:10 +01:00
Yu Watanabe	c31ad02403	mkdir: introduce follow_symlink flag to mkdir_safe{,_label}()	2017-10-06 16:03:33 +09:00
Lennart Poettering	44898c5358	seccomp: add three more seccomp groups @aio → asynchronous IO calls @sync → msync/fsync/... and friends @chown → changing file ownership (Also, change @privileged to reference @chown now, instead of the individual syscalls it contains)	2017-10-05 15:42:48 +02:00
Lennart Poettering	4c3a917617	seccomp: include prlimit64 and ugetrlimit in @default Also, move prlimit64() out of @resources. prlimit64() may be used both for getting and setting resource limits, and is implicitly called by glibc at various places, on some archs, the same way as getrlimit(). Similarly, ugetrlimit() is an arch-specific replacement for getrlimit(), and hence should be whitelisted at the same place as getrlimit() and prlimit64(). Also see: https://lists.freedesktop.org/archives/systemd-devel/2017-September/039543.html	2017-10-05 11:27:34 +02:00
Zbigniew Jędrzejewski-Szmek	349cc4a507	build-sys: use #if Y instead of #ifdef Y everywhere The advantage is that if the name is misspelt, cpp will warn us. $ git grep -Ee "conf.set$'(HAVE\|ENABLE)_" -l\|xargs sed -r -i "s/conf.set\('(HAVE\|ENABLE)_/conf.set10('\1_/" $ git grep -Ee '#ifn?def (HAVE\|ENABLE)' -l\|xargs sed -r -i 's/#ifdef (HAVE\|ENABLE)/#if \1/; s/#ifndef (HAVE\|ENABLE)/#if ! \1/;' $ git grep -Ee 'if.defined\(HAVE' -l\|xargs sed -i -r 's/defined\((HAVE_[A-Z0-9_])$/\1/g' $ git grep -Ee 'if.defined$ENABLE' -l\|xargs sed -i -r 's/defined\((ENABLE_[A-Z0-9_])$/\1/g' + manual changes to meson.build squash! build-sys: use #if Y instead of #ifdef Y everywhere v2: - fix incorrect setting of HAVE_LIBIDN2	2017-10-04 12:09:29 +02:00
Djalal Harouni	09d3020b0a	seccomp: remove '@credentials' syscall set (#6958 ) This removes the '@credentials' syscall set that was added in commit v234-468-gcd0ddf6f75. Most of these syscalls are so simple that we do not want to filter them. They work on the current calling process, doing only read operations, they do not have a deep kernel path. The problem may only be in 'capget' syscall since it can query arbitrary processes, and used to discover processes, however sending signal 0 to arbitrary processes can be used to discover if a process exists or not. It is unfortunate that Linux allows to query processes of different users. Let's put it now in '@process' syscall set, and later we may add it to a new '@basic-process' set that allows most basic process operations.	2017-10-03 07:20:05 +02:00
Lennart Poettering	64fbdc0f91	nspawn: properly report all kinds of changed UID/GID when patching things for userns We forgot to propagate one chmod().	2017-10-02 17:41:43 +02:00
Andreas Rammhold	3742095b27	tree-wide: use IN_SET where possible In addition to the changes from #6933 this handles cases that could be matched with the included cocci file.	2017-10-02 13:09:54 +02:00
Lennart Poettering	8e5430c4bd	nspawn: set up a new session keyring for the container process keyring material should not leak into the container. So far we relied on seccomp to deny access to the keyring, but given that we now made the seccomp configurable, and access to keyctl() and friends may optionally be permitted to containers now, let's make sure we disconnect the caller's keyring from the keyring of PID 1 in the container.	2017-09-22 15:28:04 +02:00
Lennart Poettering	96bedbe2e5	nspawn: replace syscall blacklist by a whitelist Let's lock things down a bit, and maintain a list of what's permitted rather than a list of what's prohibited in nspawn (also to make things a bit more like Docker and friends). Note that this slightly alters the effect of --system-call-filter=, as the negative list now takes precedence over the positive list. However, given that the option is just a few days old and not included in any released version it should be fine to change it at this point in time. Note that the whitelist is a good chunk more restrictive than the previous blacklist. Specifically: - fanotify is not permitted (given the buffer size issues it's problematic in containers) - nfsservctl is not permitted (NFS server support is not virtualized) - pkey_xyz stuff is not permitted (really new stuff I don't grok) - @cpu-emulation is prohibited (untested legacy stuff mostly, and if people really want to run dosemu in nspawn, they should use --system-call-filter=@cpu-emulation and all should be good)	2017-09-14 15:45:21 +02:00
Lennart Poettering	960e4569e1	nspawn: implement configurable syscall whitelisting/blacklisting Now that we have ported nspawn's seccomp code to the generic code in seccomp-util, let's extend it to support whitelisting and blacklisting of specific additional syscalls. This uses similar syntax as PID1's support for system call filtering, but in contrast to that always implements a blacklist (and not a whitelist), as we prepopulate the filter with a blacklist, and the unit's system call filter logic does not come with anything prepopulated. (Later on we might actually want to invert the logic here, and whitelist rather than blacklist things, but at this point let's not do that. In case we switch this over later, the syscall add/remove logic of this commit should be compatible conceptually.) Fixes: #5163 Replaces: #5944	2017-09-12 14:06:21 +02:00
Lennart Poettering	7609340e2f	nspawn: replace homegrown seccomp filter table largely with references to the existing syscall groups Let's shorten the table, now that we are hooked up to the syscall group system.	2017-09-11 18:00:07 +02:00
Lennart Poettering	402530d91e	nspawn: port over seccomp code to use seccomp_add_syscall_filter_item() Let's unify a bit of the code here.	2017-09-11 18:00:07 +02:00
Lennart Poettering	21022b9dde	util-lib: wrap personality() to fix up broken glibc error handling (#6766 ) glibc appears to propagate different errors in different ways, let's fix this up, so that our own code doesn't get confused by this. See #6752 + #6737 for details. Fixes: #6755	2017-09-08 17:16:29 +03:00
Zbigniew Jędrzejewski-Szmek	b167945935	nspawn: do not mount /sys/fs/kdbus	2017-07-23 12:03:00 -04:00
Lennart Poettering	8cb5743079	nspawn: downgrade warning when we get sd_notify() message from unexpected process (#6416 ) Given that we set NOTIFY_SOCKET unconditionally it's not surprising that processes way down the process tree think it's smart to send us a notification message. It's still useful to keep this message, for debugging things, but it shouldn't be generated by default.	2017-07-20 14:46:58 -04:00
Zbigniew Jędrzejewski-Szmek	e5f752082e	build-sys: drop gitignore patterns for in-tree builds ... and other autotools-generated files.	2017-07-18 10:05:06 -04:00
Zbigniew Jędrzejewski-Szmek	72cdb3e783	build-sys: drop automake support v2: - also mention m4	2017-07-18 10:04:44 -04:00
Lennart Poettering	3dad4f0666	Merge pull request #6257 from keszybz/unnecessary-job-log core: do not print color console message about gc-ed jobs	2017-07-03 10:48:28 +02:00
Zbigniew Jędrzejewski-Szmek	0a5706d143	nspawn: wait for the scope to be created (#6261 ) Fixes #6253.	2017-07-03 07:59:49 +02:00
Zbigniew Jędrzejewski-Szmek	bd68e99bd0	Be slightly more verbose in error message Including the full path is always useful. Also use PID_FMT in one more place.	2017-07-02 12:03:56 -04:00
Lennart Poettering	cd2dfc6fae	nspawn: register a scope for the unit if --register=no is specified (#6166 ) Previously, only when --register=yes was set (the default) the invoked container would get its own scope, created by machined on behalf of nspawn. With this change if --register=no is set nspawn will still get its own scope (which is a good thing, so that --slice= and --property= take effect), but this is not done through machined but by registering a scope unit directly in PID 1. Summary: --register=yes → allocate a new scope through machined (the default) --register=yes --keep-unit → use the unit we are already running in and register with machined --register=no → allocate a new scope directly, but no machined --register=no --keep-unit → do not allocate nor register anything Fixes: #5823	2017-06-28 13:22:46 -04:00
Lennart Poettering	a462478539	nspawn: make sure to send SIGTERM/SIGHUP to the main nspawn process if stubinit receives SIGRTMIN+3 (#6167 ) This code already existed in some form, however commented. Remove the comments, as this was most likely simply a forgotten commenting for debugging purposes. This also extends the logic a bit, by sending SIGHUP right after the SIGTERM, so that shells will also terminate, when PID 1 gets a SIGRTMIN+3. Fixes: #5711	2017-06-22 22:20:09 -04:00
tomty89	e8a94ce83e	nspawn: add nosuid and nodev to /tmp mount (#6004 ) When automatic /tmp mount was introduced to nspawn in v219, it was done without having the nosuid and nodev mount options, which was the same case as systemd's default tmp.mount unit back then. nosuid and nodev were added to tmp.mount(.m4) in v231 for security reasons. Matching the nspawn /tmp mount entry against that. Ref.: `2f9df7c96a` `bbb99c30d0`	2017-05-23 09:41:36 +02:00
Lennart Poettering	401a38e770	Merge pull request #5958 from keszybz/explicit-log-errno Use explicit errno in log calls	2017-05-22 10:12:18 +02:00
Matija Skala	fe9938888b	Fix includes (#5980 ) Needed on musl.	2017-05-19 10:01:35 -04:00
Zbigniew Jędrzejewski-Szmek	35bca925f9	tree-wide: fix incorrect uses of %m In those cases errno was not set, so we would be logging some unrelated error or "Success".	2017-05-13 15:42:26 -04:00

692 commits
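
Note on commit 1898e5f9a3 above: its message says that shifting an integer 1 failed for capabilities such as CAP_MAC_ADMIN (numerical value 33) and that a uint64_t 1 is used instead. The following is a minimal sketch of that idea, not the actual nspawn code. The names cap_to_mask(), truncate_to_32() and EXAMPLE_CAP_MAC_ADMIN are hypothetical and exist only for this illustration. The sketch assumes a platform where int is 32 bits wide, in which case `1 << 33` is undefined behaviour in C; for that reason the broken form is not executed here, and the loss is shown by truncating a correct 64-bit mask instead.

```c
#include <stdint.h>

/* Value quoted in the commit message; defined locally (hypothetical name)
 * so the sketch does not depend on <linux/capability.h>. */
#define EXAMPLE_CAP_MAC_ADMIN 33

/* Hypothetical helper: build the mask bit for a capability number.
 * UINT64_C(1) makes the left operand 64 bits wide, so capability
 * numbers up to 63 get a well-defined bit. */
static uint64_t cap_to_mask(unsigned cap) {
        return UINT64_C(1) << cap;
}

/* Hypothetical helper: shows what survives when a mask is squeezed
 * into 32 bits. Any capability numbered 32 or higher is lost. */
static uint32_t truncate_to_32(uint64_t mask) {
        return (uint32_t) mask;
}
```

With these helpers, cap_to_mask(EXAMPLE_CAP_MAC_ADMIN) has bit 33 set, while truncate_to_32() of the same mask is zero, which is the kind of silent loss a 32-bit-wide computation cannot avoid for capabilities numbered 32 and above.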