Systemd

Author	SHA1	Message	Date
Lennart Poettering	2038c3f584	core: add support for StandardInputFile= and friends These new settings permit specifiying arbitrary paths as stdin/stdout/stderr locations. We try to open/create them as necessary. Some special magic is applied: 1) if the same path is specified for both input and output/stderr, we'll open it only once O_RDWR, and duplicate them fd instead. 2) If we an AF_UNIX socket path is specified, we'll connect() to it, rather than open() it. This allows invoking systemd services with stdin/stdout/stderr connected to arbitrary foreign service sockets. Fixes: #3991	2017-11-17 11:13:44 +01:00
Lennart Poettering	e75a9ed176	execute: some extra asserts In some cases we checked for fd validity already explicitly, let's do this for all our fds.	2017-11-17 11:13:44 +01:00
Lennart Poettering	5073ff6bec	core: fold property_get_input_fdname() and property_get_output_fdname() into one property_get_output_fdname() already had two different control flows for stdout and stderr, it might as well handle stdin too, thus shortening our code a bit.	2017-11-17 11:13:44 +01:00
Lennart Poettering	3a274a218d	execute: fix type of open_terminal_as() flags parameter It's the flags parameter we propagate here, not the mode parameter, hence let's name it properly, and use the right type.	2017-11-17 11:13:44 +01:00
Lennart Poettering	08f3be7a38	core: add two new unit file settings: StandardInputData= + StandardInputText= Both permit configuring data to pass through STDIN to an invoked process. StandardInputText= accepts a line of text (possibly with embedded C-style escapes as well as unit specifiers), which is appended to the buffer to pass as stdin, followed by a single newline. StandardInputData= is similar, but accepts arbitrary base64 encoded data, and will not resolve specifiers or C-style escapes, nor append newlines. This may be used to pass input/configuration data to services, directly in-line from unit files, either in a cooked or in a more raw format.	2017-11-17 11:13:44 +01:00
Lennart Poettering	1fb0682e78	execute: check whether we are actually on a TTY before doing TIOCSCTTY Given that Linux assigns the same ioctl numbers ot multiple subsystems, we should be careful when invoking ioctls, so that we don't end up calling something we wouldn't want to call.	2017-11-17 11:13:44 +01:00
Lennart Poettering	046a82c1b2	fd-util: add new helper move_fd() and make use of it We are using the same pattern at various places: call dup2() on an fd, and close the old fd, usually in combination with some O_CLOEXEC fiddling. Let's add a little helper for this, and port a few obvious cases over.	2017-11-17 11:13:44 +01:00
Lennart Poettering	d3070fbdf6	core: implement /run/systemd/units/-based path for passing unit info from PID 1 to journald And let's make use of it to implement two new unit settings with it: 1. LogLevelMax= is a new per-unit setting that may be used to configure log priority filtering: set it to LogLevelMax=notice and only messages of level "notice" and lower (i.e. more important) will be processed, all others are dropped. 2. LogExtraFields= is a new per-unit setting for configuring per-unit journal fields, that are implicitly included in every log record generated by the unit's processes. It takes field/value pairs in the form of FOO=BAR. Also, related to this, one exisiting unit setting is ported to this new facility: 3. The invocation ID is now pulled from /run/systemd/units/ instead of cgroupfs xattrs. This substantially relaxes requirements of systemd on the kernel version and the privileges it runs with (specifically, cgroupfs xattrs are not available in containers, since they are stored in kernel memory, and hence are unsafe to permit to lesser privileged code). /run/systemd/units/ is a new directory, which contains a number of files and symlinks encoding the above information. PID 1 creates and manages these files, and journald reads them from there. Note that this is supposed to be a direct path between PID 1 and the journal only, due to the special runtime environment the journal runs in. Normally, today we shouldn't introduce new interfaces that (mis-)use a file system as IPC framework, and instead just an IPC system, but this is very hard to do between the journal and PID 1, as long as the IPC system is a subject PID 1 manages, and itself a client to the journal. This patch cleans up a couple of types used in journal code: specifically we switch to size_t for a couple of memory-sizing values, as size_t is the right choice for everything that is memory. Fixes: #4089 Fixes: #3041 Fixes: #4441	2017-11-16 12:40:17 +01:00
Yu Watanabe	3df90f24cc	core: allow to specify errno number in SystemCallErrorNumber=	2017-11-11 21:54:24 +09:00
Yu Watanabe	8cfa775f4f	core: add support to specify errno in SystemCallFilter= This makes each system call in SystemCallFilter= blacklist optionally takes errno name or number after a colon. The errno takes precedence over the one given by SystemCallErrorNumber=. C.f. #7173. Closes #7169.	2017-11-11 21:54:12 +09:00
Yu Watanabe	8092a48cc1	core/execute: do not create RuntimeDirectory= under private/ sub-directory RuntimeDirectory= often used for sharing files or sockets with other services. So, if creating them under private/ sub-directory, we cannot set DynamicUser= to service units which want to share something through RuntimeDirectory=. This makes the directories given by RuntimeDirectory= are created under /run/ even if DynamicUser= is set. Fixes #7260.	2017-11-08 15:50:58 +09:00
Yu Watanabe	652bb2637a	core/execute: RuntimeDirectory= or friends requires mount namespace Since #6940, RuntimeDirectory= or their friends imply BindPaths=. So, if at least one of them are set, mount namespace is required.	2017-11-08 15:48:51 +09:00
Yu Watanabe	7bcef4efe6	core: remove compile_read_write_paths() From `6c47cd7d3b`, RuntimeDirectory= and their friends also imply BindPaths=. Thus, implying ReadWritePaths= is meaningless.	2017-11-08 15:07:22 +09:00
Zbigniew Jędrzejewski-Szmek	895265ad7d	Merge pull request #7059 from yuwata/dynamic-user-7013 dynamic-user: permit the case static uid and gid are different	2017-10-18 08:37:12 +02:00
Yu Watanabe	e2b0cc3415	core: fix invalid error message The error message corresponds to EILSEQ is "Invalid or incomplete multibyte or wide character", and is not suitable in this case. So, let's show a custom error message when the function dynamic_creds_realize() returns -EILSEQ.	2017-10-18 08:57:59 +09:00
Yu Watanabe	709dbeac14	core: cleanup for enforce_groups() (#7064 ) SupplementaryGroups= is preprocessed in get_supplementary_groups(). So, it is not necessary to input ExecContext to enforce_groups().	2017-10-12 08:10:25 +02:00
Yu Watanabe	a8cabc612b	core: fix segfault in compile_bind_mounts() when BindPaths= or BindReadOnlyPaths= is set This fixes a bug introduced by `6c47cd7d3b`. Fixes #7055.	2017-10-11 12:28:22 +09:00
Zbigniew Jędrzejewski-Szmek	7081228acd	Merge pull request #7045 from poettering/namespace-casing some super-trivial fixes to namespace.c	2017-10-10 21:50:17 +02:00
Lennart Poettering	b74023db06	Merge pull request #7003 from yuwata/enable-dynamic-user timesyncd, journal-upload: Enable DynamicUser=	2017-10-10 10:05:43 +02:00
Lennart Poettering	bb0ff3fb1b	namespace: change NameSpace → Namespace We generally use the casing "Namespace" for the word, and that's visible in a number of user-facing interfaces, including "RestrictNamespace=" or "JoinsNamespaceOf=". Let's make sure to use the same casing internally too. As discussed in #7024	2017-10-10 09:51:58 +02:00
Michal Sekletar	6e2d7c4f13	namespace: fall back gracefully when kernel doesn't support network namespaces (#7024 )	2017-10-10 09:46:13 +02:00
Yu Watanabe	c31ad02403	mkdir: introduce follow_symlink flag to mkdir_safe{,_label}()	2017-10-06 16:03:33 +09:00
Lennart Poettering	4aa1d31c89	Merge pull request #6974 from keszybz/clean-up-defines Clean up define definitions	2017-10-04 19:25:30 +02:00
Lennart Poettering	5ad90fe376	Merge pull request #6985 from yuwata/empty load-fragment: do not create empty array	2017-10-04 17:54:35 +02:00
Yu Watanabe	4c70109600	tree-wide: use IN_SET macro (#6977 )	2017-10-04 16:01:32 +02:00
Zbigniew Jędrzejewski-Szmek	f9fa32f09c	build-sys: s/HAVE_SMACK/ENABLE_SMACK/ Same justification as for HAVE_UTMP.	2017-10-04 12:09:50 +02:00
Zbigniew Jędrzejewski-Szmek	349cc4a507	build-sys: use #if Y instead of #ifdef Y everywhere The advantage is that is the name is mispellt, cpp will warn us. $ git grep -Ee "conf.set$'(HAVE\|ENABLE)_" -l\|xargs sed -r -i "s/conf.set\('(HAVE\|ENABLE)_/conf.set10('\1_/" $ git grep -Ee '#ifn?def (HAVE\|ENABLE)' -l\|xargs sed -r -i 's/#ifdef (HAVE\|ENABLE)/#if \1/; s/#ifndef (HAVE\|ENABLE)/#if ! \1/;' $ git grep -Ee 'if.defined\(HAVE' -l\|xargs sed -i -r 's/defined\((HAVE_[A-Z0-9_])$/\1/g' $ git grep -Ee 'if.defined$ENABLE' -l\|xargs sed -i -r 's/defined\((ENABLE_[A-Z0-9_])$/\1/g' + manual changes to meson.build squash! build-sys: use #if Y instead of #ifdef Y everywhere v2: - fix incorrect setting of HAVE_LIBIDN2	2017-10-04 12:09:29 +02:00
Zbigniew Jędrzejewski-Szmek	ac6e8be66e	core: use strv_isempty to check if supplementary_groups is empty With the previous commit, we know that it will be NULL if empty, but it's safe to always use strv_isempty() in case the code changes in the future.	2017-10-04 11:33:30 +02:00
Lennart Poettering	da50b85af7	core: when looking for a UID to use for a dynamic UID start with the current owner of the StateDirectory= and friends Let's optimize dynamic UID allocation a bit: if a StateDirectory= (or suchlike) is configured, we start our allocation loop from that UID and use it if it currently isn't used otherwise. This is beneficial as it saves us from having to expensively recursively chown() these directories in the typical case (which StateDirectory= does when it notices that the owner of the directory doesn't match the UID picked). With this in place we now have the a three-phase logic for allocating a dynamic UID: a) first, we try to use the owning UID of StateDirectory=, CacheDirectory=, LogDirectory= if that exists and is currently otherwise unused. b) if that didn't work out, we hash the UID from the service name c) if that didn't yield an unused UID either, randomly pick new ones until we find a free one.	2017-10-02 17:41:44 +02:00
Lennart Poettering	6c47cd7d3b	execute: make StateDirectory= and friends compatible with DynamicUser=1 and RootDirectory=/RootImage= Let's clean up the interaction of StateDirectory= (and friends) to DynamicUser=1: instead of creating these directories directly below /var/lib, place them in /var/lib/private instead if DynamicUser=1 is set, making that directory 0700 and owned by root:root. This way, if a dynamic UID is later reused, access to the old run's state directory is prohibited for that user. Then, use file system namespacing inside the service to make /var/lib/private a readable tmpfs, hiding all state directories that are not listed in StateDirectory=, and making access to the actual state directory possible. Mount all directories listed in StateDirectory= to the same places inside the service (which means they'll now be mounted into the tmpfs instance). Finally, add a symlink from the state directory name in /var/lib/ to the one in /var/lib/private, so that both the host and the service can access the path under the same location. Here's an example: let's say a service runs with StateDirectory=foo. When DynamicUser=0 is set, it will get the following setup, and no difference between what the unit and what the host sees: /var/lib/foo (created as directory) Now, if DynamicUser=1 is set, we'll instead get this on the host: /var/lib/private (created as directory with mode 0700, root:root) /var/lib/private/foo (created as directory) /var/lib/foo → private/foo (created as symlink) And from inside the unit: /var/lib/private (a tmpfs mount with mode 0755, root:root) /var/lib/private/foo (bind mounted from the host) /var/lib/foo → private/foo (the same symlink as above) This takes inspiration from how container trees are protected below /var/lib/machines: they generally reuse UIDs/GIDs of the host, but because /var/lib/machines itself is set to 0700 host users cannot access files in the container tree even if the UIDs/GIDs are reused. However, for this commit we add one further trick: inside and outside of the unit /var/lib/private is a different thing: outside it is a plain, inaccessible directory, and inside it is a world-readable tmpfs mount with only the whitelisted subdirs below it, bind mounte din. This means, from the outside the dir acts as an access barrier, but from the inside it does not. And the symlink created in /var/lib/foo itself points across the barrier in both cases, so that root and the unit's user always have access to these dirs without knowing the details of this mounting magic. This logic resolves a major shortcoming of DynamicUser=1 units: previously they couldn't safely store persistant data. With this change they can have their own private state, log and data directories, which they can write to, but which are protected from UID recycling. With this change, if RootDirectory= or RootImage= are used it is ensured that the specified state/log/cache directories are always mounted in from the host. This change of semantics I think is much preferable since this means the root directory/image logic can be used easily for read-only resource bundling (as all writable data resides outside of the image). Note that this is a change of behaviour, but given that we haven't released any systemd version with StateDirectory= and friends implemented this should be a safe change to make (in particular as previously it wasn't clear what would actually happen when used in combination). Moreover, by making this change we can later add a "+" modifier to these setings too working similar to the same modifier in ReadOnlyPaths= and friends, making specified paths relative to the container itself.	2017-10-02 17:41:44 +02:00
Lennart Poettering	72fd17682d	core: usually our enum's _INVALID and _MAX special values are named after the full type In most cases we followed the rule that the special _INVALID and _MAX values we use in our enums use the full type name as prefix (in contrast to regular values that we often make shorter), do so for ExecDirectoryType as well. No functional changes, just a little bit of renaming to make this code more like the rest.	2017-10-02 17:41:43 +02:00
Lennart Poettering	a1164ae380	core: chown() StateDirectory= and friends recursively when starting a service This is particularly useful when used in conjunction with DynamicUser=1, where the UID might change for every invocation, but is useful in other cases too, for example, when these directories are shared between systems where the UID assignments differ slightly.	2017-10-02 17:41:43 +02:00
Lennart Poettering	40a80078d2	execute: let's close glibc syslog channels too Just in case something opened them, let's make sure glibc invalidates them too. Thankfully so far no library opened log channels behind our back, at least as far as I know, hence this is actually a NOP, but let's better be safe than sorry.	2017-09-26 17:52:25 +02:00
Lennart Poettering	12145637e9	execute: normalize logging in execute.c Now that logging can implicitly reopen the log streams when needed we can log errors without any special magic, hence let's normalize things, and log the same way we do everywhere else.	2017-09-26 17:51:22 +02:00
Lennart Poettering	86ffb32560	execute: drop explicit log_open()/log_close() now that it is unnecessary	2017-09-26 17:46:34 +02:00
Lennart Poettering	2c027c62dd	execute: make use of the new logging mode in execute.c	2017-09-26 17:46:34 +02:00
Lennart Poettering	82677ae4c7	execute: downgrade a log message ERR → WARNING, since we proceed ignoring its result	2017-09-26 17:46:33 +02:00
Lennart Poettering	8002fb9747	execute: rework logging in setup_keyring() to include unit info Let's use log_unit_error() instead of log_error() everywhere (and friends).	2017-09-26 17:46:33 +02:00
Lennart Poettering	e6a7ec4b8e	io-util: add new IOVEC_INIT/IOVEC_MAKE macros This adds IOVEC_INIT() and IOVEC_MAKE() for initializing iovec structures from a pointer and a size. On top of these IOVEC_INIT_STRING() and IOVEC_MAKE_STRING() are added which take a string and automatically determine the size of the string using strlen(). This patch removes the old IOVEC_SET_STRING() macro, given that IOVEC_MAKE_STRING() is now useful for similar purposes. Note that the old IOVEC_SET_STRING() invocations were two characters shorter than the new ones using IOVEC_MAKE_STRING(), but I think the new syntax is more readable and more generic as it simply resolves to a C99 literal structure initialization. Moreover, we can use very similar syntax now for initializing strings and pointer+size iovec entries. We canalso use the new macros to initialize function parameters on-the-fly or array definitions. And given that we shouldn't have so many ways to do the same stuff, let's just settle on the new macros. (This also converts some code to use _cleanup_ where dynamically allocated strings were using IOVEC_SET_STRING() before, to modernize things a bit)	2017-09-22 15:28:04 +02:00
Lennart Poettering	f1c50becda	core: make sure to log invocation ID of units also when doing structured logging	2017-09-22 15:24:55 +02:00
Jan Synacek	f679ed6116	execute: fix typo in error message (#6881 )	2017-09-21 10:38:52 +02:00
Zbigniew Jędrzejewski-Szmek	61ceaea5c8	Move one space from dbus-execute.c to execute.c The number of spaces is conserved ;)	2017-09-16 08:45:02 +02:00
Zbigniew Jędrzejewski-Szmek	3d7d3cbbda	Merge pull request #6832 from poettering/keyring-mode Add KeyringMode unit property to fix cryptsetup key caching	2017-09-15 21:24:48 +02:00
Lennart Poettering	b1edf4456e	core: add new per-unit setting KeyringMode= for controlling kernel keyring setup Usually, it's a good thing that we isolate the kernel session keyring for the various services and disconnect them from the user keyring. However, in case of the cryptsetup key caching we actually want that multiple instances of the cryptsetup service can share the keys in the root user's user keyring, hence we need to be able to disable this logic for them. This adds KeyringMode=inherit\|private\|shared: inherit: don't do any keyring magic (this is the default in systemd --user) private: a private keyring as before (default in systemd --system) shared: the new setting	2017-09-15 16:53:35 +02:00
Lennart Poettering	0460aa5c08	execute: improve and augment execution log messages Let's generate friendly messages for more cases, and make slight adjustments to the existing messages.	2017-09-15 16:43:06 +02:00
Lennart Poettering	ab2116b140	core: make sure that $JOURNAL_STREAM prefers stderr over stdout information (#6824 ) If two separate log streams are connected to stdout and stderr, let's make sure $JOURNAL_STREAM points to the latter, as that's the preferred log destination, and the environment variable has been created in order to permit services to automatically upgrade from stderr based logging to native journal logging. Also, document this behaviour. Fixes: #6800	2017-09-15 08:26:38 +02:00
Lennart Poettering	00819cc151	core: add new UnsetEnvironment= setting for unit files With this setting we can explicitly unset specific variables for processes of a unit, as last step of assembling the environment block for them. This is useful to fix #6407. While we are at it, greatly expand the documentation on how the environment block for forked off processes is assembled.	2017-09-14 15:17:40 +02:00
Lennart Poettering	21022b9dde	util-lib: wrap personality() to fix up broken glibc error handling (#6766 ) glibc appears to propagate different errors in different ways, let's fix this up, so that our own code doesn't get confused by this. See #6752 + #6737 for details. Fixes: #6755	2017-09-08 17:16:29 +03:00
Lennart Poettering	aac8c0c382	execute: minor ExecOutput handling beautification (#6711 ) Let's clean up the checking for the various ExecOutput values a bit, let's use IN_SET everywhere, and the same concepts for all three bools we pass to dprintf().	2017-09-01 09:04:27 +09:00
Lennart Poettering	e8132d63fe	seccomp: default to something resembling the current personality when locking it Let's lock the personality to the currently set one, if nothing is specifically specified. But do so with a grain of salt, and never default to any exotic personality here, but only PER_LINUX or PER_LINUX32.	2017-08-29 15:56:57 +02:00

1 2 3 4 5 ...

434 commits