Systemd

Author	SHA1	Message	Date
Yu Watanabe	5029912157	network,udev: use uint64_t for bit rate Fixes #14620.	2020-01-21 16:51:19 +01:00
Lennart Poettering	88414eed6f	core: never allow perpetual units to be masked Fixes: #14550	2020-01-17 15:02:15 +01:00
Lennart Poettering	04d8507f68	Merge pull request #14381 from keszybz/ifindex-cleanup Resolve alternative names	2020-01-13 17:57:59 +01:00
Zbigniew Jędrzejewski-Szmek	5c3fa98db6	util-lib: move things that parse ifnames to shared/ In subsequent commits, calls to if_nametoindex() will be replaced by a wrapper that falls back to alternative name resolution over netlink. netlink support requires libsystemd (for sd-netlink), and we don't want to add any functions that require netlink in basic/. So stuff that calls if_nametoindex() for user supplied interface names, and everything that depends on that, needs to be moved.	2020-01-11 12:07:28 +01:00
Lennart Poettering	eb34a981d6	core: initialize priority_set when parsing swap unit files Fixes: #14524	2020-01-09 17:08:31 +01:00
Zbigniew Jędrzejewski-Szmek	a61d68748a	pid1: fix setting of DefaultTimeoutAbortSec This partially reverts `a07a7324ad`. We have two pieces of information: the value and a boolean. config_parse_timeout_abort() added in the reverted commit would write the boolean to the usec_t value, making a mess. The code is reworked to have just one implementation and two wrappers which pass two pointers.	2019-11-27 13:56:28 +01:00
Zbigniew Jędrzejewski-Szmek	b9d9fbe411	shared/conf-parser: remove unnecessary whitespace skipping The conf-parser machinery already removed whitespace before and after "=", no need to repeat this step. The test is adjusted to pass. It was testing an code path that doesn't happen normally, no point in doing that.	2019-11-27 13:56:28 +01:00
Zbigniew Jędrzejewski-Szmek	0b8d307587	pid1: fix the names of AllowedCPUs= and AllowedMemoryNodes= The original PR was submitted with CPUSetCpus and CPUSetMems, which was later changed to AllowedCPUs and AllowedMemmoryNodes everywhere (including the parser used by systemd-run), but not in the parser for unit files. Since we already released -rc1, let's keep support for the old names. I think we can remove it in a release or two if anyone remembers to do that. Fixes #14126. Follow-up for `047f5d63d7`.	2019-11-25 14:02:14 +01:00
Zbigniew Jędrzejewski-Szmek	3a0f06c41a	core: make TasksMax a partially dynamic property TasksMax= and DefaultTasksMax= can be specified as percentages. We don't actually document of what the percentage is relative to, but the implementation uses the smallest of /proc/sys/kernel/pid_max, /proc/sys/kernel/threads-max, and /sys/fs/cgroup/pids.max (when present). When the value is a percentage, we immediately convert it to an absolute value. If the limit later changes (which can happen e.g. when systemd-sysctl runs), the absolute value becomes outdated. So let's store either the percentage or absolute value, whatever was specified, and only convert to an absolute value when the value is used. For example, when starting a unit, the absolute value will be calculated when the cgroup for the unit is created. Fixes #13419.	2019-11-14 18:41:54 +01:00
Yu Watanabe	e30e8b5073	tree-wide: drop stat.h or statfs.h when stat-util.h is included	2019-11-04 00:30:32 +09:00
Yu Watanabe	455fa9610c	tree-wide: drop string.h when string-util.h or friends are included	2019-11-04 00:30:32 +09:00
Yu Watanabe	f5947a5e92	tree-wide: drop missing.h	2019-10-31 17:57:03 +09:00
Zbigniew Jędrzejewski-Szmek	abc9fa1cf1	core/load-fragment: remove unnecessary intialization manager_load_unit() better set it on success, and unit_set_slice() asserts that the argument is not NULL, so initializing it to NULL is not useful.	2019-10-16 16:33:54 +02:00
Zbigniew Jędrzejewski-Szmek	47538b7686	core/load-fragment: constify Unit* arguments where possible This makes it easy to tell that the function only uses the Unit* for reporting, and only makes changes to the other argument (which most likely also points at the same Unit structure) for modifications.	2019-10-16 16:32:45 +02:00
Zbigniew Jędrzejewski-Szmek	86e94d95d0	Merge pull request #13246 from keszybz/add-SystemdOptions-efi-variable Add efi variable to augment /proc/cmdline	2019-10-03 12:19:44 +02:00
Pavel Hrdina	047f5d63d7	cgroup: introduce support for cgroup v2 CPUSET controller Introduce support for configuring cpus and mems for processes using cgroup v2 CPUSET controller. This allows users to limit which cpus and memory NUMA nodes can be used by processes to better utilize system resources. The cgroup v2 interfaces to control it are cpuset.cpus and cpuset.mems where the requested configuration is written. However, it doesn't mean that the requested configuration will be actually used as parent cgroup may limit the cpus or mems as well. In order to reflect the real configuration cgroup v2 provides read-only files cpuset.cpus.effective and cpuset.mems.effective which are exported to users as well.	2019-09-24 15:16:07 +02:00
Maciej Stanczew	6327aa9f6c	core: Fix setting StatusUnitFormat from config files	2019-09-17 15:21:21 +09:00
Zbigniew Jędrzejewski-Szmek	fdb3decaa7	util-lib: move some functions from basic/cgroup-util to shared/cgroup-setup This way less stuff needs to be in basic. Initially, I wanted to move all the parts of cgroup-utils.[ch] that depend on efivars.[ch] to shared, because efivars.[ch] is in shared/. Later on, I decide to split efivars.[ch], so the move done in this patch is not necessary anymore. Nevertheless, it is still valid on its own. If at some point we want to expose libbasic, it is better to to not have stuff that belong in libshared there.	2019-09-16 18:08:00 +02:00
Franck Bui	5a1c1b534f	core: restore initialization of u->source_mtime During the rework of unit file loading, commit `e8630e6952` dropped the initialization u->source_mtime. This had the bad side effect that generated units always needed daemon reloading.	2019-09-16 15:53:52 +02:00
Yu Watanabe	4b259b3c63	Merge pull request #13244 from keszybz/allow-dots-in-usernames Allow dots in usernames	2019-08-29 00:03:19 +09:00
Lennart Poettering	4a8daee72f	load-fragment: use path_join() where appropriate	2019-08-20 17:32:34 +02:00
Zbigniew Jędrzejewski-Szmek	ae480f0b09	shared/user-util: allow usernames with dots in specific fields People do have usernames with dots, and it makes them very unhappy that systemd doesn't like their that. It seems that there is no actual problem with allowing dots in the username. In particular chown declares ":" as the official separator, and internally in systemd we never rely on "." as the seperator between user and group (nor do we call chown directly). Using dots in the name is probably not a very good idea, but we don't need to care. Debian tools (adduser) do not allow users with dots to be created. This patch allows existing names with dots to be used in User, Group, SupplementaryGroups, SocketUser, SocketGroup fields, both in unit files and on the command line. DynamicUsers and sysusers still follow the strict policy. user@.service and tmpfiles already allowed arbitrary user names, and this remains unchanged. Fixes #12754.	2019-08-19 21:19:13 +02:00
Lennart Poettering	5756bff6f1	Merge pull request #13119 from keszybz/unit-loading-2 Rework unit loading to take into account all aliases	2019-07-30 17:55:37 +02:00
Zbigniew Jędrzejewski-Szmek	91e0ee5f16	pid1: drop unit caches only based on mtime v2: - do not watch mtime of transient and generated dirs We'd reload the map after every transient unit we created, which we don't need to do, since we create those units ourselves and know their fragment path.	2019-07-30 14:01:46 +02:00
Zbigniew Jędrzejewski-Szmek	e8630e6952	pid1: use a cache for all unit aliases This reworks how we load units from disk. Instead of chasing symlinks every time we are asked to load a unit by name, we slurp all symlinks from disk and build two hashmaps: 1. from unit name to either alias target, or fragment on disk (if an alias, we put just the target name in the hashmap, if a fragment we put an absolute path, so we can distinguish both). 2. from a unit name to all aliases Reading all this data can be pretty costly (40 ms) on my machine, so we keep it around for reuse. The advantage is that we can reliably know what all the aliases of a given unit are. This means we can reliably load dropins under all names. This fixes #11972.	2019-07-30 14:01:46 +02:00
Zbigniew Jędrzejewski-Szmek	2e2ed88062	pid1,systemctl: allow symbolic exit code names	2019-07-29 15:54:53 +02:00
Zbigniew Jędrzejewski-Szmek	23d5dd1687	shared/exit-status: use Bitmap instead of Sets I opted to embed the Bitmap structure directly in the ExitStatusSet. This means that memory usage is a bit higher for units which don't define this setting: Service changes: /* size: 2720, cachelines: 43, members: 73 / / sum members: 2680, holes: 9, sum holes: 39 / / sum bitfield members: 7 bits, bit holes: 1, sum bit holes: 1 bits / / last cacheline: 32 bytes / / size: 2816, cachelines: 44, members: 73 / / sum members: 2776, holes: 9, sum holes: 39 / / sum bitfield members: 7 bits, bit holes: 1, sum bit holes: 1 bits */ But this way the code is simpler and we do less pointer chasing.	2019-07-29 15:54:53 +02:00
Zbigniew Jędrzejewski-Szmek	4ec8514142	Rename EXTRACT_QUOTES to EXTRACT_UNQUOTE Whenever I see EXTRACT_QUOTES, I'm always confused whether it means to leave the quotes in or to take them out. Let's say "unquote", like we say "cunescape".	2019-06-28 11:35:05 +02:00
Zbigniew Jędrzejewski-Szmek	cae90de3d3	Reindent some things for readability	2019-06-28 11:19:24 +02:00
Zbigniew Jędrzejewski-Szmek	9266f31e61	core: skip whitespace after "\|" and "!" in the condition parser We'd skip any whitespace immediately after "=", but then we'd treat whitespace that is between "\|" or "!" and the value as significant. This is rather confusing, let's ignore it too.	2019-06-27 10:54:37 +02:00
Frantisek Sumsal	a07a7324ad	core: move config_parse_* functions to a shared module Apart from making the code a little bit more clean, it should allow us to write a fuzzer around the config-parsing functions in the future	2019-06-25 22:35:02 +09:00
Kai Lüke	fab347489f	bpf-firewall: custom BPF programs through IP(Ingress\|Egress)FilterPath= Takes a single /sys/fs/bpf/pinned_prog string as argument, but may be specified multiple times. An empty assignment resets all previous filters. Closes https://github.com/systemd/systemd/issues/10227	2019-06-25 09:56:16 +02:00
Michal Sekletar	b070c7c0e1	core: introduce NUMAPolicy and NUMAMask options Make possible to set NUMA allocation policy for manager. Manager's policy is by default inherited to all forked off processes. However, it is possible to override the policy on per-service basis. Currently we support, these policies: default, prefer, bind, interleave, local. See man 2 set_mempolicy for details on each policy. Overall NUMA policy actually consists of two parts. Policy itself and bitmask representing NUMA nodes where is policy effective. Node mask can be specified using related option, NUMAMask. Default mask can be overwritten on per-service level.	2019-06-24 16:58:54 +02:00
Yu Watanabe	657ee2d82b	tree-wide: replace strjoin() with path_join()	2019-06-21 03:26:16 +09:00
Zbigniew Jędrzejewski-Szmek	0985c7c4e2	Rework cpu affinity parsing The CPU_SET_S api is pretty bad. In particular, it has a parameter for the size of the array, but operations which take two (CPU_EQUAL_S) or even three arrays (CPU_{AND,OR,XOR}_S) still take just one size. This means that all arrays must be of the same size, or buffer overruns will occur. This is exactly what our code would do, if it received an array of unexpected size over the network. ("Unexpected" here means anything different from what cpu_set_malloc() detects as the "right" size.) Let's rework this, and store the size in bytes of the allocated storage area. The code will now parse any number up to 8191, independently of what the current kernel supports. This matches the kernel maximum setting for any architecture, to make things more portable. Fixes #12605.	2019-05-29 10:20:42 +02:00
Chris Down	22bf131be2	cgroup: Support 0-value for memory protection directives These make sense to be explicitly set at 0 (which has a different effect than the default, since it can affect processing of `DefaultMemoryXXX`). Without this, it's not easily possible to relinquish memory protection for a subtree, which is not great.	2019-05-08 12:06:32 +01:00
Chris Down	7ad5439e06	unit: Add DefaultMemoryMin	2019-04-16 18:45:04 +01:00
Jan Klötzke	dc653bf487	service: handle abort stops with dedicated timeout When shooting down a service with SIGABRT the user might want to have a much longer stop timeout than on regular stops/shutdowns. Especially in the face of short stop timeouts the time might not be sufficient to write huge core dumps before the service is killed. This commit adds a dedicated (Default)TimeoutAbortSec= timer that is used when stopping a service via SIGABRT. In all other cases the existing TimeoutStopSec= is used. The timer value is unset by default to skip the special handling and use TimeoutStopSec= for state 'stop-watchdog' to keep the old behaviour. If the service is in state 'stop-watchdog' and the service should be stopped explicitly we still go to 'stop-sigterm' and re-apply the usual TimeoutStopSec= timeout.	2019-04-12 17:32:52 +02:00
Chris Down	c52db42b78	cgroup: Implement default propagation of MemoryLow with DefaultMemoryLow In cgroup v2 we have protection tunables -- currently MemoryLow and MemoryMin (there will be more in future for other resources, too). The design of these protection tunables requires not only intermediate cgroups to propagate protections, but also the units at the leaf of that resource's operation to accept it (by setting MemoryLow or MemoryMin). This makes sense from an low-level API design perspective, but it's a good idea to also have a higher-level abstraction that can, by default, propagate these resources to children recursively. In this patch, this happens by having descendants set memory.low to N if their ancestor has DefaultMemoryLow=N -- assuming they don't set a separate MemoryLow value. Any affected unit can opt out of this propagation by manually setting `MemoryLow` to some value in its unit configuration. A unit can also stop further propagation by setting `DefaultMemoryLow=` with no argument. This removes further propagation in the subtree, but has no effect on the unit itself (for that, use `MemoryLow=0`). Our use case in production is simplifying the configuration of machines which heavily rely on memory protection tunables, but currently require tweaking a huge number of unit files to make that a reality. This directive makes that significantly less fragile, and decreases the risk of misconfiguration. After this patch is merged, I will implement DefaultMemoryMin= using the same principles.	2019-04-12 17:23:58 +02:00
Lennart Poettering	afcfaa695c	core: implement OOMPolicy= and watch cgroups for OOM killings This adds a new per-service OOMPolicy= (along with a global DefaultOOMPolicy=) that controls what to do if a process of the service is killed by the kernel's OOM killer. It has three different values: "continue" (old behaviour), "stop" (terminate the service), "kill" (let the kernel kill all the service's processes). On top of that, track OOM killer events per unit: generate a per-unit structured, recognizable log message when we see an OOM killer event, and put the service in a failure state if an OOM killer event was seen and the selected policy was not "continue". A new "result" is defined for this case: "oom-kill". All of this relies on new cgroupv2 kernel functionality: the "memory.events" notification interface and the "memory.oom.group" attribute (which makes the kernel kill all cgroup processes automatically).	2019-04-09 11:17:58 +02:00
Zbigniew Jędrzejewski-Szmek	58f6ab4454	pid1: pass unit name to seccomp parser when we have no file location Building on previous commit, let's pass the unit name when parsing dbus message or builtin whitelist, which is better than nothing. seccomp_parse_syscall_filter() is not needed anymore, so it is removed, and seccomp_parse_syscall_filter_full() is renamed to take its place.	2019-04-03 09:17:42 +02:00
Lennart Poettering	dc44c96d97	core: pass parse error to log functions when parsing timer expressions	2019-04-01 18:25:43 +02:00
Lennart Poettering	25a04ae55e	core: simply timer expression parsing by using ".ltype" field of conf-parser logic No change of behaviour. Let's just not parse the lvalue all the time with timer_base_from_string() if we can already pass it in parsed.	2019-04-01 18:25:43 +02:00
Zbigniew Jędrzejewski-Szmek	983616735e	Merge pull request #12137 from poettering/socket-var-run warn about sockets in /var/run/ too	2019-03-29 15:00:25 +01:00
Lennart Poettering	4a66b5c9bf	core: complain and correct /var/run/ → /run/ for listening sockets We already do that for PIDFile= paths, and for tmpfiles.d/ snippets, let's also do this for .socket paths.	2019-03-28 16:59:57 +01:00
Lennart Poettering	7d2c9c6b50	load-fragment: use TAKE_PTR() where we can	2019-03-28 16:46:27 +01:00
Lennart Poettering	acd142af79	core: break overly long line	2019-03-28 12:09:38 +01:00
Lennart Poettering	2f6b9110fc	core: parse '@default' seccomp group permissively We are about to add system calls (rseq()) not available on old libseccomp/old kernels, and hence we need to be permissive when parsing our definitions.	2019-03-28 12:09:38 +01:00
Lennart Poettering	d8b4d14df4	util: split out nulstr related stuff to nulstr-util.[ch]	2019-03-14 13:25:52 +01:00
Lennart Poettering	eefc66aa8f	util: split out some stuff into a new file limits-util.[ch]	2019-03-13 12:16:43 +01:00

1 2 3 4 5 ...

507 commits