Systemd

Author	SHA1	Message	Date
Zbigniew Jędrzejewski-Szmek	90e74a66e6	tree-wide: define iterator inside of the macro	2020-09-08 12:14:05 +02:00
Zbigniew Jędrzejewski-Szmek	771b52427a	core/job: adjust whitespace and comment	2020-07-22 17:58:12 +02:00
Dave Reisner	cc479760b4	Revert "job: Don't mark as redundant if deps are relevant" This reverts commit `097537f07a`. At least Fedora and Debian have already reverted this at the distro level because it causes more problems than it solves. Arch is debating reverting it as well [0] but would strongly prefer that this happens upstream first. Fixes #15188. [0] https://bugs.archlinux.org/task/66458	2020-06-23 11:42:45 +02:00
Zbigniew Jędrzejewski-Szmek	a4ac27c1af	manager: free the jobs hashmap after we have no jobs After a larger transaction, e.g. after bootup, we're left with an empty hashmap with hundreds of buckets. Long-term, it'd be better to size hashmaps down when they are less than 1/4 full, but even if we implement that, jobs hashmap is likely to be empty almost always, so it seems useful to deallocate it once the jobs count reaches 0.	2020-05-28 18:54:20 +02:00
Luca Boccassi	c03fbd37d6	core: add debug log when a job in the activation queue is not runnable When a job is skipped due to its dependencies not being ready, log a debug message saying what is holding it back. This was very useful with transient units timing out to figure out where the problem was.	2020-04-22 09:58:12 +01:00
Zbigniew Jędrzejewski-Szmek	162392b75a	tree-wide: spellcheck using codespell Fixes #15436.	2020-04-16 18:00:40 +02:00
Zbigniew Jędrzejewski-Szmek	eda0cbf071	Use Finished instead of Started for Type=oneshot services (#14851) UnitStatusMessageFormats.finished_job, if present, will be called with the same arguments as job_get_done_status_message_format() to provide a format string appropriate for the context. This commit replaces "Started" with "Finished" for started oneshot units, as mentioned in the referenced issue. Closes #2458.	2020-03-05 17:24:19 +01:00
Zbigniew Jędrzejewski-Szmek	5bcf34ebf3	pid1: when showing error status, do not switch to status=temporary We would flip to status=temporary mode on the first error, and then switch back to status=auto after the initial transaction was done. This isn't very useful, because usually all the messages about successfully started units are not related to the original failure. In fact, all those messages most likely cause the information about the prime error to scroll off screen. And if the user requested quiet boot, there's no reason to think that they care about those success messages. Also, when logging about dependency cycles, treat this similarly to a unit error and show the message even if the status is "soft disabled" (before we wouldn't show it in that case).	2020-03-01 11:42:42 +01:00
Zbigniew Jędrzejewski-Szmek	7365a29670	pid1: when printing status message status, give reason	2020-03-01 11:42:19 +01:00
Kevin Kuehler	097537f07a	job: Don't mark as redundant if deps are relevant In the steps given in #13850, the resulting graph looks like: C (Anchor) -> B -> A Since B is inactive, it will be flagged as redundant and removed from the transaction, causing A to get garbage collected. The proposed fix is to not mark nodes as redundant if doing so causes a relevant node to be garbage collected. Fixes #13850	2020-01-03 15:58:10 +01:00
Zbigniew Jędrzejewski-Szmek	754499fab2	Merge pull request #13904 from keur/job_mode_triggering Job mode triggering	2019-11-07 08:36:26 +01:00
Kevin Kuehler	1f0f9f21c1	core: Add triggering job mode When used with systemctl stop, follows TRIGGERED_BY dependencies and adds them to the same transaction. Fixes: #3043	2019-11-05 11:17:38 -08:00
HATAYAMA Daisuke	d1559793df	core, job: fix breakage of ordering dependencies by systemctl reload command Currently, systemctl reload command breaks ordering dependencies if it's executed when its target service unit is in activating state. For example, prepare A.service, B.service and C.target as follows: # systemctl cat A.service B.service C.target # /etc/systemd/system/A.service [Unit] Description=A [Service] Type=oneshot ExecStart=/usr/bin/echo A1 ExecStart=/usr/bin/sleep 60 ExecStart=/usr/bin/echo A2 ExecReload=/usr/bin/echo A reloaded RemainAfterExit=yes # /etc/systemd/system/B.service [Unit] Description=B After=A.service [Service] Type=oneshot ExecStart=/usr/bin/echo B RemainAfterExit=yes # /etc/systemd/system/C.target [Unit] Description=C Wants=A.service B.service Start them. # systemctl daemon-reload # systemctl start C.target Then, we have: # LANG=C journalctl --no-pager -u A.service -u B.service -u C.target -b -- Logs begin at Mon 2019-09-09 00:25:06 EDT, end at Thu 2019-10-24 22:28:47 EDT. -- Oct 24 22:27:47 localhost.localdomain systemd[1]: Starting A... Oct 24 22:27:47 localhost.localdomain systemd[1]: A.service: Child 967 belongs to A.service. Oct 24 22:27:47 localhost.localdomain systemd[1]: A.service: Main process exited, code=exited, status=0/SUCCESS Oct 24 22:27:47 localhost.localdomain systemd[1]: A.service: Running next main command for state start. 
Oct 24 22:27:47 localhost.localdomain systemd[1]: A.service: Passing 0 fds to service Oct 24 22:27:47 localhost.localdomain systemd[1]: A.service: About to execute: /usr/bin/sleep 60 Oct 24 22:27:47 localhost.localdomain systemd[1]: A.service: Forked /usr/bin/sleep as 968 Oct 24 22:27:47 localhost.localdomain systemd[968]: A.service: Executing: /usr/bin/sleep 60 Oct 24 22:27:52 localhost.localdomain systemd[1]: A.service: Trying to enqueue job A.service/reload/replace Oct 24 22:27:52 localhost.localdomain systemd[1]: A.service: Merged into running job, re-running: A.service/reload as 1288 Oct 24 22:27:52 localhost.localdomain systemd[1]: A.service: Enqueued job A.service/reload as 1288 Oct 24 22:27:52 localhost.localdomain systemd[1]: A.service: Unit cannot be reloaded because it is inactive. Oct 24 22:27:52 localhost.localdomain systemd[1]: A.service: Job 1288 A.service/reload finished, result=invalid Oct 24 22:27:52 localhost.localdomain systemd[1]: B.service: Passing 0 fds to service Oct 24 22:27:52 localhost.localdomain systemd[1]: B.service: About to execute: /usr/bin/echo B Oct 24 22:27:52 localhost.localdomain systemd[1]: B.service: Forked /usr/bin/echo as 970 Oct 24 22:27:52 localhost.localdomain systemd[970]: B.service: Executing: /usr/bin/echo B Oct 24 22:27:52 localhost.localdomain systemd[1]: B.service: Failed to send unit change signal for B.service: Connection reset by peer Oct 24 22:27:52 localhost.localdomain systemd[1]: B.service: Changed dead -> start Oct 24 22:27:52 localhost.localdomain systemd[1]: Starting B... Oct 24 22:27:52 localhost.localdomain echo[970]: B Oct 24 22:27:52 localhost.localdomain systemd[1]: B.service: Child 970 belongs to B.service. 
Oct 24 22:27:52 localhost.localdomain systemd[1]: B.service: Main process exited, code=exited, status=0/SUCCESS Oct 24 22:27:52 localhost.localdomain systemd[1]: B.service: Changed start -> exited Oct 24 22:27:52 localhost.localdomain systemd[1]: B.service: Job 1371 B.service/start finished, result=done Oct 24 22:27:52 localhost.localdomain systemd[1]: Started B. Oct 24 22:27:52 localhost.localdomain systemd[1]: C.target: Job 1287 C.target/start finished, result=done Oct 24 22:27:52 localhost.localdomain systemd[1]: Reached target C. Oct 24 22:27:52 localhost.localdomain systemd[1]: C.target: Failed to send unit change signal for C.target: Connection reset by peer Oct 24 22:28:47 localhost.localdomain systemd[1]: A.service: Child 968 belongs to A.service. Oct 24 22:28:47 localhost.localdomain systemd[1]: A.service: Main process exited, code=exited, status=0/SUCCESS Oct 24 22:28:47 localhost.localdomain systemd[1]: A.service: Running next main command for state start. Oct 24 22:28:47 localhost.localdomain systemd[1]: A.service: Passing 0 fds to service Oct 24 22:28:47 localhost.localdomain systemd[1]: A.service: About to execute: /usr/bin/echo A2 Oct 24 22:28:47 localhost.localdomain systemd[1]: A.service: Forked /usr/bin/echo as 972 Oct 24 22:28:47 localhost.localdomain systemd[972]: A.service: Executing: /usr/bin/echo A2 Oct 24 22:28:47 localhost.localdomain echo[972]: A2 Oct 24 22:28:47 localhost.localdomain systemd[1]: A.service: Child 972 belongs to A.service. Oct 24 22:28:47 localhost.localdomain systemd[1]: A.service: Main process exited, code=exited, status=0/SUCCESS Oct 24 22:28:47 localhost.localdomain systemd[1]: A.service: Changed start -> exited The issue occurs not only in reload command, i.e.: - reload - try-restart - reload-or-restart - reload-or-try-restart commands The cause of this issue is that job_type_collapse() doesn't take care of the activating state. Fixes: #10464	2019-11-04 16:45:23 +01:00
Lennart Poettering	735a8b6d38	job: fix coverity issue Fixes coverity issue 1403550	2019-07-31 09:45:03 +02:00
Michael Olbrich	da8e178296	job: make the run queue order deterministic Jobs are added to the run queue in random order. This happens because most jobs are added by iterating over the transaction or dependency hash maps. As a result, jobs that can be executed at the same time are started in a different order each time. On small embedded devices this can cause a measurable jitter for the point in time when a job starts (~100ms jitter for 10 units that are started in random order). This results in a similar jitter for the boot time. This is undesirable in general and makes optimizing the boot time a lot harder. Also, jobs that should have a higher priority because the unit has a higher CPU weight might get executed later than others. Fix this by turning the job run_queue into a Prioq and sort by the following criteria (use the next if the values are equal): - CPU weight - nice level - unit type - unit name The last one is just there for deterministic sorting to avoid any jitter.	2019-07-18 10:28:39 +02:00
Anita Zhang	31cd5f63ce	core: ExecCondition= for services Closes #10596	2019-07-17 11:35:02 +02:00
Yu Watanabe	8cec0a5c32	tree-wide: drop duplicated blank lines ``` $ for i in */*.[ch] */*/*.[ch]; do sed -e '/^$/ {N; s/\n$//g}' -i $i; done $ git checkout HEAD -- basic/linux shared/linux ```	2019-07-15 18:41:27 +02:00
Lennart Poettering	2e8e1a1ab6	Merge pull request #12461 from Werkov/fix-job-ordering Refactor job ordering implementation (and fix cycle detection)	2019-07-11 16:43:58 +02:00
Zbigniew Jędrzejewski-Szmek	2a8f53c67b	Use unit->id instead of description in messages v2: - rename unit_identifier to unit_status_string	2019-07-10 13:35:26 +02:00
Zbigniew Jędrzejewski-Szmek	1f65fd4926	basic/time-util: add helper function to check if timestamp is set No functional change.	2019-07-04 19:12:47 +02:00
Michal Koutný	e602f15282	core: Extract job ordering logic The job ordering logic is spread across multiple places in the code, making it hard to maintain and also a bit hard to understand. The actual execution order of two jobs always depends on their types and the ordering constraint between their units. Extract this logic to a new function job_compare. The second change is simplification of the order evaluation, JOB_STOP always takes precedence (as documented), unless two units are both stopping, then the ordering constraint is taken into account.	2019-06-26 00:00:43 +02:00
Lennart Poettering	760877e90c	util: split out sorting related calls to new sort-util.[ch]	2019-03-13 12:16:43 +01:00
Jonathon Kowalski	791cd15993	Fail RequisiteOf units with oneshots Fixes: #11422 Oneshots going to inactive directly without ever entering UNIT_ACTIVE is considered success. This however means that if something both Requires= and Requisite= a unit of such nature, the verify-active job getting merged into the start job makes it lose this property of failing the depending jobs, as there, the start job has the result JOB_DONE on success, so we never walk over RequisiteOf units. This change makes sure that such units always go down. It is also only meaningful with After=, but so is Requisite= itself. We also catch cases like a oneshot having RemainAfterExit= true making us start up properly in such a setting, but then removing it, reloading the unit, and restarting it. In such a case, we go down due to restart propagation before them, and our start job waits on theirs, properly failing with the JOB_DEPENDENCY result. This covers cases where ConditionXYZ= creates a similar situation as well.	2019-02-15 13:42:54 +01:00
Alberts Muktupāvels	52c6c9eaec	core: when we uninstall a job, add unit to dbus queue Commit `e6d05912cb` added unit to dbus queue on job install. Do same on job uninstall to make sure we get PropertiesChanged signal.	2019-02-12 16:55:45 +01:00
Lennart Poettering	92e29d82e6	tree-wide: fix some trailing whitespace @bl33pbl0p, please fix your editor (Apparently you never configured the source tree? If you did, then the git pre-commit hook would have been enabled which doesn't allow committing non-whitespace clean stuff...)	2019-01-17 20:06:28 +01:00
bl33pbl0p	28d78d0726	Log the job being merged Makes it easier to understand what was merged (and easier to realize why). Example is a start job running, and another unit triggering a verify-active job. It is not clear what job was it that from baz.service that merged into the installed job for bar.service in the debug logs. This makes it useful when debugging issues. Jan 15 11:45:58 jupiter systemd[1218]: baz.service: Trying to enqueue job baz.service/start/replace Jan 15 11:45:58 jupiter systemd[1218]: baz.service: Installed new job baz.service/start as 498 Jan 15 11:45:58 jupiter systemd[1218]: bar.service: Merged into installed job bar.service/start as 497 Jan 15 11:45:58 jupiter systemd[1218]: baz.service: Enqueued job baz.service/start as 498 It becomes: Jan 15 11:45:58 jupiter systemd[1218]: bar.service: Merged bar.service/verify-active into installed job bar.service/start as 497	2019-01-16 12:34:54 +01:00
Lennart Poettering	b17c9620c8	core: rework how we deserialize jobs Let's add a helper call unit_deserialize_job() for this purpose, and let's move registration in the global jobs hash table into job_install_deserialized() so that it is done after all superficial checks are done, and before transitioning into installed states, so that rollback code is not necessary anymore.	2018-12-12 11:15:07 +01:00
Lennart Poettering	48235ad6b7	job: be more careful when removing job object from jobs hash table Let's validate that the ID is actually allocated to us before removing a job. This is relevant as various bits of code will call job_free() on partially set up Job objects, and we really shouldn't remove another job object accidentally from the hash table, when the set up didn't complete.	2018-12-12 11:15:07 +01:00
Lennart Poettering	4a53080be6	core: don't track jobs-finishing-during-reload explicitly Memory management is borked for this, and moreover this is unnecessary since `f0831ed2a0`, i.e. since coldplug() and catchup() are two different concepts: the former restoring the state from before a reload, the latter then adjusting it again to the actual status in effect after the reload. Fixes: #10716 Mostly reverts: #8803	2018-12-12 11:15:06 +01:00
Lennart Poettering	728ba51e98	job: update job_free() to follow our usual return-NULL style	2018-12-12 11:14:26 +01:00
Zbigniew Jędrzejewski-Szmek	2d479ff1cc	Merge pull request #10963 from poettering/bus-force-state-change-signal force PropertiesChanged bus signal on all unit state changes	2018-12-06 16:42:21 +01:00
Yu Watanabe	f2a3de0116	tree-wide: add whitespace between type and variable name	2018-12-04 09:29:54 +01:00
Lennart Poettering	e6d05912cb	core: when we install a job, announce this via the bus Whenever we enqueue a job, we should announce this on the bus, hence add both the job and the unit to the dbus queues. (Why both? The former should be obvious, the latter because we send out Job properties). In most cases adding these to the queue is not necessary, as other properties tend to change at the same time and result in a change being sent out. However, let's clean this up and make it explicit.	2018-12-01 12:53:26 +01:00
Lennart Poettering	7af67e9a8b	core: allow to set exit status when using SuccessAction=/FailureAction=exit in units This adds SuccessActionExitStatus= and FailureActionExitStatus= that may be used to configure the exit status to propagate in when SuccessAction=exit or FailureAction=exit is used. When not specified let's also propagate the exit status of the main process we fork off for the unit.	2018-11-27 09:44:40 +01:00
Zbigniew Jędrzejewski-Szmek	baaa35ad70	coccinelle: make use of SYNTHETIC_ERRNO Ideally, coccinelle would strip unnecessary braces too. But I do not see any option in coccinelle for this, so instead, I edited the patch text using search&replace to remove the braces. Unfortunately this is not fully automatic, in particular it didn't deal well with if-else-if-else blocks and ifdefs, so there is an increased likelihood of some bugs in such spots. I also removed part of the patch that coccinelle generated for udev, where we return -1 for failure. This should be fixed independently.	2018-11-22 10:54:38 +01:00
Lennart Poettering	ff30a86bd4	job: simplify status message extraction As @keszybz points out these fields are always here, there's no point in checking if they are NULL or not.	2018-11-16 15:30:36 +01:00
Lennart Poettering	9a80f2f453	job: when a job was skipped due to a failed condition, log about it Previously we'd neither show console status output nor log output. Let's fix that, and still log something.	2018-11-16 15:30:36 +01:00
Lennart Poettering	6e64994d69	core: make unit_start() return a distinguishable error code in case conditions didn't hold Ideally we'd even propagate this all the way to the client, by having a separate JobType enum value for this. But it's hard to add this without breaking compat, hence for now let's at least internally propagate this case differently from the case "already on it". This is then used to call job_finish_and_invalidate() slightly differently, with the already= parameter false, as in the failed condition case no message was likely produced so far.	2018-11-16 15:22:48 +01:00
Lennart Poettering	0e2b4a822e	job: add two explanatory comments	2018-11-16 15:22:48 +01:00
Lennart Poettering	a69b3872ac	job: let's remove one comparison and reduce indentation level by one	2018-11-16 15:22:48 +01:00
Lennart Poettering	b344b363ce	job: also include job ID in log messages when we begin with a job	2018-11-16 15:22:48 +01:00
Lennart Poettering	33a3fdd978	core: move unit_status_emit_starting_stopping_reloading() and related calls to job.c This call is only used by job.c and very specific to job handling. Moreover the very similar logic of job_emit_status_message() is already in job.c. Hence, let's clean this up, and move both sets of functions to job.c, and rename them a bit so that they express precisely what they do: 1. unit_status_emit_starting_stopping_reloading() → job_emit_begin_status_message() 2. job_emit_status_message() → job_emit_done_status_message() The first call is after all what we call when we begin with the execution of a job, and the second call what we call when we are done with it. Just some moving and renaming, no other changes, and hence no change in behaviour.	2018-11-16 15:22:48 +01:00
Lennart Poettering	f8c34706f5	job: add log message when we can't enable the job run event source	2018-11-16 15:22:48 +01:00
Lennart Poettering	8ebd9175db	job: add comment for EAGAIN job run case	2018-11-16 15:22:48 +01:00
Lennart Poettering	ea2c0e4526	job: minor coding style tweaks	2018-11-16 15:22:48 +01:00
Lennart Poettering	1cd81629e1	job: include JOB_ID field in log message about jobs	2018-11-16 15:22:48 +01:00
Lennart Poettering	d68c645bd3	core: rework serialization Let's be more careful with what we serialize: let's ensure we never serialize strings that are longer than LONG_LINE_MAX, so that we know we can read them back with read_line(…, LONG_LINE_MAX, …) safely. In order to implement this all serialization functions are moved to serialize.[ch], and internally will do line size checks. We'd rather skip a serialization line (with a loud warning) than write an overly long line out. Of course, this is just a second level protection, after all the data we serialize shouldn't be this long in the first place. While we are at it also clean up logging: while serializing make sure to always log about errors immediately. Also, (void)ify all calls we don't expect errors in (or catch errors as part of the general fflush_and_check() at the end).	2018-10-26 10:52:41 +02:00
Lennart Poettering	8948b3415d	core: when deserializing state always use read_line(…, LONG_LINE_MAX, …) This should be much better than fgets(), as we can read substantially longer lines and overly long lines result in proper errors. Fixes a vulnerability discovered by Jann Horn at Google. CVE-2018-15686 LP: #1796402 https://bugzilla.redhat.com/show_bug.cgi?id=1639071	2018-10-26 10:40:01 +02:00
Lennart Poettering	108e8de655	Merge pull request #10439 from poettering/job-struct-init three trivial simplifications/clean-ups	2018-10-17 22:55:00 +02:00
Lennart Poettering	15ec102145	job: add lots of colons to log messages	2018-10-17 21:18:09 +02:00
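
The ordering described in `da8e178296` (run queue sorted by CPU weight, then nice level, then unit type, then unit name) can be illustrated with a small priority-queue sketch. This is not systemd's C implementation: the `Job` fields, the numeric `unit_type`, and the sort directions (higher CPU weight first, lower nice level first) are assumptions made for illustration only. The commit message confirms just the list of criteria and that the unit name serves as the final tie-breaker.

```python
import heapq
from dataclasses import dataclass


@dataclass(frozen=True)
class Job:
    """Hypothetical stand-in for a queued job; not systemd's Job struct."""
    unit_name: str
    unit_type: int   # assumed numeric unit type, compared by value
    cpu_weight: int  # assumed: larger weight should run earlier
    nice: int        # assumed: lower nice level should run earlier


def sort_key(job):
    # Criteria in the order listed in the commit message; the unit name
    # comes last so that the result never depends on insertion order.
    return (-job.cpu_weight, job.nice, job.unit_type, job.unit_name)


def drain_run_queue(jobs):
    """Return unit names in the order a priority queue would release them."""
    heap = [(sort_key(job), job) for job in jobs]
    heapq.heapify(heap)
    order = []
    while heap:
        _, job = heapq.heappop(heap)
        order.append(job.unit_name)
    return order
```

Because every key ends in the unit name, two runs over the same set of jobs produce the same order regardless of how the jobs were enqueued, which is the property the commit is after.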

219 commits