systemd service files as blogged have issues on reboot #3148

Closed
mbaldessari opened this issue May 17, 2019 · 17 comments
Labels
kind/bug Categorizes issue or PR as related to a bug. locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments.

Comments

@mbaldessari

mbaldessari commented May 17, 2019

**BUG REPORT**

/kind bug

Description

After configuring a service file to start a podman container as documented at https://podman.io/blogs/2018/09/13/systemd.html, we noticed that on reboot all processes inside the container receive SIGKILL, and the container has no chance to terminate gracefully.

Steps to reproduce the issue:
```
sudo podman pull docker.io/redis
sudo podman run -d --name redis -p 6379:6379 redis
```

/etc/systemd/system/redis.service:
```
[Unit]
Description=Redis Podman container
Wants=syslog.service

[Service]
Restart=always
ExecStart=/usr/bin/podman start -a redis
ExecStop=/usr/bin/podman stop -t 10 redis

[Install]
WantedBy=multi-user.target
```

```
sudo systemctl enable redis.service
sudo systemctl start redis.service
sudo reboot
```
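After the reboot, the shutdown ordering is easiest to confirm from the previous boot's journal. A minimal way to check, assuming persistent journaling is enabled (otherwise `-b -1` has nothing to read):

```
# Inspect the previous boot's shutdown sequence:
journalctl -b -1 -u redis.service          # our unit's stop job
journalctl -b -1 | grep -E 'libpod|redis'  # the scopes being SIGKILLed first
```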

Describe the results you received:

```
May 16 10:05:22 podmanreboot systemd[1]: Stopping Restore /run/initramfs on shutdown...
May 16 10:05:22 podmanreboot systemd[1]: libpod-conmon-7bfbd4ae7b10b983bc65c8a55450a2294d6e7d0b659a2d0de0d8ba2f205def55.scope: Killing process 1168 (conmon) with signal SIGKILL.
May 16 10:05:22 podmanreboot systemd[1]: Stopped libpod-conmon-7bfbd4ae7b10b983bc65c8a55450a2294d6e7d0b659a2d0de0d8ba2f205def55.scope.
May 16 10:05:22 podmanreboot systemd[1]: Stopping Authorization Manager...
May 16 10:05:22 podmanreboot systemd[1]: libpod-7bfbd4ae7b10b983bc65c8a55450a2294d6e7d0b659a2d0de0d8ba2f205def55.scope: Killing process 1201 (redis-server) with signal SIGKILL.
May 16 10:05:22 podmanreboot systemd[1]: Stopped libcontainer container 7bfbd4ae7b10b983bc65c8a55450a2294d6e7d0b659a2d0de0d8ba2f205def55.
May 16 10:05:22 podmanreboot systemd[1]: libpod-7bfbd4ae7b10b983bc65c8a55450a2294d6e7d0b659a2d0de0d8ba2f205def55.scope: Consumed 133ms CPU time
May 16 10:05:22 podmanreboot systemd[1]: Removed slice machine.slice.
May 16 10:05:22 podmanreboot systemd[1]: Removed slice system-sshd\x2dkeygen.slice.
May 16 10:05:22 podmanreboot systemd[1]: Stopping irqbalance daemon...
May 16 10:05:22 podmanreboot systemd[1]: Stopped target Login Prompts.
May 16 10:05:22 podmanreboot systemd[1]: Stopping Serial Getty on ttyS0...
May 16 10:05:22 podmanreboot systemd[7088]: Stopped target Default.
May 16 10:05:22 podmanreboot systemd[7088]: Stopped target Basic System.
May 16 10:05:22 podmanreboot systemd[7088]: Stopped target Paths.
May 16 10:05:22 podmanreboot systemd[7088]: Stopped target Sockets.
May 16 10:05:22 podmanreboot systemd[1]: Stopping Redis Podman container...
```

Notice how the processes in the redis container receive SIGKILL almost immediately after the reboot is issued, and only later does systemd try to stop the podman container ('Stopping Redis Podman container...').

Describe the results you expected:
We expected the processes to receive a graceful SIGTERM first ('Stopping Redis Podman container...'), giving the container a chance to shut down cleanly.

Additional information you deem important (e.g. issue happens only occasionally):

Seems pretty reproducible so far.

Output of podman version:

```
podman version 1.2.0 (and 1.0.0)
```

Output of podman info --debug:

```
debug:
  compiler: gc
  git commit: ""
  go version: go1.11.5
  podman version: 1.2.0
host:
  BuildahVersion: 1.7.2
  Conmon:
    package: podman-1.2.0-1.git3bd528e.module+el8+3135+c5113def.x86_64
    path: /usr/libexec/podman/conmon
    version: 'conmon version 1.14.0-dev, commit: f6f8d3c7d46e7cceeb52585acdafaee26991fcb0-dirty'
  Distribution:
    distribution: '"rhel"'
    version: "8.0"
  MemFree: 3270287360
  MemTotal: 3863539712
  OCIRuntime:
    package: runc-1.0.0-54.rc5.dev.git2abd837.module+el8+2769+577ad176.x86_64
    path: /usr/bin/runc
    version: 'runc version spec: 1.0.0'
  SwapFree: 0
  SwapTotal: 0
  arch: amd64
  cpus: 4
  hostname: podmanreboot
  kernel: 4.18.0-80.el8.x86_64
  os: linux
  rootless: false
  uptime: 1h 2m 35.6s (Approximately 0.04 days)
insecure registries:
  registries:
  - brew-pulp-docker01.web.prod.ext.phx2.redhat.com:8888
registries:
  registries:
  - registry.redhat.io
  - quay.io
  - docker.io
store:
  ConfigFile: /etc/containers/storage.conf
  ContainerStore:
    number: 1
  GraphDriverName: overlay
  GraphOptions: null
  GraphRoot: /var/lib/containers/storage
  GraphStatus:
    Backing Filesystem: xfs
    Native Overlay Diff: "true"
    Supports d_type: "true"
    Using metacopy: "false"
  ImageStore:
    number: 1
  RunRoot: /var/run/containers/storage
  VolumePath: /var/lib/containers/storage/volumes
```

Additional environment details (AWS, VirtualBox, physical, etc.):
Issue first observed with podman from rhel8: podman-1.0.0-2.git921f98f.module+el8+2785+ff8a053f.x86_64

We also observed the problem with podman-1.2.0-1.git3bd528e.module+el8+3135+c5113def.x86_64

@openshift-ci-robot openshift-ci-robot added the kind/bug Categorizes issue or PR as related to a bug. label May 17, 2019
@mheon
Member

mheon commented May 17, 2019

On reboot, as in as the system shuts down, or when it comes back up?

@mbaldessari
Author

As the system shuts down

@mheon
Member

mheon commented May 17, 2019

So ExecStop is specified, but systemd has decided to instead send SIGKILL aggressively to everything.

Could we be talking about https://bugzilla.redhat.com/show_bug.cgi?id=1394937?

@mbaldessari
Author

Potentially, yes, I guess. What caught my attention was this line: 'systemd[1]: libpod-conmon-7bfbd4ae7b10b983bc65c8a55450a2294d6e7d0b659a2d0de0d8ba2f205def55.scope: Killing process 1168 (conmon) with signal SIGKILL.' I.e., I am wondering whether systemd somehow mishandles (ignores the dependencies of?) the .scope units that podman creates?
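For reference, the scope units named in the log can be inspected before shutting down to see which ordering dependencies (if any) they carry. A quick sketch, reusing the container ID from the log above:

```
# Show the cgroup hierarchy podman created (the libpod-* scopes live under machine.slice):
systemd-cgls

# Dump the ordering/timeout properties of the conmon scope
# (substitute your actual container ID):
systemctl show libpod-conmon-7bfbd4ae7b10b983bc65c8a55450a2294d6e7d0b659a2d0de0d8ba2f205def55.scope \
  -p Before -p After -p TimeoutStopUSec
```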

@rhatdan
Member

rhatdan commented May 17, 2019

@mrunalp PTAL
I know that CRI-O had to deal with something here. Mrunal would know more about it.

@cjeanner

Hello there!
Small note: the .scope units are transient, created via the systemd socket directly by conmon and/or libpod (at least, that is what their unit names suggest). Apparently they are set to be stopped/removed at a really early stage of the shutdown process. I'm pretty sure that's the issue we're hitting right now.

Would love to get @mrunalp's feedback on whether CRI-O has already dealt with this issue :).
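As an aside, the early-kill behaviour of dependency-less transient scopes is easy to reproduce without podman at all; a minimal sketch using systemd-run (the same mechanism, not what libpod does internally):

```
# Create a transient scope, much like conmon/libpod do:
sudo systemd-run --scope --unit=demo sleep 600 &

# demo.scope has no ordering dependencies, so on shutdown
# systemd is free to kill it at a very early stage:
systemctl show demo.scope -p After -p Before
```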

@haircommander
Collaborator

There's an upstream fix proposed for runc here: opencontainers/runc#2062

@mrunalp
Collaborator

mrunalp commented Jun 6, 2019

Yeah, the runc PR should probably help. Could you test with that change and see if it fixes the issue?
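For anyone else who wants to try it, a typical way to test the proposed patch, assuming a Go toolchain and runc's usual build dependencies (a sketch; adjust the install path for your distro):

```
git clone https://github.com/opencontainers/runc
cd runc
# Fetch and check out the proposed fix from PR #2062:
git fetch origin pull/2062/head && git checkout FETCH_HEAD
make
# Replace the packaged runc (back up the original first):
sudo install -m 0755 runc /usr/bin/runc
```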

@mbaldessari
Author

So I tried with an installed runc with the #2062 patch applied, to no avail. Is updating runc, restarting the box, and then rerunning the test sufficient? I left some details about my testing here: https://bugzilla.redhat.com/show_bug.cgi?id=1710871#c11

@mheon
Member

mheon commented Jun 12, 2019

If I'm correct, we also need to teach Podman to take advantage of the new runc feature - correct? @mrunalp

@mbaldessari
Author

Just an update here: since I got back from PTO I have been unable to reproduce this downstream. I'll try a bit more in the coming days, but it is definitely not occurring as consistently as it did before. Not sure what changed under me, though.

@baude
Member

baude commented Aug 2, 2019

I believe this is fixed and am therefore closing.

@baude baude closed this as completed Aug 2, 2019
@mheon
Member

mheon commented Aug 2, 2019

This is not fixed.

@mheon mheon reopened this Aug 2, 2019
@baude
Member

baude commented Aug 4, 2019

What needs to be done, @mheon, so we can get it assigned?

@mheon
Member

mheon commented Aug 4, 2019

Core issue: CGroup scopes are created by Podman and the OCI runtime - one for conmon, one for the container process. These scopes have no dependencies, so during shutdown systemd will whack them with a SIGTERM and then, after a brief timeout, a SIGKILL. Under some circumstances we don't want this to happen - for example, the OpenStack team wants an ordered shutdown, dictated by systemd and their orchestration service.

The conmon scope is probably simple to deal with - #3474 should fix it once I figure out why the tests are red: just set a long kill timeout to give the container time to stop.
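For illustration only, the property involved can be seen with systemd-run; presumably #3474 sets the equivalent over the D-Bus API when creating the conmon scope (a sketch, not the actual Podman code):

```
# A transient scope with a generous stop timeout: systemd waits this
# long after SIGTERM before escalating to SIGKILL:
sudo systemd-run --scope -p TimeoutStopSec=60 --unit=demo-timeout sleep 600 &
systemctl show demo-timeout.scope -p TimeoutStopUSec
```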

The container scope is a lot more complicated. We don't control the application in the container, so we don't know which signals are safe to send it - we can't use the same approach as for conmon. We could use systemd dependencies if the container was run from a unit file, but the chain here needs to be inverted: the systemd unit that launched the container needs to depend on the container's CGroup scope, yet the unit file exists before the container is launched, and the container's CGroup doesn't exist until launch.

My initial proposal was to ditch CGroups entirely (optionally) - #3581

Limitations of that approach:

  • Requires OCI runtime support (present in crun, not in runc - I have patches, but it could be tricky to get them upstream)
  • When CGroups are disabled, we absolutely must use a PID namespace, because we have no other way of tracking the processes in the container
  • Resource limits will not work, since we have no CGroups to enforce them. Limits from the launching unit file can mostly substitute for this.
  • Pause, unpause, stats, top, and probably some more commands just don't work; they need CGroups.

I view this as a long-term enhancement, as OpenStack has (mostly) resolved the issue by adding an intermediate systemd unit to handle dependencies (a rough sketch of that idea follows). This is 1.6.x timeframe or later.
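For reference, a sketch of the general shape of that workaround; the unit and service names here are hypothetical, and the real dependencies would be dictated by the orchestration service:

```
# /etc/systemd/system/redis-container.service (hypothetical sketch)
[Unit]
Description=Intermediate unit owning the redis container lifecycle
# Carry the ordering the transient scopes lack: this unit's ExecStop
# runs before the services it is ordered After= are torn down.
After=network.target orchestrator.service
Requires=orchestrator.service

[Service]
Restart=always
# Leave enough room for 'podman stop -t 10' to deliver SIGTERM and wait:
TimeoutStopSec=30
ExecStart=/usr/bin/podman start -a redis
ExecStop=/usr/bin/podman stop -t 10 redis

[Install]
WantedBy=multi-user.target
```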

@mheon
Member

mheon commented Aug 4, 2019

Oh, I should mention that OpenStack rejected the initial approach - they need --pid=host, which conflicts with the mandatory PID namespace.

@baude
Member

baude commented Sep 23, 2019

We believe this is fixed and as such are closing it; please re-open if the problem persists.

@baude baude closed this as completed Sep 23, 2019
@github-actions github-actions bot added the locked - please file new issue/PR Assist humans wanting to comment on an old issue or PR with locked comments. label Sep 23, 2023
@github-actions github-actions bot locked as resolved and limited conversation to collaborators Sep 23, 2023