checkpoint: support lazy migration #1541

adrianreber · 2017-08-01T15:54:47Z

With the help of userfaultfd CRIU supports lazy migration. Lazy
migration means that memory pages are only transferred from the
migration source to the migration destination on page fault.

This enables to reduce the downtime during process or container
migration to a minimum as the memory does not need to be transferred
during migration.

Lazy migration currently depends on userfaultfd being available on the
current Linux kernel and if the used CRIU version supports lazy
migration. Both dependencies can be checked by querying CRIU via RPC if
the lazy migration feature is available. Using feature checking instead
of version comparison enables runC to use CRIU features from the
criu-dev branch. This way the user can decide if lazy migration should
be available by choosing the right kernel and CRIU branch.

To use lazy migration the CRIU process during dump needs to dump
everything besides the memory pages and then it opens a network port
waiting for remote page fault requests:

 # runc checkpoint httpd --lazy-pages --page-server 0.0.0.0:27 \
  --status-fd /tmp/postcopy-pipe

In this example CRIU will hang/wait once it has opened the network port
and wait for network connection. As runC waits for CRIU to finish it
will also hang until the lazy migration has finished. To know when the
restore on the destination side can start the '--status-fd' parameter is
used:

 #️ runc checkpoint --help | grep status
  --status-fd value   criu writes \0 to this FD once lazy-pages is ready

The parameter '--status-fd' is directly from CRIU and this way the
process outside of runC which controls the migration knows exactly when
to transfer the checkpoint (without memory pages) to the destination and
that the restore can be started.

On the destination side it is necessary to start CRIU in 'lazy-pages'
mode like this:

 # criu lazy-pages --page-server --address 192.168.122.3 --port 27 \
  -D checkpoint

and tell runC to do a lazy restore:

 # runc restore -d --image-path checkpoint --work-path checkpoint \
  --lazy-pages httpd

If both processes on the restore side have the same working directory
'criu lazy-pages' creates a unix domain socket where it waits for
requests from the actual restore. runC starts CRIU restore in lazy
restore mode and talks to 'criu lazy-pages' that it wants to restore
memory pages on demand. CRIU continues to restore the process and once
the process is running and accesses the first non-existing memory page
the 'criu lazy-pages' server will request the page from the source
system. Thus all pages from the source system will be transferred to the
destination system. Once all pages have been transferred runC on the
source system will end and the container will have finished migration.

This can also be combined with CRIU's pre-copy support. The combination
of pre-copy and post-copy (lazy migration) provides the possibility to
migrate containers with minimal downtimes.

Some additional background about post-copy migration can be found in
these articles:

https://lisas.de/~adrian/?p=1253
https://lisas.de/~adrian/?p=1183

Signed-off-by: Adrian Reber [email protected]

adrianreber · 2017-08-01T15:56:06Z

This includes the same commits as #1535 as it needs them. The only new commits are the two latest commits.

adrianreber · 2017-08-01T16:14:02Z

@rppt: FYI

adrianreber · 2017-08-02T16:40:26Z

Updated with the review results of #1535

adrianreber · 2017-08-07T14:31:38Z

@avagin: Do you have any comments on this?

crosbymichael · 2017-08-07T15:05:52Z

@kolyshkin can you also take a look at this change? Thanks!

avagin · 2017-08-08T21:54:18Z

@adrianreber we need a test case for this. How it will be integrated with phaul? (https://github.com/xemul/criu/tree/criu-dev/phaul)

avagin · 2017-08-08T21:57:28Z

The lazy migration feature is in a criu development branch and we are going to release it in CRIU 3.4 (Sep 2017).

adrianreber · 2017-08-09T08:47:59Z

@avagin, I was also thinking about the test case but as the container is not totally destroyed before complete memory migration my first attempts failed as it was not possible to restore the same container with the same name. I will try to change the name, that might work. Let me try to add a working test-case for lazy migration into runC. I will update this PR.

About p.haul: Not that it really belongs here, but p.haul feels kind of abandoned, especially with the difficulties of integrating p.haul and all the container engines in a useful way. So I am also leaning more towards replacing the pre-copy code in runC with CRIU's go migration library. So the main question is p.haul in its current form still alive or not? For me it seems not very alive in its current form.

xemul · 2017-08-10T13:26:16Z

@adrianreber , the py version of the p.haul (that sits in a separate repo) is indeed abandoned. Mostly for the reasons you've mentioned -- too hard to integrate python code with anything else. At the same time the go p.haul, that sits in the criu repo is the place where support for criu live migration features will (well, should) go.

xemul · 2017-08-10T13:27:15Z

@avagin , 3.4 is going to be in August. And lazy pages are thus aimed at 3.5.

adrianreber · 2017-08-10T19:51:31Z

@avagin test case now included

adrianreber · 2017-09-04T09:28:11Z

Rebased after recent breakage. Any reviewers?

crosbymichael · 2017-09-05T19:59:02Z

libcontainer/container_linux.go

+
+	data := make([]byte, 1)
+	count, _ := r.Read(data)
+	logrus.Debugf("%d:%s", count, status)


can you remove these debug statements?

crosbymichael · 2017-09-05T19:59:37Z

libcontainer/container_linux.go

+func waitForCriuLazyServer(r *os.File, status string) error {
+
+	data := make([]byte, 1)
+	count, _ := r.Read(data)


Can you add error handling for various calls in this function? Read, OpenFile, and Write are all missing

Before adding the actual lazy migration support, this adds the feature check for lazy-pages. Right now lazy migration, which is based on userfaultd is only available in the criu-dev branch and not yet in a release. As the check does not dependent on a certain version but on a CRIU feature which can be queried it can be part of runC without a new version check depending on a feature from criu-dev. Signed-off-by: Adrian Reber <[email protected]>

With the help of userfaultfd CRIU supports lazy migration. Lazy migration means that memory pages are only transferred from the migration source to the migration destination on page fault. This enables to reduce the downtime during process or container migration to a minimum as the memory does not need to be transferred during migration. Lazy migration currently depends on userfaultfd being available on the current Linux kernel and if the used CRIU version supports lazy migration. Both dependencies can be checked by querying CRIU via RPC if the lazy migration feature is available. Using feature checking instead of version comparison enables runC to use CRIU features from the criu-dev branch. This way the user can decide if lazy migration should be available by choosing the right kernel and CRIU branch. To use lazy migration the CRIU process during dump needs to dump everything besides the memory pages and then it opens a network port waiting for remote page fault requests: # runc checkpoint httpd --lazy-pages --page-server 0.0.0.0:27 \ --status-fd /tmp/postcopy-pipe In this example CRIU will hang/wait once it has opened the network port and wait for network connection. As runC waits for CRIU to finish it will also hang until the lazy migration has finished. To know when the restore on the destination side can start the '--status-fd' parameter is used: #️ runc checkpoint --help | grep status --status-fd value criu writes \0 to this FD once lazy-pages is ready The parameter '--status-fd' is directly from CRIU and this way the process outside of runC which controls the migration knows exactly when to transfer the checkpoint (without memory pages) to the destination and that the restore can be started. On the destination side it is necessary to start CRIU in 'lazy-pages' mode like this: # criu lazy-pages --page-server --address 192.168.122.3 --port 27 \ -D checkpoint and tell runC to do a lazy restore: # runc restore -d --image-path checkpoint --work-path checkpoint \ --lazy-pages httpd If both processes on the restore side have the same working directory 'criu lazy-pages' creates a unix domain socket where it waits for requests from the actual restore. runC starts CRIU restore in lazy restore mode and talks to 'criu lazy-pages' that it wants to restore memory pages on demand. CRIU continues to restore the process and once the process is running and accesses the first non-existing memory page the 'criu lazy-pages' server will request the page from the source system. Thus all pages from the source system will be transferred to the destination system. Once all pages have been transferred runC on the source system will end and the container will have finished migration. This can also be combined with CRIU's pre-copy support. The combination of pre-copy and post-copy (lazy migration) provides the possibility to migrate containers with minimal downtimes. Some additional background about post-copy migration can be found in these articles: https://lisas.de/~adrian/?p=1253 https://lisas.de/~adrian/?p=1183 Signed-off-by: Adrian Reber <[email protected]>

The lazy-pages test case is not as straight forward as the other test cases. This is related to the fact that restoring requires a different name if restored on the same host. During 'runc checkpoint' the container is not destroyed before all memory pages have been transferred to the destination and thus the same container name cannot be used. As real world usage will rather migrate a container from one system to another than lazy migrate a container on the same host this is only problematic for this test case. Another reason is that it requires starting 'runc checkpoint' and 'criu lazy-pages' in the background as those process need to be running to start the final restore 'runc restore'. CRIU upstream is currently discussing to automatically start 'criu lazy-pages' which would simplify the lazy-pages test case a bit. The handling and checking of the background processes make the test case not the most elegant as at one point a 'sleep 2' is required to make sure that 'runc checkpoint' had time to do its thing before looking at log files. Before running the actual test criu is called in feature checking mode to make sure lazy migration is in the test case criu enabled. If not, the test is skipped. Signed-off-by: Adrian Reber <[email protected]>

crosbymichael · 2017-09-06T14:48:23Z

LGTM

mrunalp · 2017-09-07T20:25:01Z

LGTM

avagin · 2018-09-17T18:07:00Z

tests/integration/checkpoint.bats

+  # there is some basic error. If the lazy migration is ready can
+  # be handled by $lazy_pipe. Which probably will always be ready
+  # after sleeping two seconds.
+  sleep 2


It isn't only ugly, it is probably a signal that the interface is not designed properly

@adrianreber I think we can discuss this interface on LPC

adrianreber force-pushed the lazy branch from 6ec21ef to 126b1bf Compare August 2, 2017 16:39

adrianreber mentioned this pull request Aug 4, 2017

Use CRIU VERSION RPC if available #1535

Merged

adrianreber force-pushed the lazy branch from 126b1bf to 21f3396 Compare August 5, 2017 06:18

adrianreber force-pushed the lazy branch from 21f3396 to bbff3fd Compare August 10, 2017 19:43

adrianreber force-pushed the lazy branch from bbff3fd to b31dec8 Compare September 4, 2017 08:46

crosbymichael reviewed Sep 5, 2017

View reviewed changes

adrianreber force-pushed the lazy branch from b31dec8 to 69d43b4 Compare September 6, 2017 12:34

adrianreber added 3 commits September 6, 2017 12:35

adrianreber force-pushed the lazy branch from 69d43b4 to ec26065 Compare September 6, 2017 12:36

mrunalp merged commit 7e036aa into opencontainers:master Sep 7, 2017

adrianreber deleted the lazy branch September 8, 2017 05:34

cyphar mentioned this pull request Feb 24, 2018

VERSION: release v1.0.0-rc5 #1739

Merged

avagin reviewed Sep 17, 2018

View reviewed changes

rst0git mentioned this pull request Nov 12, 2021

Page Server dumping failed checkpoint-restore/criu#1652

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

checkpoint: support lazy migration #1541

checkpoint: support lazy migration #1541

adrianreber commented Aug 1, 2017 •

edited by cyphar

Loading

adrianreber commented Aug 1, 2017

adrianreber commented Aug 1, 2017

adrianreber commented Aug 2, 2017

adrianreber commented Aug 7, 2017

crosbymichael commented Aug 7, 2017

avagin commented Aug 8, 2017

avagin commented Aug 8, 2017

adrianreber commented Aug 9, 2017

xemul commented Aug 10, 2017

xemul commented Aug 10, 2017

adrianreber commented Aug 10, 2017

adrianreber commented Sep 4, 2017

crosbymichael Sep 5, 2017

crosbymichael Sep 5, 2017

crosbymichael commented Sep 6, 2017 •

edited by caniszczyk

Loading

mrunalp commented Sep 7, 2017 •

edited by caniszczyk

Loading

avagin Sep 17, 2018

avagin Sep 17, 2018

adrianreber Sep 17, 2018

checkpoint: support lazy migration #1541

checkpoint: support lazy migration #1541

Conversation

adrianreber commented Aug 1, 2017 • edited by cyphar Loading

adrianreber commented Aug 1, 2017

adrianreber commented Aug 1, 2017

adrianreber commented Aug 2, 2017

adrianreber commented Aug 7, 2017

crosbymichael commented Aug 7, 2017

avagin commented Aug 8, 2017

avagin commented Aug 8, 2017

adrianreber commented Aug 9, 2017

xemul commented Aug 10, 2017

xemul commented Aug 10, 2017

adrianreber commented Aug 10, 2017

adrianreber commented Sep 4, 2017

crosbymichael Sep 5, 2017

Choose a reason for hiding this comment

crosbymichael Sep 5, 2017

Choose a reason for hiding this comment

crosbymichael commented Sep 6, 2017 • edited by caniszczyk Loading

mrunalp commented Sep 7, 2017 • edited by caniszczyk Loading

avagin Sep 17, 2018

Choose a reason for hiding this comment

avagin Sep 17, 2018

Choose a reason for hiding this comment

adrianreber Sep 17, 2018

Choose a reason for hiding this comment

adrianreber commented Aug 1, 2017 •

edited by cyphar

Loading

crosbymichael commented Sep 6, 2017 •

edited by caniszczyk

Loading

mrunalp commented Sep 7, 2017 •

edited by caniszczyk

Loading