daemon: reinit health monitor on live-restore

The container may have been running without health probes for an
indeterminate amount of time. The container may have become unhealthy in
the interim. We should probe it sooner than in steady-state, while also
giving it some leeway to recover from e.g. timed-out connections. This
is easy to achieve by probing the container like a freshly-started one.
The original author of health-checks came to the same conclusion; the
health monitor was reinitialized on live-restored containers before
v17.11.0, when health monitoring of live-restored containers was
accidentally broken. Revert to the original behavior.

Signed-off-by: Cory Snider <csnider@mirantis.com>
Author: Cory Snider <csnider@mirantis.com>
Date:   2024-01-08 19:32:21 -05:00
Parent: 6b1baf8dd2
Commit: 0e62dbadcd
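
What follows is a minimal, self-contained sketch of why re-initializing the monitor differs from merely updating it. The types, fields, and function bodies are hypothetical stand-ins, not the daemon's actual implementation (the real initHealthMonitor and updateHealthMonitor operate on the daemon's container state); the sketch only illustrates how resetting the health state makes a live-restored container get the start-up probing schedule and grace period again.

// Sketch only: hypothetical types standing in for the daemon's internals,
// to illustrate why re-initializing the monitor (rather than merely
// updating it) treats a live-restored container like a freshly started one.
package main

import (
	"fmt"
	"time"
)

// healthState is a simplified stand-in for a container's health-check state.
type healthState struct {
	Status        string // "starting", "healthy", or "unhealthy"
	FailingStreak int    // consecutive probe failures
}

// probeConfig is a simplified stand-in for the health-check configuration.
type probeConfig struct {
	Interval      time.Duration // steady-state time between probes
	StartInterval time.Duration // (assumed) shorter interval used while starting
	StartPeriod   time.Duration // grace period during which failures don't count
}

// initHealthMonitor models re-initializing: the health state is reset as if
// the container had just started, so it is probed on the start-up schedule
// (sooner than steady-state) and gets the start-period grace to recover.
func initHealthMonitor(h *healthState, cfg probeConfig) {
	h.Status = "starting"
	h.FailingStreak = 0
	fmt.Printf("probing every %v for up to %v, then every %v\n",
		cfg.StartInterval, cfg.StartPeriod, cfg.Interval)
}

// updateHealthMonitor models merely resuming the probe loop: the container
// keeps whatever status and failing streak it had before the daemon went
// away, even though no probes ran in the interim.
func updateHealthMonitor(h *healthState, cfg probeConfig) {
	fmt.Printf("resuming steady-state probes every %v with status=%q streak=%d\n",
		cfg.Interval, h.Status, h.FailingStreak)
}

func main() {
	h := &healthState{Status: "healthy"}
	cfg := probeConfig{
		Interval:      30 * time.Second,
		StartInterval: 5 * time.Second,
		StartPeriod:   60 * time.Second,
	}

	updateHealthMonitor(h, cfg) // pre-change behavior on live-restore
	initHealthMonitor(h, cfg)   // post-change behavior on live-restore
}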


@@ -442,7 +442,7 @@ func (daemon *Daemon) restore(cfg *configStore) error {
c.Lock()
c.Paused = false
daemon.setStateCounter(c)
-	daemon.updateHealthMonitor(c)
+	daemon.initHealthMonitor(c)
if err := c.CheckpointTo(daemon.containersReplica); err != nil {
baseLogger.WithError(err).Error("failed to update paused container state")
}
@@ -451,7 +451,7 @@ func (daemon *Daemon) restore(cfg *configStore) error {
case !c.IsPaused() && alive:
logger(c).Debug("restoring healthcheck")
c.Lock()
-	daemon.updateHealthMonitor(c)
+	daemon.initHealthMonitor(c)
c.Unlock()
}
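
As a rough, after-the-fact check of the new behavior, one could inspect a live-restored container over the Docker API and confirm its health status has been reset to "starting" instead of carrying over the pre-restart value. The snippet below is a hypothetical verification aid, not part of this change; it assumes a daemon running with live-restore enabled and a container that defines a HEALTHCHECK, and uses the Docker Go SDK (github.com/docker/docker/client).

// Rough check: after the daemon restarts with live-restore, a container with
// a healthcheck should report "starting" again until its probes pass, rather
// than keeping its stale pre-restart status.
package main

import (
	"context"
	"fmt"
	"log"
	"os"

	"github.com/docker/docker/client"
)

func main() {
	if len(os.Args) < 2 {
		log.Fatal("usage: health-status <container>")
	}
	containerID := os.Args[1] // ID or name of the live-restored container

	cli, err := client.NewClientWithOpts(client.FromEnv, client.WithAPIVersionNegotiation())
	if err != nil {
		log.Fatal(err)
	}
	defer cli.Close()

	info, err := cli.ContainerInspect(context.Background(), containerID)
	if err != nil {
		log.Fatal(err)
	}
	if info.State == nil || info.State.Health == nil {
		log.Fatal("container has no healthcheck configured")
	}

	// With this change, a just-restored container reports "starting" until its
	// probes pass again; before the fix it kept its last pre-restart status.
	fmt.Printf("health status: %s (failing streak: %d)\n",
		info.State.Health.Status, info.State.Health.FailingStreak)
}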