beenull/moby

Author	SHA1	Message	Date
Djordje Lukic	32d58144fd	c8d: Use reference counting while mounting a snapshot Some snapshotters (like overlayfs or zfs) can't mount the same directories twice. For example if the same directroy is used as an upper directory in two mounts the kernel will output this warning: overlayfs: upperdir is in-use as upperdir/workdir of another mount, accessing files from both mounts will result in undefined behavior. And indeed accessing the files from both mounts will result in an "No such file or directory" error. This change introduces reference counts for the mounts, if a directory is already mounted the mount interface will only increment the mount counter and return the mount target effectively making sure that the filesystem doesn't end up in an undefined behavior. Signed-off-by: Djordje Lukic <djordje.lukic@docker.com>	2023-06-07 15:50:01 +02:00
Cory Snider	8a094fe609	daemon: ensure OCI options play nicely together Audit the OCI spec options used for Linux containers to ensure they are less order-dependent. Ensure they don't assume that any pointer fields are non-nil and that they don't unintentionally clobber mutations to the spec applied by other options. Signed-off-by: Cory Snider <csnider@mirantis.com>	2023-06-06 13:10:05 -04:00
Cory Snider	dea870f4ea	daemon: stop setting container resources to zero Many of the fields in LinuxResources struct are pointers to scalars for some reason, presumably to differentiate between set-to-zero and unset when unmarshaling from JSON, despite zero being outside the acceptable range for the corresponding kernel tunables. When creating the OCI spec for a container, the daemon sets the container's OCI spec CPUShares and BlkioWeight parameters to zero when the corresponding Docker container configuration values are zero, signifying unset, despite the minimum acceptable value for CPUShares being two, and BlkioWeight ten. This has gone unnoticed as runC does not distingiush set-to-zero from unset as it also uses zero internally to represent unset for those fields. However, kata-containers v3.2.0-alpha.3 tries to apply the explicit-zero resource parameters to the container, exactly as instructed, and fails loudly. The OCI runtime-spec is silent on how the runtime should handle the case when those parameters are explicitly set to out-of-range values and kata's behaviour is not unreasonable, so the daemon must therefore be in the wrong. Translate unset values in the Docker container's resources HostConfig to omit the corresponding fields in the container's OCI spec when starting and updating a container in order to maximize compatibility with runtimes. Signed-off-by: Cory Snider <csnider@mirantis.com>	2023-06-06 12:13:05 -04:00
Jan Garcia	8c4dfc9e6a	vendor: github.com/rootless-containers/rootlesskit v1.1.1 Signed-off-by: Jan Garcia <github-public@n-garcia.com>	2023-06-06 09:26:32 +02:00
Jan Garcia	0b1c1877c5	update RootlessKit to v1.1.1 Signed-off-by: Jan Garcia <github-public@n-garcia.com>	2023-06-06 09:26:27 +02:00
Cory Snider	9ff169ccf4	daemon: modernize oci_linux_test.go Switch to using t.TempDir() instead of rolling our own. Clean up mounts leaked by the tests as otherwise the tests fail due to the leaked mounts because unlike the old cleanup code, t.TempDir() cleanup does not ignore errors from os.RemoveAll. Signed-off-by: Cory Snider <csnider@mirantis.com>	2023-06-05 18:30:30 -04:00
Bjorn Neergaard	1aae94074e	Merge pull request #45696 from neersighted/check_config_bpf_2 contrib/check-config: move xt_bpf check to overlay section	2023-06-05 14:03:03 -06:00
Sebastiaan van Stijn	ccfc2e5d8d	Merge pull request #45697 from vvoland/dockerfile-separate-cli-2 Dockerfile: Separate cli follow-up	2023-06-05 18:23:09 +02:00
Paweł Gronowski	0f9c8e684a	Dockerfile: Move dockercli to base-dev Avoids invalidation of dev-systemd-true and dev-base when changing the CLI version/repository. Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>	2023-06-05 17:02:48 +02:00
Paweł Gronowski	5433b88e2d	hack/cli.sh: Quiet origin cleanup Don't show `error: No such remote: 'origin'` error when building for the first time and the cached git repository doesn't a remote yet. Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>	2023-06-05 17:02:46 +02:00
Bjorn Neergaard	9aecb906fb	Merge pull request #45358 from vvoland/dockerfile-separate-cli Dockerfile: Use separate cli for shell and integration-cli	2023-06-05 08:17:34 -06:00
Bjorn Neergaard	800ea039ec	contrib/check-config: move xt_bpf check to overlay section Signed-off-by: Bjorn Neergaard <bjorn.neergaard@docker.com>	2023-06-05 08:11:05 -06:00
Paweł Gronowski	49f76a34b5	Dockerfile/shell: Install buildx cli plugin Installs the buildx cli plugin in the container shell by default. Previously user had to manually download the buildx binary to use buildkit. Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>	2023-06-05 15:25:23 +02:00
Paweł Gronowski	17c99f7164	Dockerfile: Use separate cli for shell and integration-cli Use separate cli for integration-cli to allow use newer CLI for interactive dev shell usage. Both versions can be overriden with DOCKERCLI_VERSION or DOCKERCLI_INTEGRATION_VERSION. Binary is downloaded from download.docker.com if it's available, otherwise it's built from the source. For backwards compatibility DOCKER_CLI_PATH overrides BOTH clis. Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>	2023-06-05 15:25:21 +02:00
Sebastiaan van Stijn	717cd0aaa8	Merge pull request #45694 from neersighted/check-config_xt_bpf contrib/check-config: check for xt_bpf	2023-06-03 23:56:25 +02:00
Sebastiaan van Stijn	1916c996d6	Merge pull request #45664 from thaJeztah/bump_swarmkit vendor: github.com/moby/swarmkit/v2 v2.0.0-20230531205928-01bb7a41396b	2023-06-03 18:05:03 +02:00
Bjorn Neergaard	1910fdde81	contrib/check-config: check for xt_bpf We omit xt_u32 as it's optional; since we will remove support for this module in the future, it's simpler to check for xt_bpf, which will become the new baseline. Related issues: * https://github.com/microsoft/WSL/issues/10029#issuecomment-1574440255 * https://github.com/docker/for-win/issues/13450#issuecomment-1574443139 Signed-off-by: Bjorn Neergaard <bjorn.neergaard@docker.com>	2023-06-02 19:31:13 -06:00
Sebastiaan van Stijn	4d34c1cd68	Merge pull request #45690 from vvoland/builder-use-moby-exporter-by-default builder-next: Set moby exporter as default	2023-06-02 17:04:28 +02:00
Paweł Gronowski	d63569c73d	builder-next: Set moby exporter as default Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>	2023-06-02 15:38:53 +02:00
Bjorn Neergaard	7bdeb1dfc1	Merge pull request #45644 from vvoland/c8d-load-unpack-attestation c8d/load: Don't unpack pseudo images	2023-06-02 07:20:33 -06:00
Bjorn Neergaard	ff7ca32b79	Merge pull request #45676 from thaJeztah/dockerfile_more_link Dockerfile: use COPY --link for source code as well	2023-06-02 06:45:25 -06:00
Sebastiaan van Stijn	79c7d26495	Merge pull request #45670 from thaJeztah/c8d_useragent_more_details containerd: add c8d version and storage-driver to User-Agent	2023-06-02 11:24:02 +02:00
Paweł Gronowski	4295806736	c8d/handlers: Handle error in walkPresentChildren Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>	2023-06-02 10:23:03 +02:00
Sebastiaan van Stijn	ff2342154b	Dockerfile: use COPY --link for source code as well I missed the most important COPY in `637ca59375` Copying the source code into the dev-container does not depend on the parent layers, so can use the --link option as well. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2023-06-01 22:17:09 +02:00
Cory Snider	cc51f0b3d3	Merge pull request #43980 from corhere/rcu-daemon-config daemon: read-copy-update the daemon config	2023-06-01 21:34:14 +02:00
Cory Snider	0f6eeecac0	daemon: consolidate runtimes config validation The daemon has made a habit of mutating the DefaultRuntime and Runtimes values in the Config struct to merge defaults. This would be fine if it was a part of the regular configuration loading and merging process, as is done with other config options. The trouble is it does so in surprising places, such as in functions with 'verify' or 'validate' in their name. It has been necessary in order to validate that the user has not defined a custom runtime named "runc" which would shadow the built-in runtime of the same name. Other daemon code depends on the runtime named "runc" always being defined in the config, but merging it with the user config at the same time as the other defaults are merged would trip the validation. The root of the issue is that the daemon has used the same config values for both validating the daemon runtime configuration as supplied by the user and for keeping track of which runtimes have been set up by the daemon. Now that a completely separate value is used for the latter purpose, surprising contortions are no longer required to make the validation work as intended. Consolidate the validation of the runtimes config and merging of the built-in runtimes into the daemon.setupRuntimes() function. Set the result of merging the built-in runtimes config and default default runtime on the returned runtimes struct, without back-propagating it onto the config.Config argument. Signed-off-by: Cory Snider <csnider@mirantis.com>	2023-06-01 14:45:25 -04:00
Cory Snider	d222bf097c	daemon: reload runtimes w/o breaking containers The existing runtimes reload logic went to great lengths to replace the directory containing runtime wrapper scripts as atomically as possible within the limitations of the Linux filesystem ABI. Trouble is, atomically swapping the wrapper scripts directory solves the wrong problem! The runtime configuration is "locked in" when a container is started, including the path to the runC binary. If a container is started with a runtime which requires a daemon-managed wrapper script and then the daemon is reloaded with a config which no longer requires the wrapper script (i.e. some args -> no args, or the runtime is dropped from the config), that container would become unmanageable. Any attempts to stop, exec or otherwise perform lifecycle management operations on the container are likely to fail due to the wrapper script no longer existing at its original path. Atomically swapping the wrapper scripts is also incompatible with the read-copy-update paradigm for reloading configuration. A handler in the daemon could retain a reference to the pre-reload configuration for an indeterminate amount of time after the daemon configuration has been reloaded and updated. It is possible for the daemon to attempt to start a container using a deleted wrapper script if a request to run a container races a reload. Solve the problem of deleting referenced wrapper scripts by ensuring that all wrapper scripts are immutable for the lifetime of the daemon process. Any given runtime wrapper script must always exist with the same contents, no matter how many times the daemon config is reloaded, or what changes are made to the config. This is accomplished by using everyone's favourite design pattern: content-addressable storage. Each wrapper script file name is suffixed with the SHA-256 digest of its contents to (probabilistically) guarantee immutability without needing any concurrency control. Stale runtime wrapper scripts are only cleaned up on the next daemon restart. Split the derived runtimes configuration from the user-supplied configuration to have a place to store derived state without mutating the user-supplied configuration or exposing daemon internals in API struct types. Hold the derived state and the user-supplied configuration in a single struct value so that they can be updated as an atomic unit. Signed-off-by: Cory Snider <csnider@mirantis.com>	2023-06-01 14:45:25 -04:00
Cory Snider	0b592467d9	daemon: read-copy-update the daemon config Ensure data-race-free access to the daemon configuration without locking by mutating a deep copy of the config and atomically storing a pointer to the copy into the daemon-wide configStore value. Any operations which need to read from the daemon config must capture the configStore value only once and pass it around to guarantee a consistent view of the config. Signed-off-by: Cory Snider <csnider@mirantis.com>	2023-06-01 14:45:24 -04:00
Cory Snider	742ac6e275	daemon: make config reloading more transactional Config reloading has interleaved validations and other fallible operations with mutating the live daemon configuration. The daemon configuration could be left in a partially-reloaded state if any of the operations returns an error. Mutating a copy of the configuration and atomically swapping the config struct on success is not currently an option as config values are not copyable due to the presence of sync.Mutex fields. Introduce a two-phase commit protocol to defer any mutations of the daemon state until after all fallible operations have succeeded. Reload transactions are not yet entirely hermetic. The platform reloading logic for custom runtimes on *nix could still leave the directory of generated runtime wrapper scripts in an indeterminate state if an error is encountered. Signed-off-by: Cory Snider <csnider@mirantis.com>	2023-06-01 14:45:24 -04:00
Cory Snider	038449467e	Update BuildKit registry config on daemon reload Historically, daemon.RegistryHosts() has returned a docker.RegistryHosts callback function which closes over a point-in-time snapshot of the daemon configuration. When constructing the BuildKit builder at daemon startup, the return value of daemon.RegistryHosts() has been used. Therefore the BuildKit builder would use the registry configuration as it was at daemon startup for the life of the process, even if the registry configuration is changed and the configuration reloaded. Provide BuildKit with a RegistryHosts callback which reflects the live daemon configuration after reloads so that registry operations performed by BuildKit always use the same configuration as the rest of the daemon. Signed-off-by: Cory Snider <csnider@mirantis.com>	2023-06-01 14:45:21 -04:00
Cory Snider	982e4fb448	api/server: get features from a callback fn Passing around a bare pointer to the map of configured features in order to propagate to consumers changes to the configuration across reloads is dangerous. Map operations are not atomic, so concurrently reading from the map while it is being updated is a data race as there is no synchronization. Use a getter function to retrieve the current features map so the features can be retrieved race-free. Remove the unused features argument from the build router. Signed-off-by: Cory Snider <csnider@mirantis.com>	2023-06-01 14:43:27 -04:00
Sebastiaan van Stijn	d099e47e00	containerd: add c8d version and storage-driver to User-Agent With this patch, the user-agent has information about the containerd-client version and the storage-driver that's used when using the containerd-integration; time="2023-06-01T11:27:07.959822887Z" level=info msg="listening on [::]:5000" go.version=go1.19.9 instance.id=53590f34-096a-4fd1-9c58-d3b8eb7e5092 service=registry version=2.8.2 ... 172.18.0.1 - - [01/Jun/2023:11:30:12 +0000] "HEAD /v2/multifoo/blobs/sha256:c7ec7661263e5e597156f2281d97b160b91af56fa1fd2cc045061c7adac4babd HTTP/1.1" 404 157 "" "docker/dev go/go1.20.4 git-commit/8d67d0c1a8 kernel/5.15.49-linuxkit-pr os/linux arch/arm64 containerd-client/1.6.21+unknown storage-driver/overlayfs UpstreamClient(Docker-Client/24.0.2 \\(linux\\))" Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2023-06-01 18:21:58 +02:00
Sebastiaan van Stijn	a6da1480b5	dockerversion: DockerUserAgent(): allow custom versions to be passed Allow additional metadata to be passed as part of the generated User-Agent. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2023-06-01 18:21:58 +02:00
Sebastiaan van Stijn	9a1f2e6d7c	dockerversion: remove insertUpstreamUserAgent() It was not really "inserting" anything, just formatting and appending. Simplify this by changing this in to a `getUpstreamUserAgent()` function which returns the upstream User-Agent (if any) into a `UpstreamClient()`. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2023-06-01 18:21:58 +02:00
Sebastiaan van Stijn	ff40d2d787	dockerversion: simplify escapeStr() Use a const for the characters to escape, instead of implementing this as a generic escaping function. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2023-06-01 18:21:58 +02:00
Sebastiaan van Stijn	eb9a5392bc	dockerversion: add a basic unit-test Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2023-06-01 18:21:55 +02:00
Sebastiaan van Stijn	c679da9ae1	Merge pull request #45669 from thaJeztah/c8d_useragent containerd: set user-agent when pushing/pulling images	2023-06-01 18:10:24 +02:00
Sebastiaan van Stijn	90e87c4753	Merge pull request #45660 from thaJeztah/dockerfile_copy_link Dockerfile: use COPY --link to copy artifacts from build-stages	2023-06-01 16:26:01 +02:00
Sebastiaan van Stijn	66137ae429	containerd: set user-agent when pushing/pulling images Before this, the client would report itself as containerd, and the containerd version from the containerd go module: time="2023-06-01T09:43:21.907359755Z" level=info msg="listening on [::]:5000" go.version=go1.19.9 instance.id=67b89d83-eac0-4f85-b36b-b1b18e80bde1 service=registry version=2.8.2 ... 172.18.0.1 - - [01/Jun/2023:09:43:33 +0000] "HEAD /v2/multifoo/blobs/sha256:cb269d7c0c1ca22fb5a70342c3ed2196c57a825f94b3f0e5ce3aa8c55baee829 HTTP/1.1" 404 157 "" "containerd/1.6.21+unknown" With this patch, the user-agent has the docker daemon information; time="2023-06-01T11:27:07.959822887Z" level=info msg="listening on [::]:5000" go.version=go1.19.9 instance.id=53590f34-096a-4fd1-9c58-d3b8eb7e5092 service=registry version=2.8.2 ... 172.18.0.1 - - [01/Jun/2023:11:27:20 +0000] "HEAD /v2/multifoo/blobs/sha256:c7ec7661263e5e597156f2281d97b160b91af56fa1fd2cc045061c7adac4babd HTTP/1.1" 404 157 "" "docker/dev go/go1.20.4 git-commit/8d67d0c1a8 kernel/5.15.49-linuxkit-pr os/linux arch/arm64 UpstreamClient(Docker-Client/24.0.2 \\(linux\\))" Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2023-06-01 14:20:45 +02:00
Sebastiaan van Stijn	06aaf87aab	vendor: github.com/moby/swarmkit/v2 v2.0.0-20230531205928-01bb7a41396b - Fix timeouts from very long raft messages - fix: code optimization - update dependencies full diff: `75e92ce14f...01bb7a4139` Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2023-05-31 23:04:47 +02:00
Sebastiaan van Stijn	637ca59375	Dockerfile: use COPY --link to copy artifacts from build-stages Build-cache for the build-stages themselves are already invalidated if the base-images they're using is updated, and the COPY operations don't depend on previous steps (as there's no overlap between artifacts copied). Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2023-05-31 11:52:18 +02:00
Sebastiaan van Stijn	8d67d0c1a8	Merge pull request #45437 from thaJeztah/vendor_image_spec vendor: github.com/opencontainers/image-spec v1.1.0-rc3	2023-05-31 11:12:51 +02:00
Paweł Gronowski	4d3238dc0b	c8d/load: Don't unpack pseudo images Don't unpack image manifests which are not a real images that can't be unpacked. Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>	2023-05-31 10:47:26 +02:00
Paweł Gronowski	b08bff8ba3	c8d/load: Use walkImageManifests Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>	2023-05-31 10:47:25 +02:00
Paweł Gronowski	5210f48bfc	c8d/list: Use walkImageManifests Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>	2023-05-31 10:47:23 +02:00
Paweł Gronowski	fabc1d5bef	c8d: Add walkImageManifests and ImageManifest wrapper The default implementation of the containerd.Image interface provided by the containerd operates on the parent index/manifest list of the image and the platform matcher. This isn't convenient when a specific manifest is already known and it's redundant to search the whole index for a manifest that matches the given platform matcher. It can also result in a different manifest picked up than expected when multiple manifests with the same platform are present. This introduces a walkImageManifests which walks the provided image and calls a handler with a ImageManifest, which is a simple wrapper that implements containerd.Image interfaces and performs all containerd.Image operations against a platform specific manifest instead of the root manifest list/index. Signed-off-by: Paweł Gronowski <pawel.gronowski@docker.com>	2023-05-31 10:47:22 +02:00
Bjorn Neergaard	abc05cf335	Merge pull request #45645 from sebthom/patch-1 Update blogpost URL	2023-05-30 15:36:37 -06:00
Bjorn Neergaard	988f5ac342	Merge pull request #45647 from rumpl/fix-snapshotter-change c8d: Fix re-pull of an image when the snapshotter is changed	2023-05-30 15:32:55 -06:00
Cory Snider	d43b398746	Merge pull request #45657 from corhere/libn/setup-resolver-with-verbose-iptables libnetwork: fix resolver restore w/ chatty 'iptables -C'	2023-05-30 21:44:14 +02:00
Cory Snider	a25434654e	Merge pull request #45654 from corhere/libn/fix-embedded-resolver-live-reload libnetwork: fix sandbox restore	2023-05-30 21:43:46 +02:00

... 2 3 4 5 6 ...

46466 commits