0ct0pu5/moby

Author	SHA1	Message	Date
Sebastiaan van Stijn	f8795ed364	daemon: allow "builtin" as valid value for seccomp profiles This allows containers to use the embedded default profile if a different default is set (e.g. "unconfined") in the daemon configuration. Without this option, users would have to copy the default profile to a file in order to use the default. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-08-07 15:40:47 +02:00
Sebastiaan van Stijn	68e96f88ee	Fix daemon.json and daemon --seccomp-profile not accepting "unconfined" Commit `b237189e6c` implemented an option to set the default seccomp profile in the daemon configuration. When that PR was reviewed, it was discussed to have the option accept the path to a custom profile JSON file; https://github.com/moby/moby/pull/26276#issuecomment-253546966 However, in the implementation, the special "unconfined" value was not taken into account. The "unconfined" value is meant to disable seccomp (more factually: run with an empty profile). While it's likely possible to achieve this by creating a file with an empty (`{}`) profile, and passing the path to that file, it's inconsistent with the `--security-opt seccomp=unconfined` option on `docker run` and `docker create`, which is both confusing, and makes it harder to use (especially on Docker Desktop, where there's no direct access to the VM's filesystem). This patch adds the missing check for the special "unconfined" value. Co-authored-by: Tianon Gravi <admwiggin@gmail.com> Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-08-07 15:40:45 +02:00
Sebastiaan van Stijn	09cf117b31	api/types: hostconfig: create enum for CgroupnsMode Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-08-06 19:05:54 +02:00
Sebastiaan van Stijn	98f0f0dd87	api/types: hostconfig: define consts for IpcMode Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-08-06 19:05:51 +02:00
Brian Goff	9674540ccf	Merge pull request #42520 from thaJeztah/remove_lcow_step5_alternative Remove LCOW (step 5): volumes/mounts: remove LCOW code (alternative)	2021-07-26 10:24:52 -07:00
Justin Cormack	b337c70bdc	Merge pull request #42639 from thaJeztah/system_info_clean pkg/sysinfo: assorted cleanup/refactoring for handling warnings and logging	2021-07-19 15:17:07 +01:00
Sebastiaan van Stijn	9b795c3e50	pkg/sysinfo.New(), daemon.RawSysInfo(): remove "quiet" argument The "quiet" argument was only used in a single place (at daemon startup), and every other use had to pass "false" to prevent this function from logging warnings. Now that SysInfo contains the warnings that occurred when collecting the system information, we can leave it up to the caller to use those warnings (and log them if wanted). Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-07-14 23:10:07 +02:00
Sebastiaan van Stijn	115b37b8f7	daemon: use object literal for stats Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-07-11 14:16:13 +02:00
Sebastiaan van Stijn	300c11c7c9	volume/mounts: remove "containerOS" argument from NewParser (LCOW code) This changes mounts.NewParser() to create a parser for the current operating system, instead of one specific to a (possibly non-matching, in case of LCOW) OS. With the OS-specific handling being removed, the "OS" parameter is also removed from `daemon.verifyContainerSettings()`, and various other container-related functions. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-07-02 13:51:55 +02:00
Sebastiaan van Stijn	472f21b923	replace uses of deprecated containerd/sys.RunningInUserNS() This utility was moved to a separate package, which has no dependencies. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-06-18 11:01:24 +02:00
Sebastiaan van Stijn	2773f81aa5	Merge pull request #42445 from thaJeztah/bump_golang_ci [testing] ~update~ fix linting issues found by golangci-lint v1.40.1	2021-06-16 22:15:01 +02:00
Tianon Gravi	a060328874	Merge pull request #42472 from thaJeztah/improve_rootless_option daemon: improve handling of ROOTLESSKIT_PARENT_EUID	2021-06-11 13:03:31 -07:00
Sebastiaan van Stijn	bb17074119	reformat "nolint" comments Unlike regular comments, nolint comments should not have a leading space. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-06-10 13:03:42 +02:00
Akihiro Suda	0ad2293d0e	Merge pull request #41656 from thaJeztah/unexport_things	2021-06-08 12:07:40 +09:00
Sebastiaan van Stijn	aa4dce742f	daemon: improve handling of ROOTLESSKIT_PARENT_EUID - daemon.WithRootless(): make sure ROOTLESSKIT_PARENT_EUID is valid int - daemon.RawSysInfo(): minor simplification, and rename variable that clashed with imported package. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-06-05 21:12:32 +02:00
Brian Goff	4b981436fe	Fixup libnetwork lint errors Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2021-06-01 23:48:32 +00:00
Brian Goff	a0a473125b	Fix libnetwork imports After moving libnetwork to this repo, we need to update all the import paths for libnetwork to point to docker/docker/libnetwork instead of docker/libnetwork. This change implements that. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2021-06-01 21:51:23 +00:00
Sebastiaan van Stijn	bf07c06c63	daemon: move DefaultShimBinary, DefaultRuntimeBinary to config package Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-05-31 19:06:16 +02:00
Sebastiaan van Stijn	95d69658be	daemon: un-export VerifyCgroupDriver() it's only used internally, so no need to export Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-05-31 19:06:12 +02:00
Sebastiaan van Stijn	a506630e57	daemon: use sync.Once for systemd detection Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-05-31 19:06:10 +02:00
Sebastiaan van Stijn	e7ba5cacc6	daemon: un-export IsRunningSystemd() This utility was added after 19.03, and is only used in the daemon code itself, so we can un-export it, until there's an external use for it. Also updated the description, because the runc code already copied it from coreos/go-systemd, so better to describe the actual source. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-05-31 19:06:07 +02:00
Brian Goff	7f5e39bd4f	Use real root with 0701 perms Various dirs in /var/lib/docker contain data that needs to be mounted into a container. For this reason, these dirs are set to be owned by the remapped root user, otherwise there can be permissions issues. However, this unnecessarily exposes these dirs to an unprivileged user on the host. Instead, set the ownership of these dirs to the real root (or rather the UID/GID of dockerd) with 0701 permissions, which allows the remapped root to enter the directories but not read/write to them. The remapped root needs to enter these dirs so the container's rootfs can be configured... e.g. to mount /etc/resolv.conf. This prevents an unprivileged user from having read/write access to these dirs on the host. The flip side of this is now any user can enter these directories. Signed-off-by: Brian Goff <cpuguy83@gmail.com> (cherry picked from commit `e908cc3901`) Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2021-02-02 13:01:25 +01:00
gunadhya	64465f3b5f	Fix Error in daemon_unix.go and docker_cli_run_unit_test.go Signed-off-by: gunadhya <6939749+gunadhya@users.noreply.github.com>	2021-01-05 16:56:29 +05:30
Sebastiaan van Stijn	1c0af18c6c	vendor: opencontainers/selinux v1.8.0, and remove selinux build-tag and stubs full diff: https://github.com/opencontainers/selinux/compare/v1.7.0...v1.8.0 Remove "selinux" build tag Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-12-24 00:47:16 +01:00
Sebastiaan van Stijn	cf31b9622a	Merge pull request #41622 from bboehmke/ipv6_nat IPv6 iptables config option	2020-12-07 11:59:42 +01:00
Benjamin Böhmke	cd63cc846e	mark ip6tables as experimental feature Signed-off-by: Benjamin Böhmke <benjamin@boehmke.net>	2020-12-02 22:23:33 +01:00
Sebastiaan van Stijn	6458f750e1	use containerd/cgroups to detect cgroups v2 libcontainer does not guarantee a stable API, and is not intended for external consumers. this patch replaces some uses of libcontainer/cgroups with containerd/cgroups. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-11-09 15:00:32 +01:00
Benjamin Böhmke	66459cc623	Added ip6tables config option Signed-off-by: Benjamin Böhmke <benjamin@boehmke.net>	2020-11-05 16:18:23 +01:00
Sebastiaan van Stijn	182795cff6	Do not call mount.RecursiveUnmount() on Windows Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-10-29 23:00:16 +01:00
Sebastiaan van Stijn	cf7a5be0f2	daemon: don't adjust oom-score if score is 0 This patch makes two changes if --oom-score-adj is set to 0 - do not adjust the oom-score-adjust cgroup for dockerd - do not set the hard-coded -999 score for containerd if containerd is running as child process Before this change: oom-score-adj \| dockerd \| containerd as child-process --------------\|---------------\|---------------------------- - \| -500 \| -500 (same as dockerd) -100 \| -100 \| -100 (same as dockerd) 0 \| 0 \| -999 (hard-coded default) With this change: oom-score-adj \| dockerd \| containerd as child-process --------------\|---------------\|---------------------------- - \| -500 \| -500 (same as dockerd) -100 \| -100 \| -100 (same as dockerd) 0 \| not adjusted \| not adjusted Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-10-05 19:52:02 +02:00
Jeff Zvier	a7c279f203	Add more error messages for ops when a container limit uses a device which does not exist Signed-off-by: Jeff Zvier <zvier20@gmail.com>	2020-08-11 06:33:22 +08:00
Akihiro Suda	51e3cd4761	statsV2: implement Failcnt Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-07-30 14:31:20 +09:00
Akihiro Suda	b8ca7de823	Deprecate KernelMemory Kernel memory limit is not supported on cgroup v2. Even on cgroup v1, kernel memory limit (`kmem.limit_in_bytes`) has been deprecated since kernel 5.4. `0158115f70` Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-07-24 20:44:29 +09:00
Brian Goff	260c26b7be	Merge pull request #41016 from kolyshkin/cgroup-init	2020-07-16 11:26:52 -07:00
Brian Goff	61b73ee714	Merge pull request #41182 from cpuguy83/runtime_configure_shim	2020-07-14 14:16:04 -07:00
Brian Goff	f63f73a4a8	Configure shims from runtime config In dockerd we already have a concept of a "runtime", which specifies the OCI runtime to use (e.g. runc). This PR extends that config to add containerd shim configuration. This option is only exposed within the daemon itself (cannot be configured in daemon.json). This is due to issues in supporting unknown shims which will require more design work. What this change allows us to do is keep all the runtime config in one place. So the default "runc" runtime will just have its already existing shim config codified within the runtime config alone. I've also added 2 more "stock" runtimes which are basically runc+shimv1 and runc+shimv2. These new runtime configurations are: - io.containerd.runtime.v1.linux - runc + v1 shim using the V1 shim API - io.containerd.runc.v2 - runc + shim v2 These names coincide with the actual names of the containerd shims. This allows the user to essentially control what shim is going to be used by either specifying these as a `--runtime` on container create or by setting `--default-runtime` on the daemon. For custom/user-specified runtimes, the default shim config (currently shim v1) is used. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2020-07-13 14:18:02 -07:00
Sebastiaan van Stijn	d2e23405be	Set minimum memory limit to 6M, to account for higher startup memory use For some time, we defined a minimum limit for `--memory` limits to account for overhead during startup, and to supply a reasonable functional container. Changes in the runtime (runc) introduced a higher memory footprint during container startup, which now leads to obscure error-messages that are unfriendly for users: run --rm --memory=4m alpine echo success docker: Error response from daemon: OCI runtime create failed: container_linux.go:349: starting container process caused "process_linux.go:449: container init caused \"process_linux.go:415: setting cgroup config for procHooks process caused \\\"failed to write \\\\\\\"4194304\\\\\\\" to \\\\\\\"/sys/fs/cgroup/memory/docker/1254c8d63f85442e599b17dff895f4543c897755ee3bd9b56d5d3d17724b38d7/memory.limit_in_bytes\\\\\\\": write /sys/fs/cgroup/memory/docker/1254c8d63f85442e599b17dff895f4543c897755ee3bd9b56d5d3d17724b38d7/memory.limit_in_bytes: device or resource busy\\\"\"": unknown. ERRO[0000] error waiting for container: context canceled Containers that fail to start because of this limit will not be marked as OOMKilled, which makes it harder for users to find the cause of the failure. Note that this memory is only required during startup of the container. After the container was started, the container may not consume this memory, and limits could (manually) be lowered, for example, an alpine container running only a shell can run with 512k of memory; echo 524288 > /sys/fs/cgroup/memory/docker/acdd326419f0898be63b0463cfc81cd17fb34d2dae6f8aa3768ee6a075ca5c86/memory.limit_in_bytes However, restarting the container will reset that manual limit to the container's configuration. 
While `docker container update` would allow for the updated limit to be persisted, (re)starting the container after updating produces the same error message again, so we cannot use different limits for `docker run` / `docker create` and `docker update`. This patch raises the minimum memory limit to 6M, so that a better error-message is produced if a user tries to create a container with a memory-limit that is too low: docker create --memory=4m alpine echo success docker: Error response from daemon: Minimum memory limit allowed is 6MB. Possibly, this constraint could be handled by runc, so that different runtimes could set a best-matching limit (other runtimes may require less overhead). Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-07-01 13:29:07 +02:00
Kir Kolyshkin	e3cff19dd1	Untangle CPU RT controller init Commit `56f77d5ade` added code that is doing some very ugly things. In particular, calling cgroups.FindCgroupMountpointAndRoot() and daemon.SysInfoRaw() inside a recursively-called initCgroupsPath() is not a good thing to do. This commit tries to partially untangle this by moving some expensive checks and calls earlier, in a minimally invasive way (meaning I tried hard to not break any logic, however weird it is). This also removes double call to MkdirAll (not important, but it sticks out) and renames the function to better reflect what it's doing. Finally, this wraps some of the errors returned, and fixes the init function to not ignore the error from itself. This could be reworked more radically, but at least with this commit we are calling expensive functions once, and only if necessary. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-06-26 16:19:52 -07:00
Kir Kolyshkin	afbeaf6f29	pkg/sysinfo: rm duplicates The CPU CFS cgroup-aware scheduler is one single kernel feature, not two, so it does not make sense to have two separate booleans (CPUCfsQuota and CPUCfsPeriod). Merge these into CPUCfs. Same for CPU realtime. For compatibility reasons, /info stays the same for now. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-06-26 16:19:52 -07:00
Sebastiaan van Stijn	4534a7afc3	daemon: use containerd/sys to detect UserNamespaces The implementation in libcontainer/system is quite complicated, and we only use it to detect if user-namespaces are enabled. In addition, the implementation in containerd uses a sync.Once, so that detection (and reading/parsing `/proc/self/uid_map`) is only performed once. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-06-15 13:06:08 +02:00
Sebastiaan van Stijn	3aac5f0bbb	Merge pull request #41018 from akhilerm/identity-mapping remove group name from identity mapping	2020-06-08 15:15:05 +02:00
Akhil Mohan	7ad0da7051	remove group name from identity mapping NewIdentityMapping took group name as an argument, and used the group name also to parse the /etc/sub{uid,gid}. But as per linux man pages, the sub{uid,gid} file maps username or uid, not a group name. Therefore, all occurrences where mapping is used need to consider only username and uid. Code trying to map using gid and group name in the daemon is also removed. Signed-off-by: Akhil Mohan <akhil.mohan@mayadata.io>	2020-06-03 20:04:42 +05:30
Brian Goff	763f9e799b	Merge pull request #40846 from AkihiroSuda/cgroup2-use-systemd-by-default cgroup2: use "systemd" cgroup driver by default when available	2020-05-28 11:37:39 -07:00
Sebastiaan van Stijn	b453b64d04	Merge pull request #40845 from AkihiroSuda/allow-privileged-cgroupns-private-on-cgroup-v1 support `--privileged --cgroupns=private` on cgroup v1	2020-05-07 21:11:42 +02:00
Brian Goff	f6163d3f7a	Merge pull request #40673 from kolyshkin/scan Simplify daemon.overlaySupportsSelinux(), fix use of bufio.Scanner.Err()	2020-04-29 17:18:37 -07:00
Akihiro Suda	4714ab5d6c	cgroup2: use "systemd" cgroup driver by default when available The "systemd" cgroup driver is always preferred over "cgroupfs" on systemd-based hosts. This commit does not affect cgroup v1 hosts. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-04-22 05:13:37 +09:00
Akihiro Suda	33ee7941d4	support `--privileged --cgroupns=private` on cgroup v1 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-04-21 23:11:32 +09:00
Akihiro Suda	f350b53241	cgroup2: implement `docker info` ref: https://www.kernel.org/doc/html/latest/admin-guide/cgroup-v2.html Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-04-17 07:20:01 +09:00
Sebastiaan van Stijn	eb14d936bf	daemon: rename variables that collide with imported package names Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-04-14 17:22:23 +02:00
Sebastiaan van Stijn	5d040cbd16	daemon: fix capitalization of some functions Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-04-14 17:22:19 +02:00
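
The two seccomp commits at the top of this log (`f8795ed364`, `68e96f88ee`) describe three kinds of value for the daemon's seccomp profile setting: the special value "builtin" (the embedded default profile), the special value "unconfined" (seccomp disabled, i.e. an empty profile), and a path to a custom profile JSON file. The Go sketch below only illustrates that three-way distinction. Every identifier in it (`classifySeccompProfile`, `profileKind`, the constants) is hypothetical and not taken from the moby source, and treating an empty value as "builtin" is an assumption made for the example.

```go
package main

import "fmt"

// Hypothetical names for the two special values mentioned in the commit messages.
const (
	seccompBuiltin    = "builtin"
	seccompUnconfined = "unconfined"
)

// profileKind says how a configured seccomp profile value is interpreted.
type profileKind int

const (
	kindBuiltin    profileKind = iota // use the embedded default profile
	kindUnconfined                    // run without seccomp confinement
	kindFile                          // load a custom profile from a file path
)

// classifySeccompProfile maps a configured value to a kind and, for custom
// profiles, the file path to load. An empty value is treated as "builtin";
// that choice is an assumption of this sketch.
func classifySeccompProfile(value string) (profileKind, string) {
	switch value {
	case "", seccompBuiltin:
		return kindBuiltin, ""
	case seccompUnconfined:
		return kindUnconfined, ""
	default:
		return kindFile, value
	}
}

func main() {
	// The path below is an example value only.
	for _, v := range []string{"builtin", "unconfined", "/etc/docker/seccomp.json"} {
		kind, path := classifySeccompProfile(v)
		fmt.Printf("value=%q kind=%d path=%q\n", v, kind, path)
	}
}
```

Checking the special values before falling back to a file path is what keeps "unconfined" from being mistaken for a file name, which is the failure mode the second commit message describes.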


447 commits