beenull/moby

Author	SHA1	Message	Date
Akihiro Suda	b8ca7de823	Deprecate KernelMemory Kernel memory limit is not supported on cgroup v2. Even on cgroup v1, kernel memory limit (`kmem.limit_in_bytes`) has been deprecated since kernel 5.4. `0158115f70` Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-07-24 20:44:29 +09:00
Brian Goff	260c26b7be	Merge pull request #41016 from kolyshkin/cgroup-init	2020-07-16 11:26:52 -07:00
Brian Goff	61b73ee714	Merge pull request #41182 from cpuguy83/runtime_configure_shim	2020-07-14 14:16:04 -07:00
Brian Goff	f63f73a4a8	Configure shims from runtime config In dockerd we already have a concept of a "runtime", which specifies the OCI runtime to use (e.g. runc). This PR extends that config to add containerd shim configuration. This option is only exposed within the daemon itself (cannot be configured in daemon.json). This is due to issues in supporting unknown shims which will require more design work. What this change allows us to do is keep all the runtime config in one place. So the default "runc" runtime will just have it's already existing shim config codified within the runtime config alone. I've also added 2 more "stock" runtimes which are basically runc+shimv1 and runc+shimv2. These new runtime configurations are: - io.containerd.runtime.v1.linux - runc + v1 shim using the V1 shim API - io.containerd.runc.v2 - runc + shim v2 These names coincide with the actual names of the containerd shims. This allows the user to essentially control what shim is going to be used by either specifying these as a `--runtime` on container create or by setting `--default-runtime` on the daemon. For custom/user-specified runtimes, the default shim config (currently shim v1) is used. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2020-07-13 14:18:02 -07:00
Sebastiaan van Stijn	d2e23405be	Set minimum memory limit to 6M, to account for higher startup memory use For some time, we defined a minimum limit for `--memory` limits to account for overhead during startup, and to supply a reasonable functional container. Changes in the runtime (runc) introduced a higher memory footprint during container startup, which now lead to obscure error-messages that are unfriendly for users: run --rm --memory=4m alpine echo success docker: Error response from daemon: OCI runtime create failed: container_linux.go:349: starting container process caused "process_linux.go:449: container init caused \"process_linux.go:415: setting cgroup config for procHooks process caused \\\"failed to write \\\\\\\"4194304\\\\\\\" to \\\\\\\"/sys/fs/cgroup/memory/docker/1254c8d63f85442e599b17dff895f4543c897755ee3bd9b56d5d3d17724b38d7/memory.limit_in_bytes\\\\\\\": write /sys/fs/cgroup/memory/docker/1254c8d63f85442e599b17dff895f4543c897755ee3bd9b56d5d3d17724b38d7/memory.limit_in_bytes: device or resource busy\\\"\"": unknown. ERRO[0000] error waiting for container: context canceled Containers that fail to start because of this limit, will not be marked as OOMKilled, which makes it harder for users to find the cause of the failure. Note that _after_ this memory is only required during startup of the container. After the container was started, the container may not consume this memory, and limits could (manually) be lowered, for example, an alpine container running only a shell can run with 512k of memory; echo 524288 > /sys/fs/cgroup/memory/docker/acdd326419f0898be63b0463cfc81cd17fb34d2dae6f8aa3768ee6a075ca5c86/memory.limit_in_bytes However, restarting the container will reset that manual limit to the container's configuration. While `docker container update` would allow for the updated limit to be persisted, (re)starting the container after updating produces the same error message again, so we cannot use different limits for `docker run` / `docker create` and `docker update`. This patch raises the minimum memory limnit to 6M, so that a better error-message is produced if a user tries to create a container with a memory-limit that is too low: docker create --memory=4m alpine echo success docker: Error response from daemon: Minimum memory limit allowed is 6MB. Possibly, this constraint could be handled by runc, so that different runtimes could set a best-matching limit (other runtimes may require less overhead). Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-07-01 13:29:07 +02:00
Kir Kolyshkin	e3cff19dd1	Untangle CPU RT controller init Commit `56f77d5ade` added code that is doing some very ugly things. In partucular, calling cgroups.FindCgroupMountpointAndRoot() and daemon.SysInfoRaw() inside a recursively-called initCgroupsPath() not not a good thing to do. This commit tries to partially untangle this by moving some expensive checks and calls earlier, in a minimally invasive way (meaning I tried hard to not break any logic, however weird it is). This also removes double call to MkdirAll (not important, but it sticks out) and renames the function to better reflect what it's doing. Finally, this wraps some of the errors returned, and fixes the init function to not ignore the error from itself. This could be reworked more radically, but at least this this commit we are calling expensive functions once, and only if necessary. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-06-26 16:19:52 -07:00
Kir Kolyshkin	afbeaf6f29	pkg/sysinfo: rm duplicates The CPU CFS cgroup-aware scheduler is one single kernel feature, not two, so it does not make sense to have two separate booleans (CPUCfsQuota and CPUCfsPeriod). Merge these into CPUCfs. Same for CPU realtime. For compatibility reasons, /info stays the same for now. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-06-26 16:19:52 -07:00
Sebastiaan van Stijn	4534a7afc3	daemon: use containerd/sys to detect UserNamespaces The implementation in libcontainer/system is quite complicated, and we only use it to detect if user-namespaces are enabled. In addition, the implementation in containerd uses a sync.Once, so that detection (and reading/parsing `/proc/self/uid_map`) is only performed once. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-06-15 13:06:08 +02:00
Sebastiaan van Stijn	3aac5f0bbb	Merge pull request #41018 from akhilerm/identity-mapping remove group name from identity mapping	2020-06-08 15:15:05 +02:00
Akhil Mohan	7ad0da7051	remove group name from identity mapping NewIdentityMapping took group name as an argument, and used the group name also to parse the /etc/sub{uid,gui}. But as per linux man pages, the sub{uid,gid} file maps username or uid, not a group name. Therefore, all occurrences where mapping is used need to consider only username and uid. Code trying to map using gid and group name in the daemon is also removed. Signed-off-by: Akhil Mohan <akhil.mohan@mayadata.io>	2020-06-03 20:04:42 +05:30
Brian Goff	763f9e799b	Merge pull request #40846 from AkihiroSuda/cgroup2-use-systemd-by-default cgroup2: use "systemd" cgroup driver by default when available	2020-05-28 11:37:39 -07:00
Sebastiaan van Stijn	b453b64d04	Merge pull request #40845 from AkihiroSuda/allow-privileged-cgroupns-private-on-cgroup-v1 support `--privileged --cgroupns=private` on cgroup v1	2020-05-07 21:11:42 +02:00
Brian Goff	f6163d3f7a	Merge pull request #40673 from kolyshkin/scan Simplify daemon.overlaySupportsSelinux(), fix use of bufio.Scanner.Err()	2020-04-29 17:18:37 -07:00
Akihiro Suda	4714ab5d6c	cgroup2: use "systemd" cgroup driver by default when available The "systemd" cgroup driver is always preferred over "cgroupfs" on systemd-based hosts. This commit does not affect cgroup v1 hosts. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-04-22 05:13:37 +09:00
Akihiro Suda	33ee7941d4	support `--privileged --cgroupns=private` on cgroup v1 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-04-21 23:11:32 +09:00
Akihiro Suda	f350b53241	cgroup2: implement `docker info` ref: https://www.kernel.org/doc/html/latest/admin-guide/cgroup-v2.html Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-04-17 07:20:01 +09:00
Sebastiaan van Stijn	eb14d936bf	daemon: rename variables that collide with imported package names Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-04-14 17:22:23 +02:00
Sebastiaan van Stijn	5d040cbd16	daemon: fix capitalization of some functions Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2020-04-14 17:22:19 +02:00
Sebastiaan van Stijn	af0415257e	Merge pull request #40694 from kolyshkin/moby-sys-mount-part-II switch to moby/sys/{mount,mountinfo} part II	2020-04-02 21:52:21 +02:00
Akihiro Suda	3802830989	cgroup2: implement `docker stats` The following fields are unsupported: * BlkioStats: all fields other than IoServiceBytesRecursive * CPUStats: CPUUsage.PercpuUsage * MemoryStats: MaxUsage and Failcnt Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-04-02 17:51:34 +09:00
Kir Kolyshkin	5b658a0348	daemon.overlaySupportsSelinux: simplify check 1. Sscanf is very slow, and we don't use the first two fields -- get rid of it. 2. Since the field we search for is at the end of line and prepended by a space, we can just use strings.HaveSuffix. 3. Error checking for bufio.Scanner should be done after the Scan() loop, not inside it. Fixes: `885b29df09` Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-03-31 14:32:42 -07:00
Kir Kolyshkin	39048cf656	Really switch to moby/sys/mount* Switch to moby/sys/mount and mountinfo. Keep the pkg/mount for potential outside users. This commit was generated by the following bash script: ``` set -e -u -o pipefail for file in $(git grep -l 'docker/docker/pkg/mount"' \| grep -v ^pkg/mount); do sed -i -e 's#/docker/docker/pkg/mount"#/moby/sys/mount"#' \ -e 's#mount\.$GetMounts\\|Mounted\\|Info\\|[A-Za-z]*Filter$#mountinfo.\1#g' \ $file goimports -w $file done ``` Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2020-03-20 09:46:25 -07:00
Akihiro Suda	92e7f8f67c	daemon: fail early if rootless && cgroupdriver == "systemd" && cgroup v1 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-03-11 12:49:03 +09:00
Akihiro Suda	ca4b51868a	rootless: support `--exec-opt native.cgroupdriver=systemd` Support cgroup as in Rootless Podman. Requires cgroup v2 host with crun. Tested with Ubuntu 19.10 (kernel 5.3, systemd 242), crun v0.12.1. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-02-14 15:32:31 +09:00
Arko Dasgupta	f800d5f786	Set the bip network value as the subnet Dont assign the --bip value directly to the subnet for the default bridge. Instead use the network value from the ParseCIDR output Addresses: https://github.com/moby/moby/issues/40392 Signed-off-by: Arko Dasgupta <arko.dasgupta@docker.com>	2020-02-10 17:38:54 -08:00
Sebastiaan van Stijn	ca20bc4214	Merge pull request #40007 from arkodg/add-host-docker-internal Support host.docker.internal in dockerd on Linux	2020-01-27 13:42:26 +01:00
Arko Dasgupta	92e809a680	Support host.docker.internal in dockerd on Linux Docker Desktop (on MAC and Windows hosts) allows containers running inside a Linux VM to connect to the host using the host.docker.internal DNS name, which is implemented by VPNkit (DNS proxy on the host) This PR allows containers to connect to Linux hosts by appending a special string "host-gateway" to --add-host e.g. "--add-host=host.docker.internal:host-gateway" which adds host.docker.internal DNS entry in /etc/hosts and maps it to host-gateway-ip This PR also add a daemon flag call host-gateway-ip which defaults to the default bridge IP Docker Desktop will need to set this field to the Host Proxy IP so DNS requests for host.docker.internal can be routed to VPNkit Addresses: https://github.com/docker/for-linux/issues/264 Signed-off-by: Arko Dasgupta <arko.dasgupta@docker.com>	2020-01-22 13:30:00 -08:00
Sebastiaan van Stijn	be095a1859	Merge pull request #40366 from arkodg/check-cidr-ipv6 Handle the error case when fixed-cidr-ipv6 is empty and ipv6 is enabled	2020-01-14 13:53:45 +01:00
Arko Dasgupta	bdad16b0ee	Handle error case when fixed-cidr-ipv6 is empty When IPv6 is enabled, make sure fixed-cidr-ipv6 is set by the user since there is no default IPv6 local subnet in the IPAM Signed-off-by: Arko Dasgupta <arko.dasgupta@docker.com>	2020-01-13 09:56:41 -08:00
Akihiro Suda	491531c12b	cgroup2: mark cpu-rt-{period,runtime} unimplemented Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-01-01 02:58:40 +09:00
Akihiro Suda	19baeaca26	cgroup2: enable cgroup namespace by default For cgroup v1, we were unable to change the default because of compatibility issue. For cgroup v2, we should change the default right now because switching to cgroup v2 is already breaking change. See also containers/libpod#4363 containers/libpod#4374 Privileged containers also use cgroupns=private by default. https://github.com/containers/libpod/pull/4374#issuecomment-549776387 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-01-01 02:58:40 +09:00
Akihiro Suda	612343618d	cgroup2: use shim V2 * Requires containerd binaries from containerd/containerd#3799 . Metrics are unimplemented yet. * Works with crun v0.10.4, but `--security-opt seccomp=unconfined` is needed unless using master version of libseccomp ( containers/crun#156, seccomp/libseccomp#177 ) * Doesn't work with master runc yet * Resource limitations are unimplemented Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-01-01 02:58:40 +09:00
Yong Tang	f09dc2f4fc	Fix docker crash when creating namespaces with UID in /etc/subuid and /etc/subgid This fix tries to address the issue raised in 39353 where docker crash when creating namespaces with UID in /etc/subuid and /etc/subgid. The issue was that, mapping to `/etc/sub[u,g]id` in docker does not allow numeric ID. This fix fixes the issue by probing other combinations (uid:groupname, username:gid, uid:gid) when normal username:groupname fails. This fix fixes 39353. Signed-off-by: Yong Tang <yong.tang.github@outlook.com>	2019-11-07 20:17:11 +00:00
Sebastiaan van Stijn	9a7e96b5b7	Rename "v1" to "statsV1" follow-up to `27552ceb15`, where this was left as a review comment, but the PR was already merged. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2019-11-01 16:18:06 +01:00
Sebastiaan van Stijn	27552ceb15	bump containerd/cgroups 5fbad35c2a7e855762d3c60f2e474ffcad0d470a full diff: `c4b9ac5c76...5fbad35c2a` - containerd/cgroups#82 Add go module support - containerd/cgroups#96 Move metrics proto package to stats/v1 - containerd/cgroups#97 Allow overriding the default /proc folder in blkioController - containerd/cgroups#98 Allows ignoring memory modules - containerd/cgroups#99 Add Go 1.13 to Travis - containerd/cgroups#100 stats/v1: export per-cgroup stats Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2019-10-31 01:09:12 +01:00
Sebastiaan van Stijn	05469b5fa2	daemon: add "isWindows" const Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2019-10-17 23:49:43 +02:00
Sebastiaan van Stijn	422067ba7b	Return "invalid parameter" when linking to non-existing container Trying to link to a non-existing container is not valid, and should return an "invalid parameter" (400) error. Returning a "not found" error in this situation would make the client report the container's image could not be found. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2019-09-10 23:06:56 +02:00
Rob Gulewich	530f2d65c3	Explicity set Cgroup NS mode to "host" when running privileged Signed-off-by: Rob Gulewich <rgulewich@netflix.com>	2019-08-23 11:27:27 -07:00
Sebastiaan van Stijn	1ea8b413d1	initBridgeDriver: minor cleanup and linting fixes Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2019-08-09 18:34:35 +02:00
Dominic	5f0231bca1	cast Dev and Rdev of Stat_t to uint64 for mips Signed-off-by: Dominic <yindongchao@inspur.com> Signed-off-by: Dominic Yin <yindongchao@inspur.com>	2019-08-01 20:22:49 +08:00
Michael Crosby	a4a1e57e9d	Merge pull request #39496 from cpuguy83/fix_missing_dir_cleanup_file Ensure parent dir exists for mount cleanup file	2019-07-12 13:39:58 -04:00
Brian Goff	24ad2f486d	Add (hidden) flags to set containerd namespaces This allows our tests, which all share a containerd instance, to be a bit more isolated by setting the containerd namespaces to the generated daemon ID's rather than the default namespaces. This came about because I found in some cases we had test daemons failing to start (really very slow to start) because it was (seemingly) processing events from other tests. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2019-07-11 17:27:48 -07:00
Brian Goff	7725b88edc	Ensure parent dir exists for mount cleanup file While investigating a test failure, I found this in the logs: ``` time="2019-07-04T15:06:32.622506760Z" level=warning msg="Error while setting daemon root propagation, this is not generally critical but may cause some functionality to not work or fallback to less desirable behavior" dir=/go/src/github.com/docker/docker/bundles/test-integration/d1285b8250308/root error="error writing file to signal mount cleanup on shutdown: open /tmp/dxr/d1285b8250308/unmount-on-shutdown: no such file or directory" ``` This path is generated from the daemon's exec-root, which appears to not exist yet. This change just makes sure it exists before we try to write a file. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2019-07-11 13:30:36 -07:00
Akihiro Suda	153466ba0a	info: report cgroup driver as "none" when running rootless Previously `docker info` had reported "cgroupfs" as the cgroup driver but the driver wasn't actually used at all. This PR reports "none" as the cgroup driver so as to avoid confusion. e.g. kubeadm/kubelet will detect cgroupless-ness by checking this docker info field. https://github.com/rootless-containers/usernetes/pull/97 Note that user still cannot specify `native.cgroupdriver=none` manually. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2019-06-03 00:11:21 +09:00
frankyang	b9f31912de	bugfix: fetch the right device number which great than 255 Signed-off-by: frankyang <yyb196@gmail.com>	2019-05-16 15:32:59 +08:00
Rob Gulewich	072400fc4b	Make cgroup namespaces configurable This adds both a daemon-wide flag and a container creation property: - Set the `CgroupnsMode: "host\|private"` HostConfig property at container creation time to control what cgroup namespace the container is created in - Set the `--default-cgroupns-mode=host\|private` daemon flag to control what cgroup namespace containers are created in by default - Set the default if the daemon flag is unset to "host", for backward compatibility - Default to CgroupnsMode: "host" for client versions < 1.40 Signed-off-by: Rob Gulewich <rgulewich@netflix.com>	2019-05-07 10:22:16 -07:00
Sebastiaan van Stijn	ffa1728d4b	Normalize values for pids-limit - Don't set `PidsLimit` when creating a container and no limit was set (or the limit was set to "unlimited") - Don't set `PidsLimit` if the host does not have pids-limit support (previously "unlimited" was set). - Do not generate a warning if the host does not have pids-limit support, but pids-limit was set to unlimited (having no limit set, or the limit set to "unlimited" is equivalent, so no warning is nescessary in that case). - When updating a container, convert `0`, and `-1` to "unlimited" (`0`). Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2019-03-13 00:27:05 +01:00
Sebastiaan van Stijn	dd94555787	Merge pull request #32519 from darkowlzz/32443-docker-update-pids-limit Add pids-limit support in docker update	2019-02-23 15:20:59 +01:00
Sunny Gogoi	74eb258ffb	Add pids-limit support in docker update - Adds updating PidsLimit in UpdateContainer(). - Adds setting PidsLimit in toContainerResources(). Signed-off-by: Sunny Gogoi <indiasuny000@gmail.com> Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2019-02-21 14:17:38 -08:00
Akihiro Suda	ec87479b7e	allow running `dockerd` in an unprivileged user namespace (rootless mode) Please refer to `docs/rootless.md`. TLDR: * Make sure `/etc/subuid` and `/etc/subgid` contain the entry for you * `dockerd-rootless.sh --experimental` * `docker -H unix://$XDG_RUNTIME_DIR/docker.sock run ...` Signed-off-by: Akihiro Suda <suda.akihiro@lab.ntt.co.jp>	2019-02-04 00:24:27 +09:00
Akihiro Suda	2cb26cfe9c	Merge pull request #38301 from cyphar/waitgroup-limits daemon: switch to semaphore-gated WaitGroup for startup tasks	2018-12-22 00:07:55 +09:00
Aleksa Sarai	5a52917e4d	daemon: switch to semaphore-gated WaitGroup for startup tasks Many startup tasks have to run for each container, and thus using a WaitGroup (which doesn't have a limit to the number of parallel tasks) can result in Docker exceeding the NOFILE limit quite trivially. A more optimal solution is to have a parallelism limit by using a semaphore. In addition, several startup tasks were not parallelised previously which resulted in very long startup times. According to my testing, 20K dead containers resulted in ~6 minute startup times (during which time Docker is completely unusable). This patch fixes both issues, and the parallelStartupTimes factor chosen (128 * NumCPU) is based on my own significant testing of the 20K container case. This patch (on my machines) reduces the startup time from 6 minutes to less than a minute (ideally this could be further reduced by removing the need to scan all dead containers on startup -- but that's beyond the scope of this patchset). In order to avoid the NOFILE limit problem, we also detect this on-startup and if NOFILE < 2128NumCPU we will reduce the parallelism factor to avoid hitting NOFILE limits (but also emit a warning since this is almost certainly a mis-configuration). Signed-off-by: Aleksa Sarai <asarai@suse.de>	2018-12-21 21:51:02 +11:00
Sebastiaan van Stijn	f6002117a4	Extract container-config and container-hostconfig validation Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-19 13:09:12 +01:00
Sebastiaan van Stijn	b6e373c525	Rename verifyContainerResources to verifyPlatformContainerResources This validation function is platform-specific; rename it to be more explicit. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-19 10:24:09 +01:00
Sebastiaan van Stijn	e278678705	Remove unused argument from verifyPlatformContainerSettings Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-19 09:23:09 +01:00
Sebastiaan van Stijn	10c97b9357	Unify logging container validation warnings Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-19 09:15:21 +01:00
Sebastiaan van Stijn	2e23ef5350	Move port-publishing check to linux platform-check Windows does not have host-mode networking, so on Windows, this check was a no-op Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-18 22:46:05 +01:00
Sebastiaan van Stijn	57f1305e74	Move "OOM Kill disable" warning to the daemon Disabling the oom-killer for a container without setting a memory limit is dangerous, as it can result in the container consuming unlimited memory, without the kernel being able to kill it. A check for this situation is curently done in the CLI, but other consumers of the API won't receive this warning. This patch adds a check for this situation to the daemon, so that all consumers of the API will receive this warning. This patch will have one side-effect; docker cli's that also perform this check client-side will print the warning twice; this can be addressed by disabling the cli-side check for newer API versions, but will generate a bit of extra noise when using an older CLI. With this patch applied (and a cli that does not take the new warning into account); ``` docker create --oom-kill-disable busybox WARNING: OOM killer is disabled for the container, but no memory limit is set, this can result in the system running out of resources. 669933b9b237fa27da699483b5cf15355a9027050825146587a0e5be0d848adf docker run --rm --oom-kill-disable busybox WARNING: Disabling the OOM killer on containers without setting a '-m/--memory' limit may be dangerous. WARNING: OOM killer is disabled for the container, but no memory limit is set, this can result in the system running out of resources. ``` Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-18 22:30:56 +01:00
Andrew Hsu	78045a5419	use empty string as cgroup path to grab first find Signed-off-by: Andrew Hsu <andrewhsu@docker.com> Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-07 18:44:00 +01:00
Yong Tang	f023816608	Add memory.kernelTCP support for linux This fix tries to address the issue raised in 37038 where there were no memory.kernelTCP support for linux. This fix add MemoryKernelTCP to HostConfig, and pass the config to runtime-spec. Additional test case has been added. This fix fixes 37038. Signed-off-by: Yong Tang <yong.tang.github@outlook.com>	2018-11-26 21:03:08 +00:00
Justin Cormack	f8e876d761	Fix denial of service with large numbers in cpuset-cpus and cpuset-mems Using a value such as `--cpuset-mems=1-9223372036854775807` would cause `dockerd` to run out of memory allocating a map of the values in the validation code. Set limits to the normal limit of the number of CPUs, and improve the error handling. Reported by Huawei PSIRT. Signed-off-by: Justin Cormack <justin.cormack@docker.com> Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-10-05 15:09:02 +02:00
Tibor Vass	34eede0296	Remove 'docker-' prefix for containerd and runc binaries This allows to run the daemon in environments that have upstream containerd installed. Signed-off-by: Tibor Vass <tibor@docker.com>	2018-09-24 21:49:03 +00:00
Salahuddin Khan	763d839261	Add ADD/COPY --chown flag support to Windows This implements chown support on Windows. Built-in accounts as well as accounts included in the SAM database of the container are supported. NOTE: IDPair is now named Identity and IDMappings is now named IdentityMapping. The following are valid examples: ADD --chown=Guest . <some directory> COPY --chown=Administrator . <some directory> COPY --chown=Guests . <some directory> COPY --chown=ContainerUser . <some directory> On Windows an owner is only granted the permission to read the security descriptor and read/write the discretionary access control list. This fix also grants read/write and execute permissions to the owner. Signed-off-by: Salahuddin Khan <salah@docker.com>	2018-08-13 21:59:11 -07:00
Sebastiaan van Stijn	3737194b9f	daemon/*.go: fix some Wrap[f]/Warn[f] errors In particular, these two: > daemon/daemon_unix.go:1129: Wrapf format %v reads arg #1, but call has 0 args > daemon/kill.go:111: Warn call has possible formatting directive %s and a few more. Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com> Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-07-11 15:51:51 +02:00
Sebastiaan van Stijn	f23c00d870	Various code-cleanup remove unnescessary import aliases, brackets, and so on. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-05-23 17:50:54 +02:00
Alessandro Boch	173b3c364e	Allow user to control the default address pools - Via daemon flag --default-address-pools base=<CIDR>,size=<int> Signed-off-by: Elango Siva <elango@docker.com>	2018-04-30 11:14:08 -04:00
Sebastiaan van Stijn	cf9c48bb3e	Merge pull request #36879 from cpuguy83/extra_unmount_check Extra check before unmounting on shutdown	2018-04-20 17:08:11 -07:00
Brian Goff	6a70fd222b	Move mount parsing to separate package. This moves the platform specific stuff in a separate package and keeps the `volume` package and the defined interfaces light to import. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2018-04-19 06:35:54 -04:00
Brian Goff	c403f0036b	Extra check before unmounting on shutdown This makes sure that if the daemon root was already a self-binded mount (thus meaning the daemonc only performed a remount) that the daemon does not try to unmount. Example: ``` $ sudo mount --bind /var/lib/docker /var/lib/docker $ sudo dockerd & ``` Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2018-04-18 20:43:42 -04:00
Daniel Nephin	4ceea53b5e	Remove duplicate rootFSToAPIType Signed-off-by: Daniel Nephin <dnephin@docker.com>	2018-02-14 11:59:18 -05:00
Daniel Nephin	c502bcff33	Remove unnecessary getLayerInit Signed-off-by: Daniel Nephin <dnephin@docker.com>	2018-02-14 11:59:10 -05:00
Tianon Gravi	3a633a712c	Merge pull request #36194 from dnephin/add-canonical-import Add canonical import path	2018-02-07 13:06:45 -08:00
Daniel Nephin	4f0d95fa6e	Add canonical import comment Signed-off-by: Daniel Nephin <dnephin@docker.com>	2018-02-05 16:51:57 -05:00
Dennis Chen	44b074d199	Daemon: passdown the `--oom-kill-disable` option to containerd Current implementaion of docke daemon doesn't pass down the `--oom-kill-disable` option specified by the end user to the containerd when spawning a new docker instance with help from `runc` component, which results in the `--oom-kill-disable` doesn't work no matter the flag is `true` or `false`. This PR will fix this issue reported by #36090 Signed-off-by: Dennis Chen <dennis.chen@arm.com> Signed-off-by: Jianyong Wu <jianyong.wu@arm.com>	2018-02-05 03:25:59 +00:00
Brian Goff	a510192b86	Set daemon root to use shared propagation This change sets an explicit mount propagation for the daemon root. This is useful for people who need to bind mount the docker daemon root into a container. Since bind mounting the daemon root should only ever happen with at least `rlsave` propagation (to prevent the container from holding references to mounts making it impossible for the daemon to clean up its resources), we should make sure the user is actually able to this. Most modern systems have shared root (`/`) propagation by default already, however there are some cases where this may not be so (e.g. potentially docker-in-docker scenarios, but also other cases). So this just gives the daemon a little more control here and provides a more uniform experience across different systems. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2018-01-23 14:17:08 -08:00
Sebastiaan van Stijn	6121a8429b	Move reload-related functions to reload.go Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-01-21 00:55:49 +01:00
John Howard	ce8e529e18	LCOW: Re-coalesce stores Signed-off-by: John Howard <jhoward@microsoft.com> The re-coalesces the daemon stores which were split as part of the original LCOW implementation. This is part of the work discussed in https://github.com/moby/moby/issues/34617, in particular see the document linked to in that issue.	2018-01-18 08:29:19 -08:00
Sebastiaan van Stijn	b4a6313969	Golint: remove redundant ifs Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-01-15 00:42:25 +01:00
Sebastiaan van Stijn	16fe5a1289	Remove unused experimental code Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2017-12-18 17:07:48 +01:00
Kir Kolyshkin	516010e92d	Simplify/fix MkdirAll usage This subtle bug keeps lurking in because error checking for `Mkdir()` and `MkdirAll()` is slightly different wrt to `EEXIST`/`IsExist`: - for `Mkdir()`, `IsExist` error should (usually) be ignored (unless you want to make sure directory was not there before) as it means "the destination directory was already there" - for `MkdirAll()`, `IsExist` error should NEVER be ignored. Mostly, this commit just removes ignoring the IsExist error, as it should not be ignored. Also, there are a couple of cases then IsExist is handled as "directory already exist" which is wrong. As a result, some code that never worked as intended is now removed. NOTE that `idtools.MkdirAndChown()` behaves like `os.MkdirAll()` rather than `os.Mkdir()` -- so its description is amended accordingly, and its usage is handled as such (i.e. IsExist error is not ignored). For more details, a quote from my runc commit 6f82d4b (July 2015): TL;DR: check for IsExist(err) after a failed MkdirAll() is both redundant and wrong -- so two reasons to remove it. Quoting MkdirAll documentation: > MkdirAll creates a directory named path, along with any necessary > parents, and returns nil, or else returns an error. If path > is already a directory, MkdirAll does nothing and returns nil. This means two things: 1. If a directory to be created already exists, no error is returned. 2. If the error returned is IsExist (EEXIST), it means there exists a non-directory with the same name as MkdirAll need to use for directory. Example: we want to MkdirAll("a/b"), but file "a" (or "a/b") already exists, so MkdirAll fails. The above is a theory, based on quoted documentation and my UNIX knowledge. 3. In practice, though, current MkdirAll implementation [1] returns ENOTDIR in most of cases described in #2, with the exception when there is a race between MkdirAll and someone else creating the last component of MkdirAll argument as a file. In this very case MkdirAll() will indeed return EEXIST. Because of #1, IsExist check after MkdirAll is not needed. Because of #2 and #3, ignoring IsExist error is just plain wrong, as directory we require is not created. It's cleaner to report the error now. Note this error is all over the tree, I guess due to copy-paste, or trying to follow the same usage pattern as for Mkdir(), or some not quite correct examples on the Internet. [1] https://github.com/golang/go/blob/f9ed2f75/src/os/path.go Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2017-11-27 17:32:12 -08:00
Kenfe-Mickael Laventure	ddae20c032	Update libcontainerd to use containerd 1.0 Signed-off-by: Kenfe-Mickael Laventure <mickael.laventure@gmail.com>	2017-10-20 07:11:37 -07:00
Brian Goff	c6a2044497	Don't abort when setting may_detach_mounts `83c2152de5` sets the kernel param for fs.may_detach_mounts, but this is not neccessary for the daemon to operate. Instead of erroring out (and thus aborting startup) just log the error. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2017-10-11 14:54:24 -04:00
Brian Goff	83c2152de5	Automatically set `may_detach_mounts=1` on startup This is kernel config available in RHEL7.4 based kernels that enables mountpoint removal where the mountpoint exists in other namespaces. In particular this is important for making this pattern work: ``` umount -l /some/path rm -r /some/path ``` Where `/some/path` exists in another mount namespace. Setting this value will prevent `device or resource busy` errors when attempting to the removal of `/some/path` in the example. This setting is the default, and non-configurable, on upstream kernels since 3.15. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2017-09-20 09:57:25 -04:00
Victor Vieux	a5f9783c93	Merge pull request #34252 from Microsoft/akagup/lcow-remotefs-sandbox LCOW: Support for docker cp, ADD/COPY on build	2017-09-15 16:49:48 -07:00
Simon Ferquel	e89b6e8c2d	Volume refactoring for LCOW Signed-off-by: Simon Ferquel <simon.ferquel@docker.com>	2017-09-14 12:33:31 -07:00
Akash Gupta	7a7357dae1	LCOW: Implemented support for docker cp + build This enables docker cp and ADD/COPY docker build support for LCOW. Originally, the graphdriver.Get() interface returned a local path to the container root filesystem. This does not work for LCOW, so the Get() method now returns an interface that LCOW implements to support copying to and from the container. Signed-off-by: Akash Gupta <akagup@microsoft.com>	2017-09-14 12:07:52 -07:00
Yong Tang	70214f95b2	Merge pull request #34352 from ChenMin46/fix_rename_shared_namespace Use ID rather than Name to identify a container when sharing namespace	2017-08-25 09:39:58 -07:00
Chen Min	b6e5ea8e57	Use ID rather than Name to identify a container when sharing namespace Fix: https://github.com/moby/moby/issues/34307 Signed-off-by: Chen Min <chenmin46@huawei.com>	2017-08-25 01:55:50 +08:00
Kenfe-Mickael Laventure	45d85c9913	Update containerd to 06b9cb35161009dcb7123345749fef02f7cea8e0 This also update: - runc to 3f2f8b84a77f73d38244dd690525642a72156c64 - runtime-specs to v1.0.0 Signed-off-by: Kenfe-Mickael Laventure <mickael.laventure@gmail.com>	2017-08-21 12:04:07 -07:00
Daniel Nephin	9b47b7b151	Fix golint errors. Signed-off-by: Daniel Nephin <dnephin@docker.com>	2017-08-18 14:23:44 -04:00
Brian Goff	ebcb7d6b40	Remove string checking in API error handling Use strongly typed errors to set HTTP status codes. Error interfaces are defined in the api/errors package and errors returned from controllers are checked against these interfaces. Errors can be wraeped in a pkg/errors.Causer, as long as somewhere in the line of causes one of the interfaces is implemented. The special error interfaces take precedence over Causer, meaning if both Causer and one of the new error interfaces are implemented, the Causer is not traversed. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2017-08-15 16:01:11 -04:00
Kir Kolyshkin	7120976d74	Implement none, private, and shareable ipc modes Since the commit `d88fe447df` ("Add support for sharing /dev/shm/ and /dev/mqueue between containers") container's /dev/shm is mounted on the host first, then bind-mounted inside the container. This is done that way in order to be able to share this container's IPC namespace (and the /dev/shm mount point) with another container. Unfortunately, this functionality breaks container checkpoint/restore (even if IPC is not shared). Since /dev/shm is an external mount, its contents is not saved by `criu checkpoint`, and so upon restore any application that tries to access data under /dev/shm is severily disappointed (which usually results in a fatal crash). This commit solves the issue by introducing new IPC modes for containers (in addition to 'host' and 'container:ID'). The new modes are: - 'shareable': enables sharing this container's IPC with others (this used to be the implicit default); - 'private': disables sharing this container's IPC. In 'private' mode, container's /dev/shm is truly mounted inside the container, without any bind-mounting from the host, which solves the issue. While at it, let's also implement 'none' mode. The motivation, as eloquently put by Justin Cormack, is: > I wondered a while back about having a none shm mode, as currently it is > not possible to have a totally unwriteable container as there is always > a /dev/shm writeable mount. It is a bit of a niche case (and clearly > should never be allowed to be daemon default) but it would be trivial to > add now so maybe we should... ...so here's yet yet another mode: - 'none': no /dev/shm mount inside the container (though it still has its own private IPC namespace). Now, to ultimately solve the abovementioned checkpoint/restore issue, we'd need to make 'private' the default mode, but unfortunately it breaks the backward compatibility. So, let's make the default container IPC mode per-daemon configurable (with the built-in default set to 'shareable' for now). The default can be changed either via a daemon CLI option (--default-shm-mode) or a daemon.json configuration file parameter of the same name. Note one can only set either 'shareable' or 'private' IPC modes as a daemon default (i.e. in this context 'host', 'container', or 'none' do not make much sense). Some other changes this patch introduces are: 1. A mount for /dev/shm is added to default OCI Linux spec. 2. IpcMode.Valid() is simplified to remove duplicated code that parsed 'container:ID' form. Note the old version used to check that ID does not contain a semicolon -- this is no longer the case (tests are modified accordingly). The motivation is we should either do a proper check for container ID validity, or don't check it at all (since it is checked in other places anyway). I chose the latter. 3. IpcMode.Container() is modified to not return container ID if the mode value does not start with "container:", unifying the check to be the same as in IpcMode.IsContainer(). 3. IPC mode unit tests (runconfig/hostconfig_test.go) are modified to add checks for newly added values. [v2: addressed review at https://github.com/moby/moby/pull/34087#pullrequestreview-51345997] [v3: addressed review at https://github.com/moby/moby/pull/34087#pullrequestreview-53902833] [v4: addressed the case of upgrading from older daemon, in this case container.HostConfig.IpcMode is unset and this is valid] [v5: document old and new IpcMode values in api/swagger.yaml] [v6: add the 'none' mode, changelog entry to docs/api/version-history.md] Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2017-08-14 10:50:39 +03:00
Yong Tang	7ccd8bda77	Merge pull request #33722 from TomSweeneyRedHat/tsweeney/privmessage Add clarification to --privileged error message	2017-08-09 16:08:10 -07:00
Derek McGowan	1009e6a40b	Update logrus to v1.0.1 Fixes case sensitivity issue Signed-off-by: Derek McGowan <derek@mcgstyle.net>	2017-07-31 13:16:46 -07:00
Tobias Klauser	01f70b028e	Switch Stat syscalls to x/sys/unix Switch some more usage of the Stat function and the Stat_t type from the syscall package to golang.org/x/sys. Those were missing in PR #33399. Signed-off-by: Tobias Klauser <tklauser@distanz.ch>	2017-07-27 10:09:02 +02:00
Yuanhong Peng	4a6cbf9bcb	Return an empty stats if "container not found" If we get "container not found" error from containerd, it's possibly because that this container has already been stopped. It will be ok to ignore this error and just return an empty stats. Signed-off-by: Yuanhong Peng <pengyuanhong@huawei.com>	2017-07-10 16:30:48 +08:00
Michael Crosby	9d87e6e0fb	Do not set -1 for swappiness Do not set a default value for swappiness as the default value should be `nil` Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2017-07-03 11:23:15 -07:00
TomSweeneyRedHat	38e26f0d8d	Add clarification to --privileged error message Signed-off-by: TomSweeneyRedHat <tsweeney@redhat.com>	2017-06-25 14:02:20 -04:00
Fabio Kung	a43be3431e	avoid re-reading json files when copying containers Signed-off-by: Fabio Kung <fabio.kung@gmail.com>	2017-06-23 07:52:34 -07:00
John Howard	3aa4a00715	LCOW: Move daemon stores to per platform Signed-off-by: John Howard <jhoward@microsoft.com>	2017-06-20 19:49:52 -07:00
Daniel Nephin	09cd96c5ad	Partial refactor of UID/GID usage to use a unified struct. Signed-off-by: Daniel Nephin <dnephin@docker.com>	2017-06-07 11:44:33 -04:00
Michael Crosby	005506d36c	Update moby to runc and oci 1.0 runtime final rc Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2017-05-05 13:45:45 -07:00
Antonio Murdaca	abbbf91498	Switch to using opencontainers/selinux for selinux bindings Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2017-04-24 21:29:47 +02:00
Vincent Demeester	b0235fd515	Merge pull request #31827 from erikstmartin/master Add mount point to cgroup root when initializing cgroup paths for cpu.rt_runtime	2017-04-03 22:39:49 +02:00
Daniel J Walsh	881e20ee0b	If caller specifies label overrides, don't override security options If a caller specifies an SELinux type or MCS Label and still wants to share an IPC Namespace or the host namespace, we should allow them. Currently we are ignoring the label specification if ipcmod=container or pidmode=host. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2017-03-16 20:33:37 -04:00
Erik St. Martin	40e075532a	Add mount point to cgroup root when initializing cgroup paths for cpu.rt_runtime PR https://github.com/docker/docker/pull/23430 introduced a couple more flags including `--cpu-rt-runtime` to the docker daemon. It appears recent changes or merge issues may have broken this. It currently does not take the cgroup mount point into account when determining the cgroup files to write values to. This breaks docker setting its own `cpu.rt_runtime` for the daemon. This also means containers aren't able to set theirs. Also, the cgroups.FindCgroupMountpointAndRoot returns back a mount point that includes the cgroup of the currently running container when docker is run inside a docker container. this breaks the `--cpu-rt-runtime` flag when running docker in docker. A fix has been placed here, but potentially could be pulled up into libcontainer if this is a better place for it. Signed-off-by: Erik St. Martin <alakriti@gmail.com>	2017-03-14 16:18:36 -04:00
Vincent Demeester	f7819fcb25	Refactor cpu-realtime file creation to remove duplication Signed-off-by: Vincent Demeester <vincent@sbr.pm>	2017-02-28 11:12:06 +01:00
Victor Vieux	cf5fe9ed0b	Merge pull request #31075 from vieux/switch_to_new_versioning_scheme Switch to new versioning scheme	2017-02-17 15:43:28 -08:00
Daniel Zhang	d7fda019bb	Add daemon flag to set no_new_priv as default for unprivileged containers. Signed-off-by: Daniel Zhang <jmzwcn@gmail.com>	2017-02-16 19:05:14 +08:00
Victor Vieux	47396d6377	Switch to new versioning scheme Signed-off-by: Victor Vieux <victorvieux@gmail.com>	2017-02-15 23:22:50 -08:00
allencloud	75f5d63ec8	rewrite reload code Signed-off-by: allencloud <allen.sun@daocloud.io>	2017-02-15 21:57:27 +08:00
Vincent Demeester	db63f9370e	Extract daemon configuration and discovery to their own package This also moves some cli specific in `cmd/dockerd` as it does not really belong to the `daemon/config` package. Signed-off-by: Vincent Demeester <vincent@sbr.pm>	2017-02-08 09:53:38 +01:00
Vincent Demeester	354bd4aadd	Merge pull request #29692 from yongtang/29492-daemon-shm-size Add daemon option `--default-shm-size`	2017-02-01 16:56:10 +01:00
Alexander Morozov	ebaf1ced72	Merge pull request #27071 from rhvgoyal/docker-overlay-error Warn if kernel does not support overlay with selinux	2017-01-31 13:41:02 -08:00
Yong Tang	db575ef626	Add daemon option --default-shm-size This fix fixes issue raised in 29492 where it was not possible to specify a default `--default-shm-size` in daemon configuration for each `docker run``. The flag `--default-shm-size` which is reloadable, has been added to the daemon configuation. Related docs has been updated. This fix fixes 29492. Signed-off-by: Yong Tang <yong.tang.github@outlook.com>	2017-01-27 12:17:06 -08:00
Alexander Morozov	cd6a61f1b1	Merge pull request #29887 from vdemeester/daemon-extract-stats-collector Extract daemon statsCollector to its own package	2017-01-27 10:28:03 -08:00
Akihiro Suda	4a8799dc0a	validate mount path for tmpfs There was no validation for `docker run --tmpfs foo`. In this PR, only two obvious rules are implemented: - path must be absolute - path must not be "/" We should add more rules carefully. Signed-off-by: Akihiro Suda <suda.akihiro@lab.ntt.co.jp>	2017-01-20 06:01:48 +00:00
Daehyeok Mun	6306019d0b	Refactoring ineffectual assignments This patch fixed below 4 types of code line 1. Remove unnecessary variable assignment 2. Use variables declaration instead of explicit initial zero value 3. Change variable name to underbar when variable not used 4. Add erro check and return for ignored error Signed-off-by: Daehyeok Mun <daehyeok@gmail.com>	2017-01-13 00:27:01 -07:00
Sebastiaan van Stijn	f285d5b3e8	do not create init-dir if not needed commit `56f77d5ade` added support for cpu-rt-period and cpu-rt-runtime, but always initialized the cgroup path, even if not used. As a result, containers failed to start on a read-only filesystem. This patch only creates the cgroup path if one of these options is set. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2017-01-09 15:28:51 +01:00
Vincent Demeester	835971c6fd	Extract daemon statsCollector to its own package Signed-off-by: Vincent Demeester <vincent@sbr.pm>	2017-01-04 18:18:30 +01:00
Sebastiaan van Stijn	1dd9410776	Merge pull request #29683 from vdemeester/runconfig-clean Clean some stuff from runconfig that are cli only…	2016-12-29 17:42:08 +01:00
allencloud	6fb05778ba	fix nits in comments Signed-off-by: allencloud <allen.sun@daocloud.io>	2016-12-27 23:30:50 +08:00
Vincent Demeester	c424be21b7	Clean some stuff from runconfig that are cli only… … or could be in `opts` package. Having `runconfig/opts` and `opts` doesn't really make sense and make it difficult to know where to put some code. Signed-off-by: Vincent Demeester <vincent@sbr.pm>	2016-12-24 13:16:00 +01:00
Tonis Tiigi	3d86b0c79b	Implement content addressability for plugins Move plugins to shared distribution stack with images. Create immutable plugin config that matches schema2 requirements. Ensure data being pushed is same as pulled/created. Store distribution artifacts in a blobstore. Run init layer setup for every plugin start. Fix breakouts from unsafe file accesses. Add support for `docker plugin install --alias` Uses normalized references for default names to avoid collisions when using default hosts/tags. Some refactoring of the plugin manager to support the change, like removing the singleton manager and adding manager config struct. Signed-off-by: Tonis Tiigi <tonistiigi@gmail.com> Signed-off-by: Derek McGowan <derek@mcgstyle.net>	2016-12-23 13:29:58 -08:00
Doug Davis	ff42a2eb41	Only show global warnings once Upon each container create I'm seeing these warning every time in the daemon output: ``` WARN[0002] Your kernel does not support swap memory limit WARN[0002] Your kernel does not support cgroup rt period WARN[0002] Your kernel does not support cgroup rt runtime ``` Showing them for each container.create() fills up the logs and encourages people to ignore the output being generated - which means its less likely they'll see real issues when they happen. In short, I don't think we need to show these warnings more than once, so let's only show these warnings at daemon start-up time. Signed-off-by: Doug Davis <dug@us.ibm.com>	2016-11-30 10:11:42 -08:00
Antonio Murdaca	30b8712e93	daemon: remove not needed unmarshal Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2016-11-23 12:53:25 +01:00
Brian Goff	06e92cc2c1	Merge pull request #28577 from chchliang/defercode function setupOOMScoreAdj file close illogicality	2016-11-18 11:48:58 -05:00
chchliang	e82fa89bcd	function setupOOMScoreAdj file close illogicality Signed-off-by: chchliang <chen.chuanliang@zte.com.cn>	2016-11-18 17:34:01 +08:00
Michael Crosby	aff99136b4	Fix cfs period calculation Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2016-11-18 00:47:43 -08:00
Yong Tang	d22ac2f3a0	Fix error messages for `--cpus` from daemon This fix fixes error messages for `--cpus` from daemon. When `docker run` takes `--cpus`, it will translate into NanoCPUs and pass the value to daemon. The `NanoCPU` is not visible to the user. The error message generated from daemon used 'NanoCPU' which may cause some confusion to the user. This fix fixes this issue by returning the error in CPUs instead. This fix fixes 28456. Signed-off-by: Yong Tang <yong.tang.github@outlook.com>	2016-11-15 17:16:09 -08:00
Yong Tang	846baf1fd3	Add `--cpus` flag to control cpu resources This fix tries to address the proposal raised in 27921 and add `--cpus` flag for `docker run/create`. Basically, `--cpus` will allow user to specify a number (possibly partial) about how many CPUs the container will use. For example, on a 2-CPU system `--cpus 1.5` means the container will take 75% (1.5/2) of the CPU share. This fix adds a `NanoCPUs` field to `HostConfig` since swarmkit alreay have a concept of NanoCPUs for tasks. The `--cpus` flag will translate the number into reused `NanoCPUs` to be consistent. This fix adds integration tests to cover the changes. Related docs (`docker run` and Remote APIs) have been updated. This fix fixes 27921. Signed-off-by: Yong Tang <yong.tang.github@outlook.com>	2016-11-04 09:43:10 -07:00
Justin Cormack	efa5e85cf7	Merge pull request #26276 from runcom/seccomp-conf daemon: add a flag to override the default seccomp profile	2016-11-04 15:45:30 +00:00
Mrunal Patel	4c10c2ded3	Ensure that SELinux Options are set when seccomp is already set Signed-off-by: Mrunal Patel <mrunalp@gmail.com>	2016-11-03 13:23:53 -07:00
Justin Cormack	81683e898a	Merge pull request #27599 from estesp/getent-path Add support for looking up user/groups via `getent`	2016-11-03 15:11:42 +00:00
Antonio Murdaca	b237189e6c	daemon: add a flag to override the default seccomp profile Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2016-11-02 21:41:29 +01:00
Michael Crosby	74bdacb908	Merge pull request #27953 from thaJeztah/fix-deprecation-version Update deprecation versions for "email" and colon in "security options"	2016-11-01 14:31:32 -07:00
Alexander Morozov	bf16fa47b7	Merge pull request #27929 from daehyeok/logrus_refactoring Fix logrus formatting	2016-11-01 11:12:46 -07:00
Sebastiaan van Stijn	e41a39dbae	Update deprecation versions for "email" and colon in "security options" These features were originally scheduled for removal in docker 1.13, but we changed our deprecation policy to keep features for three releases instead of two. This updates the deprecation version to match the deprecation policy. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2016-11-01 09:12:27 -07:00
Daehyeok Mun	fa710e504b	Fix logrus formatting This fix tries to fix logrus formatting by removing `f` from `logrus.[Error\|Warn\|Debug\|Fatal\|Panic\|Info]f` when formatting string is not present. Fixed issue #23459 Signed-off-by: Daehyeok Mun <daehyeok@gmail.com>	2016-10-31 22:05:01 -06:00
Yanqiang Miao	a9b6319e67	Optimized the indentation of codes and fix two typos Signed-off-by: Yanqiang Miao <miao.yanqiang@zte.com.cn>	2016-11-01 10:35:18 +08:00
Phil Estes	6cb8392be9	Add support for looking up user/groups via `getent` When processing the --userns-remap flag, add the capability to call out to `getent` if the user and group information is not found via local file parsing code already in libcontainer/user. Signed-off-by: Phil Estes <estesp@linux.vnet.ibm.com>	2016-10-28 19:06:07 -04:00
Yong Tang	e9c4c513d1	Fix issue for `--fixed-cidr` when bridge has multiple addresses This fix tries to address the issue raised in 26341 where multiple addresses in a bridge may cause `--fixed-cidr` to not have the correct addresses. The issue is that `netutils.ElectInterfaceAddresses(bridgeName)` only returns the first IPv4 address. This fix (together with the PR created in libnetwork ) changes `ElectInterfaceAddresses()` and `addresses()` so that all IPv4 addresses are returned. This will allow the possibility of selectively choose the address needed. In `daemon_unix.go`, bridge address is chosen by comparing with the `--fixed-cidr` first, thus resolve the issue in 26341. This fix is tested manually, as is described in 26341: ``` brctl addbr cbr0 ip addr add 10.111.111.111/20 dev cbr0 label cbr0:main ip addr add 10.222.222.222/12 dev cbr0 label cbr0:docker ip link set cbr0 up docker daemon --bridge=cbr0 --iptables=false --ip-masq=false --fixed-cidr=10.222.222.222/24 docker run --rm busybox ip route get 8.8.8.8 \| grep -Po 'src.*' src 10.222.222.0 ``` This fix fixes 26341. Signed-off-by: Yong Tang <yong.tang.github@outlook.com>	2016-10-26 20:11:29 -07:00
Erik St. Martin	56f77d5ade	Implementing support for --cpu-rt-period and --cpu-rt-runtime so that containers may specify these cgroup values at runtime. This will allow processes to change their priority to real-time within the container when CONFIG_RT_GROUP_SCHED is enabled in the kernel. See #22380. Also added sanity checks for the new --cpu-rt-runtime and --cpu-rt-period flags to ensure that that the kernel supports these features and that runtime is not greater than period. Daemon will support a --cpu-rt-runtime flag to initialize the parent cgroup on startup, this prevents the administrator from alotting runtime to docker after each restart. There are additional checks that could be added but maybe too far? Check parent cgroups to ensure values are <= parent, inspecting rtprio ulimit and issuing a warning. Signed-off-by: Erik St. Martin <alakriti@gmail.com>	2016-10-26 11:33:06 -04:00
Yong Tang	40f25809ab	Fix an incorrect `WARNING` output in `docker run/create` This fix tries to fix an incorrect `WARNING` output in `docker run/create`: ``` ubuntu@ubuntu:~/docker$ docker run -d --cpu-percent 80 busybox top WARNING: %s does not support CPU percent. Percent discarded. WARNING: linux e963d1108e455e7f8f57626ca1305b5f1999e46025d2865b9a21fc8abc51a546 ``` The reason was that in `daemon/daemon_unix.go`, the warning string was not combined with `fmt.Sprintf` before appended to the output. This fix fixes this issue. This fix has been manually tested and verified: ``` ubuntu@ubuntu:~/docker$ docker run -d --cpu-percent 80 busybox top WARNING: linux does not support CPU percent. Percent discarded. fcf53f79d389235bae846d3d40804834659ac025edbc0d075ed91841a8e4c740 ``` Signed-off-by: Yong Tang <yong.tang.github@outlook.com>	2016-10-16 17:11:39 -07:00
Anusha Ragunathan	c5393ee147	Make authorization plugins use pluginv2. Signed-off-by: Anusha Ragunathan <anusha@docker.com>	2016-10-11 13:09:28 -07:00
Justin Cormack	d316e172da	Merge pull request #26690 from mwhudson/ignore-oom_score_adj-failure Ignore failure to set oom_score_adj, as happens in an unprivileged container.	2016-10-11 10:01:22 +01:00
Tõnis Tiigi	2945f902bd	Merge pull request #27259 from LK4D4/no_map_pointers daemon: do not use pointers to map	2016-10-10 13:15:03 -07:00
Alexander Morozov	44c280afbf	daemon: do not use pointers to map Also, do not shadow err with := Signed-off-by: Alexander Morozov <lk4d4@docker.com>	2016-10-10 11:37:36 -07:00
Vincent Demeester	91312f71aa	Merge pull request #26882 from runcom/proxy-path Specify userland proxy path	2016-10-07 09:44:39 +02:00
Antonio Murdaca	dd2e1947dc	daemon: add --userland-proxy-path flag Signed-off-by: Antonio Murdaca <runcom@redhat.com>	2016-10-06 17:31:53 +02:00

1 2 3 4 5 ...

465 commits