beenull/moby

Author	SHA1	Message	Date
Olli Janatuinen	e1cae011e2	Windows: Use system specific parallelism value on containers restart Signed-off-by: Olli Janatuinen <olli.janatuinen@gmail.com> (cherry picked from commit `447a840254`) Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2019-11-13 14:54:41 -08:00
John Howard	85ad4b16c1	Windows: Experimental: Allow containerd for runtime Signed-off-by: John Howard <jhoward@microsoft.com> This is the first step in refactoring moby (dockerd) to use containerd on Windows. Similar to the current model in Linux, this adds the option to enable it for runtime. It does not switch the graphdriver to containerd snapshotters. - Refactors libcontainerd to a series of subpackages so that either a "local" containerd (1) or a "remote" (2) containerd can be loaded as opposed to conditional compile as "local" for Windows and "remote" for Linux. - Updates libcontainerd such that Windows has an option to allow the use of a "remote" containerd. Here, it communicates over a named pipe using GRPC. This is currently guarded behind the experimental flag, an environment variable, and the providing of a pipename to connect to containerd. - Infrastructure pieces such as under pkg/system to have helper functions for determining whether containerd is being used. (1) "local" containerd is what the daemon on Windows has used since inception. It's not really containerd at all - it's simply local invocation of HCS APIs directly in-process from the daemon through the Microsoft/hcsshim library. (2) "remote" containerd is what docker on Linux uses for its runtime. It means that there is a separate containerd service running, and docker communicates over GRPC to it. To try this out, you will need to start with something like the following: Window 1: containerd --log-level debug Window 2: $env:DOCKER_WINDOWS_CONTAINERD=1 dockerd --experimental -D --containerd \\.\pipe\containerd-containerd You will need the following binary from github.com/containerd/containerd in your path: - containerd.exe You will need the following binaries from github.com/Microsoft/hcsshim in your path: - runhcs.exe - containerd-shim-runhcs-v1.exe For LCOW, it will require an initrd.img and kernel in `C:\Program Files\Linux Containers`. This is no different to the current requirements. 
However, you may need updated binaries, particularly initrd.img built from Microsoft/opengcs as (at the time of writing), Linuxkit binaries are somewhat out of date. Note that containerd and hcsshim for HCS v2 APIs do not yet support all the required functionality needed for docker. This will come in time - this is a baby (although large) step to migrating Docker on Windows to containerd. Note that the HCS v2 APIs are only called on RS5+ builds. RS1..RS4 will still use HCS v1 APIs as the v2 APIs were not fully developed enough on these builds to be usable. This abstraction is done in HCSShim. (Referring specifically to runtime) Note the LCOW graphdriver still uses HCS v1 APIs regardless. Note also that this does not migrate docker to use containerd snapshotters rather than graphdrivers. This needs to be done in conjunction with Linux also doing the same switch.	2019-03-12 18:41:55 -07:00
Sebastiaan van Stijn	dd94555787	Merge pull request #32519 from darkowlzz/32443-docker-update-pids-limit Add pids-limit support in docker update	2019-02-23 15:20:59 +01:00
Sunny Gogoi	74eb258ffb	Add pids-limit support in docker update - Adds updating PidsLimit in UpdateContainer(). - Adds setting PidsLimit in toContainerResources(). Signed-off-by: Sunny Gogoi <indiasuny000@gmail.com> Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2019-02-21 14:17:38 -08:00
akolomentsev	e017717d96	keep old network ids for windows All networks are re-populated in the store during network controller initialization. The current version also regenerates network IDs, which may be referenced by other components, and this may cause broken references to networks. This commit avoids regeneration of network IDs. Signed-off-by: Andrey Kolomentsev <andrey.kolomentsev@docker.com>	2019-01-23 14:53:27 -08:00
Akihiro Suda	2cb26cfe9c	Merge pull request #38301 from cyphar/waitgroup-limits daemon: switch to semaphore-gated WaitGroup for startup tasks	2018-12-22 00:07:55 +09:00
Aleksa Sarai	5a52917e4d	daemon: switch to semaphore-gated WaitGroup for startup tasks Many startup tasks have to run for each container, and thus using a WaitGroup (which doesn't have a limit to the number of parallel tasks) can result in Docker exceeding the NOFILE limit quite trivially. A more optimal solution is to have a parallelism limit by using a semaphore. In addition, several startup tasks were not parallelised previously which resulted in very long startup times. According to my testing, 20K dead containers resulted in ~6 minute startup times (during which time Docker is completely unusable). This patch fixes both issues, and the parallelStartupTimes factor chosen (128 * NumCPU) is based on my own significant testing of the 20K container case. This patch (on my machines) reduces the startup time from 6 minutes to less than a minute (ideally this could be further reduced by removing the need to scan all dead containers on startup -- but that's beyond the scope of this patchset). In order to avoid the NOFILE limit problem, we also detect this on-startup and if NOFILE < 2 * 128 * NumCPU we will reduce the parallelism factor to avoid hitting NOFILE limits (but also emit a warning since this is almost certainly a mis-configuration). Signed-off-by: Aleksa Sarai <asarai@suse.de>	2018-12-21 21:51:02 +11:00
Sebastiaan van Stijn	f6002117a4	Extract container-config and container-hostconfig validation Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-19 13:09:12 +01:00
Sebastiaan van Stijn	b6e373c525	Rename verifyContainerResources to verifyPlatformContainerResources This validation function is platform-specific; rename it to be more explicit. Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-19 10:24:09 +01:00
Sebastiaan van Stijn	e278678705	Remove unused argument from verifyPlatformContainerSettings Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-19 09:23:09 +01:00
Sebastiaan van Stijn	10c97b9357	Unify logging container validation warnings Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-12-19 09:15:21 +01:00
John Howard	c907c2486c	Windows:Allow process isolation Signed-off-by: John Howard <jhoward@microsoft.com>	2018-10-09 11:58:26 -07:00
Simon Ferquel	6a1a4f9721	Fix long startup on windows, with non-hns governed Hyper-V networks Similar to a related issue where previously, private Hyper-V networks would each add 15 secs to the daemon startup, non-hns governed internal networks are reported by hns as network type "internal" which is not mapped to any network plugin (and thus we get the same plugin load retry loop as before). This issue hits Docker for Desktop because we set up such a network for the Linux VM communication. Signed-off-by: Simon Ferquel <simon.ferquel@docker.com>	2018-09-06 11:54:23 +02:00
Salahuddin Khan	763d839261	Add ADD/COPY --chown flag support to Windows This implements chown support on Windows. Built-in accounts as well as accounts included in the SAM database of the container are supported. NOTE: IDPair is now named Identity and IDMappings is now named IdentityMapping. The following are valid examples: ADD --chown=Guest . <some directory> COPY --chown=Administrator . <some directory> COPY --chown=Guests . <some directory> COPY --chown=ContainerUser . <some directory> On Windows an owner is only granted the permission to read the security descriptor and read/write the discretionary access control list. This fix also grants read/write and execute permissions to the owner. Signed-off-by: Salahuddin Khan <salah@docker.com>	2018-08-13 21:59:11 -07:00
Flavio Crisciani	e353e7e3f0	Fixes for resolv.conf Handle the case of systemd-resolved, and if in place use a different resolv.conf source. Set appropriately the option on libnetwork. Move unix specific code to container_operation_unix Signed-off-by: Flavio Crisciani <flavio.crisciani@docker.com>	2018-07-26 11:17:56 -07:00
Brian Goff	0023abbad3	Remove old/unneeded volume migration from vers 1.7 Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2018-04-17 14:06:53 -04:00
Daniel Nephin	4ceea53b5e	Remove duplicate rootFSToAPIType Signed-off-by: Daniel Nephin <dnephin@docker.com>	2018-02-14 11:59:18 -05:00
Daniel Nephin	c502bcff33	Remove unnecessary getLayerInit Signed-off-by: Daniel Nephin <dnephin@docker.com>	2018-02-14 11:59:10 -05:00
Daniel Nephin	4f0d95fa6e	Add canonical import comment Signed-off-by: Daniel Nephin <dnephin@docker.com>	2018-02-05 16:51:57 -05:00
Sebastiaan van Stijn	6121a8429b	Move reload-related functions to reload.go Signed-off-by: Sebastiaan van Stijn <github@gone.nl>	2018-01-21 00:55:49 +01:00
John Howard	ce8e529e18	LCOW: Re-coalesce stores Signed-off-by: John Howard <jhoward@microsoft.com> The re-coalesces the daemon stores which were split as part of the original LCOW implementation. This is part of the work discussed in https://github.com/moby/moby/issues/34617, in particular see the document linked to in that issue.	2018-01-18 08:29:19 -08:00
Yong Tang	0866dee5fd	Remove getBlkioWeightDevices in daemon_windows.go as it is not needed Signed-off-by: Yong Tang <yong.tang.github@outlook.com>	2017-12-13 17:31:28 +00:00
Daniel Dao	4d1d486202	remove import of opencontainers/runc in windows We are planning to remove support for non-Linux platforms in runc (https://github.com/opencontainers/runc/pull/1654). The current import here is the only thing that I found in docker that is windows-related, so fixing this would allow removing the rest of the windows code in runc. This changes some functions in daemon_windows to be the same as daemon_unix to use runtime-spec public API instead of runc. Signed-off-by: Daniel Dao <dqminh89@gmail.com>	2017-12-13 17:18:56 +00:00
Kir Kolyshkin	516010e92d	Simplify/fix MkdirAll usage This subtle bug keeps lurking in because error checking for `Mkdir()` and `MkdirAll()` is slightly different wrt `EEXIST`/`IsExist`: - for `Mkdir()`, `IsExist` error should (usually) be ignored (unless you want to make sure directory was not there before) as it means "the destination directory was already there" - for `MkdirAll()`, `IsExist` error should NEVER be ignored. Mostly, this commit just removes ignoring the IsExist error, as it should not be ignored. Also, there are a couple of cases when IsExist is handled as "directory already exists" which is wrong. As a result, some code that never worked as intended is now removed. NOTE that `idtools.MkdirAndChown()` behaves like `os.MkdirAll()` rather than `os.Mkdir()` -- so its description is amended accordingly, and its usage is handled as such (i.e. IsExist error is not ignored). For more details, a quote from my runc commit 6f82d4b (July 2015): TL;DR: check for IsExist(err) after a failed MkdirAll() is both redundant and wrong -- so two reasons to remove it. Quoting MkdirAll documentation: > MkdirAll creates a directory named path, along with any necessary > parents, and returns nil, or else returns an error. If path > is already a directory, MkdirAll does nothing and returns nil. This means two things: 1. If a directory to be created already exists, no error is returned. 2. If the error returned is IsExist (EEXIST), it means there exists a non-directory with the same name as MkdirAll needs to use for directory. Example: we want to MkdirAll("a/b"), but file "a" (or "a/b") already exists, so MkdirAll fails. The above is a theory, based on quoted documentation and my UNIX knowledge. 3. In practice, though, current MkdirAll implementation [1] returns ENOTDIR in most of cases described in #2, with the exception when there is a race between MkdirAll and someone else creating the last component of MkdirAll argument as a file. 
In this very case MkdirAll() will indeed return EEXIST. Because of #1, IsExist check after MkdirAll is not needed. Because of #2 and #3, ignoring IsExist error is just plain wrong, as directory we require is not created. It's cleaner to report the error now. Note this error is all over the tree, I guess due to copy-paste, or trying to follow the same usage pattern as for Mkdir(), or some not quite correct examples on the Internet. [1] https://github.com/golang/go/blob/f9ed2f75/src/os/path.go Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2017-11-27 17:32:12 -08:00
Cheng-mean Liu	cef1578ac4	Added support for persisting Windows network driver specific options over reboot or service restart Signed-off-by: Cheng-mean Liu <soccerl@microsoft.com>	2017-11-21 14:11:12 -08:00
Kenfe-Mickael Laventure	ddae20c032	Update libcontainerd to use containerd 1.0 Signed-off-by: Kenfe-Mickael Laventure <mickael.laventure@gmail.com>	2017-10-20 07:11:37 -07:00
John Howard	0380fbff37	LCOW: API: Add platform to /images/create and /build Signed-off-by: John Howard <jhoward@microsoft.com> This PR has the API changes described in https://github.com/moby/moby/issues/34617. Specifically, it adds an HTTP header "X-Requested-Platform" which is a JSON-encoded OCI Image-spec `Platform` structure. In addition, it renames (almost all) uses of a string variable platform (and associated) methods/functions to os. This makes it much clearer to disambiguate with the swarm "platform" which is really os/arch. This is a stepping stone to getting the daemon towards fully multi-platform/arch-aware, and makes it clear when "operating system" is being referred to rather than "platform" which is misleadingly used - sometimes in the swarm meaning, but more often as just the operating system.	2017-10-06 11:44:18 -07:00
Sebastiaan van Stijn	6af60b3c61	Merge pull request #34928 from darrenstahlmsft/HnsRunning Ensure Host Network Service exists	2017-09-27 17:35:08 +02:00
Darren Stahl	31405b556f	Fix error string about containers feature Signed-off-by: Darren Stahl <darst@microsoft.com>	2017-09-25 12:39:27 -07:00
Darren Stahl	1edcc63560	Ensure Host Network Service exists If HNS does not exist on the Docker host, the daemon may fail with unexpected and difficult to diagnose errors. This check prevents the daemon from starting on a system that does not have the correct prerequisites. Signed-off-by: Darren Stahl <darst@microsoft.com>	2017-09-25 11:07:44 -07:00
Akash Gupta	7a7357dae1	LCOW: Implemented support for docker cp + build This enables docker cp and ADD/COPY docker build support for LCOW. Originally, the graphdriver.Get() interface returned a local path to the container root filesystem. This does not work for LCOW, so the Get() method now returns an interface that LCOW implements to support copying to and from the container. Signed-off-by: Akash Gupta <akagup@microsoft.com>	2017-09-14 12:07:52 -07:00
Daniel Nephin	62c1f0ef41	Add deadcode linter Signed-off-by: Daniel Nephin <dnephin@docker.com>	2017-08-21 18:18:50 -04:00
Brian Goff	ebcb7d6b40	Remove string checking in API error handling Use strongly typed errors to set HTTP status codes. Error interfaces are defined in the api/errors package and errors returned from controllers are checked against these interfaces. Errors can be wrapped in a pkg/errors.Causer, as long as somewhere in the line of causes one of the interfaces is implemented. The special error interfaces take precedence over Causer, meaning if both Causer and one of the new error interfaces are implemented, the Causer is not traversed. Signed-off-by: Brian Goff <cpuguy83@gmail.com>	2017-08-15 16:01:11 -04:00
Kir Kolyshkin	7120976d74	Implement none, private, and shareable ipc modes Since the commit `d88fe447df` ("Add support for sharing /dev/shm/ and /dev/mqueue between containers") container's /dev/shm is mounted on the host first, then bind-mounted inside the container. This is done that way in order to be able to share this container's IPC namespace (and the /dev/shm mount point) with another container. Unfortunately, this functionality breaks container checkpoint/restore (even if IPC is not shared). Since /dev/shm is an external mount, its contents are not saved by `criu checkpoint`, and so upon restore any application that tries to access data under /dev/shm is severely disappointed (which usually results in a fatal crash). This commit solves the issue by introducing new IPC modes for containers (in addition to 'host' and 'container:ID'). The new modes are: - 'shareable': enables sharing this container's IPC with others (this used to be the implicit default); - 'private': disables sharing this container's IPC. In 'private' mode, container's /dev/shm is truly mounted inside the container, without any bind-mounting from the host, which solves the issue. While at it, let's also implement 'none' mode. The motivation, as eloquently put by Justin Cormack, is: > I wondered a while back about having a none shm mode, as currently it is > not possible to have a totally unwriteable container as there is always > a /dev/shm writeable mount. It is a bit of a niche case (and clearly > should never be allowed to be daemon default) but it would be trivial to > add now so maybe we should... ...so here's yet another mode: - 'none': no /dev/shm mount inside the container (though it still has its own private IPC namespace). Now, to ultimately solve the abovementioned checkpoint/restore issue, we'd need to make 'private' the default mode, but unfortunately it breaks the backward compatibility. 
So, let's make the default container IPC mode per-daemon configurable (with the built-in default set to 'shareable' for now). The default can be changed either via a daemon CLI option (--default-shm-mode) or a daemon.json configuration file parameter of the same name. Note one can only set either 'shareable' or 'private' IPC modes as a daemon default (i.e. in this context 'host', 'container', or 'none' do not make much sense). Some other changes this patch introduces are: 1. A mount for /dev/shm is added to default OCI Linux spec. 2. IpcMode.Valid() is simplified to remove duplicated code that parsed 'container:ID' form. Note the old version used to check that ID does not contain a semicolon -- this is no longer the case (tests are modified accordingly). The motivation is we should either do a proper check for container ID validity, or don't check it at all (since it is checked in other places anyway). I chose the latter. 3. IpcMode.Container() is modified to not return container ID if the mode value does not start with "container:", unifying the check to be the same as in IpcMode.IsContainer(). 4. IPC mode unit tests (runconfig/hostconfig_test.go) are modified to add checks for newly added values. [v2: addressed review at https://github.com/moby/moby/pull/34087#pullrequestreview-51345997] [v3: addressed review at https://github.com/moby/moby/pull/34087#pullrequestreview-53902833] [v4: addressed the case of upgrading from older daemon, in this case container.HostConfig.IpcMode is unset and this is valid] [v5: document old and new IpcMode values in api/swagger.yaml] [v6: add the 'none' mode, changelog entry to docs/api/version-history.md] Signed-off-by: Kir Kolyshkin <kolyshkin@gmail.com>	2017-08-14 10:50:39 +03:00
Derek McGowan	1009e6a40b	Update logrus to v1.0.1 Fixes case sensitivity issue Signed-off-by: Derek McGowan <derek@mcgstyle.net>	2017-07-31 13:16:46 -07:00
Yuanhong Peng	4a6cbf9bcb	Return an empty stats if "container not found" If we get a "container not found" error from containerd, it is possibly because this container has already been stopped. It is OK to ignore this error and just return empty stats. Signed-off-by: Yuanhong Peng <pengyuanhong@huawei.com>	2017-07-10 16:30:48 +08:00
Michael Crosby	9d87e6e0fb	Do not set -1 for swappiness Do not set a default value for swappiness as the default value should be `nil` Signed-off-by: Michael Crosby <crosbymichael@gmail.com>	2017-07-03 11:23:15 -07:00
John Howard	f8aa70055e	LCOW: Don't mount for linux containers either Signed-off-by: John Howard <jhoward@microsoft.com>	2017-06-20 19:50:12 -07:00
John Howard	ed10ac6ee9	LCOW: Create layer folders with correct ACL Signed-off-by: John Howard <jhoward@microsoft.com>	2017-06-20 19:50:12 -07:00
John Howard	3aa4a00715	LCOW: Move daemon stores to per platform Signed-off-by: John Howard <jhoward@microsoft.com>	2017-06-20 19:49:52 -07:00
John Howard	b931c35a46	Merge pull request #33498 from darrenstahlmsft/IoTDataPartition Skip evaluation of symlinks to data root on IoT Core	2017-06-15 15:52:01 -07:00
Vincent Demeester	0c2f3bcd82	Merge pull request #33053 from simonferquel/ignore-private-networks Ignore HNS networks with type `Private`	2017-06-14 14:20:39 +02:00
Darren Stahl	8e71b1e210	Skip evaluation of symlinks to data root on IoT Core Signed-off-by: Darren Stahl <darst@microsoft.com>	2017-06-13 15:02:35 -07:00
Simon Ferquel	b91fd26bb5	Ignore HNS networks with type Private Fix #33052 (workaround style) - What I did HNS reports networks that don't have anything to do with the Daemon, and for which no networking plugin is available. This makes the Daemon start sequence pause for 15 secs, as the plugin resolving logic waits and retries - How I did it Just after retrieving the HNS networks, I filter out those with type `Private` - How to verify it Replace the dockerd coming with Docker for Windows with one built from this PR. Windows containers daemon should now launch pretty quickly Signed-off-by: Simon Ferquel <simon.ferquel@docker.com>	2017-06-13 13:25:00 +02:00
Brian Goff	2ae085f309	Merge pull request #33414 from darrenstahlmsft/IoTServerContainers Check for Windows 10 IoT Core to use process isolation on IoT	2017-06-12 18:02:15 -05:00
Daniel Nephin	09cd96c5ad	Partial refactor of UID/GID usage to use a unified struct. Signed-off-by: Daniel Nephin <dnephin@docker.com>	2017-06-07 11:44:33 -04:00
Darren Stahl	75f7f2a83a	Check for Windows 10 IoT Core to use process isolation on IoT Signed-off-by: Darren Stahl <darst@microsoft.com>	2017-05-30 12:01:38 -07:00
Darren Stahl	3b5af0a289	Fix scaling of NanoCPUs on Hyper-V containers Signed-off-by: Darren Stahl <darst@microsoft.com>	2017-04-12 16:54:27 -07:00
Darren Stahl	6eed7f0cac	Windows:Revert change to wait for OOBE Signed-off-by: Darren Stahl <darst@microsoft.com>	2017-03-27 14:32:18 -07:00
Anusha Ragunathan	6dd2a82458	Merge pull request #29984 from jmzwcn/issueNNP [feature]: add daemon flag to set no_new_priv as default for unprivileged containers	2017-02-17 11:43:43 -08:00


170 commits