Swarm Gurus,
I have just setup my very first Docker Swarm environment with 3 hosts. By following the manuals here:
https://docs.docker.com/engine/install/ubuntu/
https://docs.docker.com/engine/swarm/swarm-tutorial/
https://docs.docker.com/engine/swarm/swarm-tutorial/create-swarm/
https://docs.docker.com/engine/swarm/swarm-tutorial/deploy-service/
https://docs.docker.com/engine/swarm/swarm-tutorial/scale-service/
I was able to set it up and was able to create a service with 5 replicas and working as expected. The containers were spread across 3 Nodes (Manager and 2 Worker Nodes).
Then I started to experiment by shutting down all the 3 Nodes and starting them up. The service I have created (named helloworld) was automatically spawned up by docker and was restored as a swarm.
But I noticed one thing, the original containers were no longer there but instead I got this:
someuser@manager:~$ docker service ps helloworld --no-trunc
ID NAME IMAGE NODE DESIRED STATE CURRENT STATE ERROR PORTS
8vlswsfq8ub5xn9vd401ilskn helloworld.1 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 manager Running Running 30 minutes ago
jqfgg41xppf7xcchnkvjyesyx \_ helloworld.1 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 manager Shutdown Failed 30 minutes ago "No such container: helloworld.1.jqfgg41xppf7xcchnkvjyesyx"
wy382jy2yncpv6b3y1y0qfq3h helloworld.2 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 manager Running Running 30 minutes ago
mq7w469vck8hzr7p9w22f0rt1 \_ helloworld.2 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 manager Shutdown Failed 30 minutes ago "No such container: helloworld.2.mq7w469vck8hzr7p9w22f0rt1"
jp5wbvbdxxgh60vzef9iz73aj helloworld.3 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 worker01 Running Running 30 minutes ago
t5wgad0dhu5hoyp3kjrdela4b \_ helloworld.3 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 worker01 Shutdown Failed 30 minutes ago "No such container: helloworld.3.t5wgad0dhu5hoyp3kjrdela4b"
km03jrxlvam162i8pt2ix6vlf helloworld.4 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 worker02 Running Running 29 minutes ago
8hjnbjz4nmpqncmva4ubeqpx6 \_ helloworld.4 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 worker02 Shutdown Failed 30 minutes ago "No such container: helloworld.4.8hjnbjz4nmpqncmva4ubeqpx6"
knbvl6el13l0poofdv1g6j11z helloworld.5 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 worker02 Running Running 29 minutes ago
thlnyngdbwwsi30fuxx4wx7cd \_ helloworld.5 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 worker02 Shutdown Failed 30 minutes ago "No such container: helloworld.5.thlnyngdbwwsi30fuxx4wx7cd"
I am totally fine with the new containers, since I had not gracefully shutdown the nodes and not shutting them down gracefully is part of the test case.
But I want to get rid of the nodes that have failed. Which are the following:
jqfgg41xppf7xcchnkvjyesyx \_ helloworld.1 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 manager Shutdown Failed 30 minutes ago "No such container: helloworld.1.jqfgg41xppf7xcchnkvjyesyx"
mq7w469vck8hzr7p9w22f0rt1 \_ helloworld.2 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 manager Shutdown Failed 30 minutes ago "No such container: helloworld.2.mq7w469vck8hzr7p9w22f0rt1"
t5wgad0dhu5hoyp3kjrdela4b \_ helloworld.3 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 worker01 Shutdown Failed 30 minutes ago "No such container: helloworld.3.t5wgad0dhu5hoyp3kjrdela4b"
8hjnbjz4nmpqncmva4ubeqpx6 \_ helloworld.4 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 worker02 Shutdown Failed 30 minutes ago "No such container: helloworld.4.8hjnbjz4nmpqncmva4ubeqpx6"
thlnyngdbwwsi30fuxx4wx7cd \_ helloworld.5 alpine:latest@sha256:21a3deaa0d32a8057914f36584b5288d2e5ecc984380bc0118285c70fa8c9300 worker02 Shutdown Failed 30 minutes ago "No such container: helloworld.5.thlnyngdbwwsi30fuxx4wx7cd"
I tried the following:
$ docker rm \_ helloworld.1
$ docker rm \helloworld.1.jqfgg41xppf7xcchnkvjyesyx
$ docker rm --link \_ helloworld.1
$ docker rm --link \helloworld.1.jqfgg41xppf7xcchnkvjyesyx
But all these didn't work.
Your advice is much appreciated.
Thanks