Pod 823/cache #1245

Merged · 26 commits · Sep 12, 2024

Conversation

@bkneis (Contributor) commented Sep 3, 2024

This PR contains the changes to facilitate remote caching via a registry to speed up build times. It supports Docker and Kubernetes (via kaniko) and uses the context option REGISTRY_CACHE as the registry URL. Further caching options are not currently exposed, in order to ensure the expected functionality.

The PR was tested using the following workflow:

Without cache

  1. Build examples/build 2m35s
  2. Add ghcr.io/devcontainers/features/github-cli:1 as a feature to devcontainer.json 2m45s
  3. Add files causing the build context to change (echo "test" > examples/build/app/test) 2m52s

With cache

  1. Build examples/build 2m13s
  2. Add ghcr.io/devcontainers/features/github-cli:1 as a feature to devcontainer.json 1m25s
  3. Add files causing the build context to change (echo "test" > examples/build/app/test) 13s

As you can see, step 1 takes almost the same time, since no cache exists yet. Step 2, however, is interesting: we saved around 50% of the build time thanks to the cache, but 1m16s was spent uploading the cache back to the registry. This shows that it is important for us to either omit the --cache-to parameter for up (but not build), or to defer pushing the cache manifest to the background at the end of the command, once the workspace / IDE has already launched.

Also note that I am using my local docker / kind cluster with a remote registry; when the registry is closer to the devcontainer, I would expect even greater savings in build time due to shorter download times.

Lastly, step 3 simulates a common problematic workflow: a user updates the build context and then brings a workspace up. Here we see significant savings, since only the last layer needs to be rebuilt rather than the entire image after a cache miss.

EDIT: I have now implemented a boolean ExportCache to toggle the --cache-to parameter, so we only upload the cache when running build, not up. This gives up even faster workspace start times, as we no longer wait for the cache to be uploaded.
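To illustrate the idea, here is a minimal sketch of that toggle, assuming the REGISTRY_CACHE value and an ExportCache flag are passed through the build options; the helper and its names are illustrative rather than the actual code in this PR:

```go
package build

// cacheFlags sketches how a registry cache ref could map onto builder flags:
// the cache is always consumed via --cache-from, but only pushed back via
// --cache-to when exportCache is set, i.e. on `devpod build`, not `devpod up`.
// (The kubernetes/kaniko path would use its own cache flags, e.g. a cache
// repository, instead of the buildx-style flags shown here.)
func cacheFlags(registryCache string, exportCache bool) []string {
	if registryCache == "" {
		return nil
	}
	args := []string{"--cache-from", "type=registry,ref=" + registryCache}
	if exportCache {
		args = append(args, "--cache-to", "type=registry,ref="+registryCache+",mode=max")
	}
	return args
}
```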

Also note that for machine providers I needed to enable the containerd snapshotter flag in the docker daemon; this is done during initWorkspace. For non-machine providers like local docker, we expect the user to enable this themselves. When trying to use the remote cache without it, docker prints a WARNING.
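For reference, a rough sketch of what enabling that flag amounts to, assuming the default /etc/docker/daemon.json path and the containerd-snapshotter feature key; this is not necessarily the exact code used in initWorkspace:

```go
package agent

import (
	"encoding/json"
	"os"
)

// enableContainerdSnapshotter merges {"features": {"containerd-snapshotter": true}}
// into the docker daemon config so buildx can export an image cache to a registry.
// dockerd then has to be reloaded (e.g. via SIGHUP) to pick the change up.
func enableContainerdSnapshotter(path string) error {
	cfg := map[string]interface{}{}
	if raw, err := os.ReadFile(path); err == nil {
		_ = json.Unmarshal(raw, &cfg) // keep any existing settings
	}
	features, _ := cfg["features"].(map[string]interface{})
	if features == nil {
		features = map[string]interface{}{}
	}
	features["containerd-snapshotter"] = true
	cfg["features"] = features

	out, err := json.MarshalIndent(cfg, "", "  ")
	if err != nil {
		return err
	}
	return os.WriteFile(path, out, 0644)
}
```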

NOTE: We need to merge loft-sh/dockerless#27 first and update the dockerless release tag in single.go

Lastly, I have updated the CalculatePrebuildHash function to traverse the parsed Dockerfile and extract any file paths that could affect the build. These paths are then used as an "include" filter, so only matching files are added to the hashed contents. This should cause fewer cache misses and allow developers to reuse similar images.
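As a sketch of that approach, COPY/ADD source paths can be pulled out of a parsed Dockerfile with BuildKit's parser and used as an include list for hashing; this is illustrative and may differ in detail from the actual CalculatePrebuildHash changes:

```go
package hash

import (
	"os"
	"strings"

	"github.com/moby/buildkit/frontend/dockerfile/parser"
)

// dockerfileIncludePaths collects the source paths of COPY/ADD instructions
// from a Dockerfile. A prebuild hash can then be restricted to these paths
// (plus the Dockerfile itself) instead of hashing the whole build context.
func dockerfileIncludePaths(dockerfilePath string) ([]string, error) {
	f, err := os.Open(dockerfilePath)
	if err != nil {
		return nil, err
	}
	defer f.Close()

	res, err := parser.Parse(f)
	if err != nil {
		return nil, err
	}

	var paths []string
	for _, node := range res.AST.Children {
		cmd := strings.ToLower(node.Value)
		if cmd != "copy" && cmd != "add" {
			continue
		}
		// COPY --from=<stage> sources come from another stage, not the context
		fromOtherStage := false
		for _, flag := range node.Flags {
			if strings.HasPrefix(flag, "--from=") {
				fromOtherStage = true
			}
		}
		if fromOtherStage {
			continue
		}
		// the arguments form a linked list: src... dest; drop the destination
		var args []string
		for n := node.Next; n != nil; n = n.Next {
			args = append(args, n.Value)
		}
		if len(args) > 1 {
			paths = append(paths, args[:len(args)-1]...)
		}
	}
	return paths, nil
}
```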

@bkneis bkneis marked this pull request as draft September 3, 2024 14:47
@bkneis bkneis marked this pull request as ready for review September 5, 2024 08:35
Review thread on cmd/agent/workspace/up.go (outdated, resolved):

return nil
// reload docker daemon
return exec.CommandContext(ctx, "pkill", "-HUP", "dockerd").Run()
bkneis (author):
Is this the best way to reload the docker daemon? I was surprised docker does not have a command for this, and I didn't think I could assume systemd was being used to manage docker.

Contributor:

this looks scary 😬

Contributor:

Can't think of a better way to do it though, unless we'd start this build container with the config already mounted? Is that even possible?

bkneis (author):

I agree; maybe I could try to detect how dockerd is managed (systemd, systemctl, etc.) using uname and, if that's not possible, use this as a last resort? Yes, that is possible; I thought of the mounted config, but it would be provider-specific and would need to be implemented by each machine provider (gcloud, aws, digital ocean, etc.) :/ At least with HUP it's a reload and not a restart, but it's a bit "under the hood".
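Something like the following could capture that fallback, assuming a check for systemd is good enough and everything else drops to the SIGHUP path; this is a hypothetical helper, not code from this PR:

```go
package agent

import (
	"context"
	"os"
	"os/exec"
)

// reloadDockerd prefers a managed reload when systemd is available and falls
// back to signalling dockerd directly. Both paths only reload the config
// (SIGHUP); they do not restart the daemon.
func reloadDockerd(ctx context.Context) error {
	// systemd is running if /run/systemd/system exists
	if _, err := os.Stat("/run/systemd/system"); err == nil {
		// docker.service defines ExecReload as a HUP to the main pid
		return exec.CommandContext(ctx, "systemctl", "reload", "docker").Run()
	}
	// last resort: SIGHUP makes dockerd re-read /etc/docker/daemon.json
	return exec.CommandContext(ctx, "pkill", "-HUP", "dockerd").Run()
}
```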

@janekbaraniewski (Contributor)

LGTM!

Other resolved review threads:

- cmd/agent/container/setup.go
- examples/build/README.md
- pkg/config/context.go
- pkg/devcontainer/build.go
- cmd/agent/workspace/up.go
- pkg/devcontainer/single.go
@pascalbreuninger (Member)

@bkneis approved with comments, feel free to ignore if not applicable

@bkneis (author) commented Sep 10, 2024

@pascalbreuninger fab, thanks! Just finishing testing with a remote k8s cluster on GKE, then it should be ready to merge. Do I need to update any file to reference the new version of the kubernetes driver, i.e. loft-sh/devpod-provider-kubernetes#55?

@pascalbreuninger (Member) commented Sep 10, 2024

@bkneis nope, that's done by releasing a new version of the kubernetes provider over in the other repo 👍
We'll need to wait until we've released it though, right?

@bkneis (author) commented Sep 11, 2024

@pascalbreuninger just finished testing with GKE 🥳 It was my Autopilot cluster giving me issues and killing workspaces with return code 137 (OOM). Using the new GKE cluster, workspaces spin up without problems and use the cache. In the end I didn't need to make any changes to the kubernetes driver, only to dockerless, so this PR is good to go IMO.

@bkneis bkneis merged commit 2b47efa into loft-sh:main Sep 12, 2024
17 of 23 checks passed