feat: introduce the copy by chunk for replication #202

chlins · 2022-09-16T06:48:10Z

Signed-off-by: chlins <chenyuzh@vmware.com>

chlins · 2022-09-19T06:02:30Z

@goharbor/all-maintainers Hi, please review this proposal, thanks!

wy65701436 · 2022-09-19T06:22:26Z

proposals/new/replication-chunk.md

+
+#### Phase 2 Implementation
+
+The key point of phase 2 is **`Breakpoint & Resume`**.


Since we decided to do only phase 1 at this stage, and the details of phase 2 have not been fully discussed, including it here may not be appropriate.

wy65701436

lgtm

reasonerjt · 2022-09-27T09:30:14Z

proposals/new/replication-chunk.md

+```go
+type Client interface {
+    // PullBlobChunk pulls the specified blob, but by chunked
+    PullBlobChunk(repository, digest string, start, end int64) (size int64, blob io.ReadCloser, err error)


Is `size` needed in the return value, since it can be calculated from `start` and `end`?

Yes, it can be calculated, but it is returned to stay consistent with the original `PullBlob` method.
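To make the exchange above concrete, here is a minimal sketch of a client that satisfies the quoted `PullBlobChunk` signature. It is not Harbor's implementation: `fakeRegistry` and `pullAll` are hypothetical names, and the sketch assumes that `end` is inclusive (as in an HTTP `Range` header) and that `size` is the byte count of the returned chunk, which is why it equals `end - start + 1`.

```go
package main

import (
	"bytes"
	"errors"
	"fmt"
	"io"
)

// fakeRegistry is a hypothetical in-memory stand-in for a registry client.
type fakeRegistry struct {
	blobs map[string][]byte // keyed by "repository@digest"
}

// PullBlobChunk returns bytes [start, end] of the blob (end inclusive, an
// assumption) together with the chunk size, mirroring the quoted signature.
func (f *fakeRegistry) PullBlobChunk(repository, digest string, start, end int64) (int64, io.ReadCloser, error) {
	data, ok := f.blobs[repository+"@"+digest]
	if !ok {
		return 0, nil, errors.New("blob not found")
	}
	if start < 0 || end < start || end >= int64(len(data)) {
		return 0, nil, fmt.Errorf("invalid range %d-%d", start, end)
	}
	chunk := data[start : end+1]
	return int64(len(chunk)), io.NopCloser(bytes.NewReader(chunk)), nil
}

// pullAll reassembles a blob of known total size by requesting chunks in order
// and checks that the reported size matches the requested range.
func pullAll(f *fakeRegistry, repository, digest string, total, chunkSize int64) ([]byte, error) {
	if chunkSize <= 0 {
		return nil, errors.New("chunkSize must be positive")
	}
	var out bytes.Buffer
	for start := int64(0); start < total; start += chunkSize {
		end := start + chunkSize - 1
		if end > total-1 {
			end = total - 1
		}
		size, rc, err := f.PullBlobChunk(repository, digest, start, end)
		if err != nil {
			return nil, err
		}
		n, err := io.Copy(&out, rc)
		rc.Close()
		if err != nil {
			return nil, err
		}
		if n != size || size != end-start+1 {
			return nil, fmt.Errorf("size mismatch for range %d-%d", start, end)
		}
	}
	return out.Bytes(), nil
}

func main() {
	f := &fakeRegistry{blobs: map[string][]byte{"library/demo@sha256:abc": []byte("0123456789")}}
	got, err := pullAll(f, "library/demo", "sha256:abc", 10, 4)
	fmt.Println(string(got), err)
}
```

Under these assumptions the returned `size` is redundant for the caller, which matches the reviewer's observation; keeping it only preserves symmetry with `PullBlob`.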

reasonerjt · 2022-09-27T09:30:36Z

proposals/new/replication-chunk.md

+*update policy*
+
+```rest
+PUT /replication/policies


I believe it has the policy ID in the URI?

Sure, hide the existing column.

reasonerjt · 2022-09-27T09:32:09Z

proposals/new/replication-chunk.md

+
+**Scope:**
+
+Chunk resuming crossing the job/execution/policy and multiple times execution(e.g. job retry), cache chunk location and last end range in redis.


I think this is cool, but I'm a little unsure about it from a cost-effectiveness perspective.

reasonerjt · 2022-09-27T09:35:37Z

proposals/new/replication-chunk.md

+
+The retry for chunk adopts the same strategy with blob, default 5 times and can be configured by environment.
+
+#### Phase 2 Implementation


IMHO, if we want to implement phase 1 in the next minor release and spark more conversation around phase 2, would it be better to split them into two proposals?

I think we can keep the phase 2 idea in this proposal for context. When we plan to do phase 2 in the future, we can redesign it entirely and push a new proposal for review.

reasonerjt · 2022-09-27T09:37:41Z

proposals/new/replication-chunk.md

+
+The key point of phase 2 is **`Breakpoint & Resume`**.
+
+From the process of chunk API, we need to store the location for next chunk push and the last pushed chunk end range, so we need to define common interface for easily integration and adapter in the future.


I need more explanation of concepts like breakpoint and location to understand the idea.

This is just an initial idea; we can discuss it in more detail when we do phase 2.
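As one possible reading of the quoted phase 2 text, the "location" is where the next chunk should be pushed and the "breakpoint" is the end range of the last chunk pushed successfully. The sketch below shows what a common interface for storing them could look like. Everything here (`Breakpoint`, `BreakpointStore`, `memoryStore`, `nextStart`) is hypothetical and not part of the proposal; an in-memory map stands in for the Redis cache mentioned in the scope.

```go
package main

import (
	"fmt"
	"sync"
)

// Breakpoint records what is needed to resume a chunked push.
type Breakpoint struct {
	Location string // upload location for the next chunk push
	EndRange int64  // end range of the last chunk pushed successfully
}

// BreakpointStore is a hypothetical common interface; a Redis-backed
// implementation could satisfy it as well as this in-memory one.
type BreakpointStore interface {
	Save(key string, bp Breakpoint) error
	Load(key string) (Breakpoint, bool, error)
	Delete(key string) error
}

// memoryStore is a concurrency-safe in-memory BreakpointStore.
type memoryStore struct {
	mu sync.Mutex
	m  map[string]Breakpoint
}

func newMemoryStore() *memoryStore {
	return &memoryStore{m: make(map[string]Breakpoint)}
}

func (s *memoryStore) Save(key string, bp Breakpoint) error {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.m[key] = bp
	return nil
}

func (s *memoryStore) Load(key string) (Breakpoint, bool, error) {
	s.mu.Lock()
	defer s.mu.Unlock()
	bp, ok := s.m[key]
	return bp, ok, nil
}

func (s *memoryStore) Delete(key string) error {
	s.mu.Lock()
	defer s.mu.Unlock()
	delete(s.m, key)
	return nil
}

// nextStart returns the offset to resume from: 0 when no breakpoint exists,
// otherwise the byte after the last pushed end range (inclusive end assumed).
func nextStart(s BreakpointStore, key string) (int64, error) {
	bp, ok, err := s.Load(key)
	if err != nil {
		return 0, err
	}
	if !ok {
		return 0, nil
	}
	return bp.EndRange + 1, nil
}

func main() {
	store := newMemoryStore()
	key := "library/demo@sha256:abc" // hypothetical cache key
	first, _ := nextStart(store, key)
	_ = store.Save(key, Breakpoint{Location: "/v2/library/demo/blobs/uploads/example", EndRange: 1023})
	resumed, _ := nextStart(store, key)
	fmt.Println(first, resumed)
}
```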

wy65701436

lgtm

OrlinVasilev · 2022-11-02T09:20:26Z

@chlins @wy65701436 Phase1 and Phase2 are not defined; at least, I didn't find anything.
Can you please provide an update?

chlins · 2022-11-02T09:24:57Z

@OrlinVasilev Hi, the phase1 is defined in https://github.com/goharbor/community/blob/9cca0f1085a1b238a99dc52c08a97982edb17795/proposals/new/replication-chunk.md#phase-1, and phase2 is defined in https://github.com/goharbor/community/blob/9cca0f1085a1b238a99dc52c08a97982edb17795/proposals/new/replication-chunk.md#phase-2

OrlinVasilev · 2022-11-02T09:43:49Z

@chlins my confusion comes from "Phase 1" and "Phase1"

wy65701436

LGTM

AllForNothing

LGTM

chlins requested review from a team as code owners September 16, 2022 06:48

wy65701436 reviewed Sep 19, 2022

wy65701436 reviewed Sep 19, 2022

wy65701436 reviewed Sep 19, 2022

wy65701436 reviewed Sep 19, 2022

wy65701436 reviewed Sep 19, 2022

wy65701436 reviewed Sep 19, 2022

wy65701436 previously approved these changes Sep 19, 2022

chlins dismissed wy65701436’s stale review via 9cca0f1 September 21, 2022 07:57

chlins force-pushed the feat/replication-chunk-transfer branch from 13d4e87 to 9cca0f1 Compare September 21, 2022 07:57

reasonerjt reviewed Sep 27, 2022

wy65701436 previously approved these changes Nov 2, 2022

OrlinVasilev requested review from qnetter and Vad1mo November 2, 2022 09:20

OrlinVasilev reviewed Nov 2, 2022

OrlinVasilev reviewed Nov 2, 2022

Vad1mo previously approved these changes Nov 2, 2022

chlins dismissed stale reviews from Vad1mo and wy65701436 via fc9140a November 3, 2022 03:26

chlins force-pushed the feat/replication-chunk-transfer branch from 9cca0f1 to fc9140a Compare November 3, 2022 03:26

wy65701436 approved these changes Nov 3, 2022

AllForNothing approved these changes Nov 3, 2022

YangJiao0817 approved these changes Nov 3, 2022

feat: introduce the copy by chunk for replication

fe62c84

Signed-off-by: chlins <chenyuzh@vmware.com>

chlins force-pushed the feat/replication-chunk-transfer branch from fc9140a to fe62c84 Compare November 3, 2022 06:34

chlins merged commit dd91b0a into goharbor:main Nov 3, 2022
