KEP-3294: Provision volumes from cross-namespace snapshots

Implementation History
ALPHA Implementable
Created 2022-04-06
Latest v1.26
Milestones
Alpha v1.26
Beta v1.27
Stable v1.29
Ownership
Owning SIG
SIG Storage
Participating SIGs

KEP-3294: Provision volumes from cross-namespace snapshots

Release Signoff Checklist

Items marked with (R) are required prior to targeting to a milestone / release.

  • (R) Enhancement issue in release milestone, which links to KEP dir in kubernetes/enhancements (not the initial KEP PR)
  • (R) KEP approvers have approved the KEP status as implementable
  • (R) Design details are appropriately documented
  • (R) Test plan is in place, giving consideration to SIG Architecture and SIG Testing input (including test refactors)
    • e2e Tests for all Beta API Operations (endpoints)
    • (R) Ensure GA e2e tests for meet requirements for Conformance Tests
    • (R) Minimum Two Week Window for GA e2e tests to prove flake free
  • (R) Graduation criteria is in place
  • (R) Production readiness review completed
  • (R) Production readiness review approved
  • “Implementation History” section is up-to-date for milestone
  • User-facing documentation has been created in kubernetes/website , for publication to kubernetes.io
  • Supporting documentation—e.g., additional design documents, links to mailing list discussions/SIG meetings, relevant PRs/issues, release notes

Summary

This KEP proposes a method for provisioning volumes from cross-namespace snapshots.

Motivation

By using volume snapshots feature , users can provision volumes from snapshots. However, it only works for the VolumeSnapshot in the same namespace, therefore users can’t provision a persistent volume claim in one namespace from a VolumeSnapshot in the other namespace. On the other hand, as discussed in other KEP PRs (https://github.com/kubernetes/enhancements/pull/643 , https://github.com/kubernetes/enhancements/pull/1112 , and https://github.com/kubernetes/enhancements/pull/2849) , there are use cases that require to share the VolumeSnapshot across namespaces. For such use cases, this KEP proposes a method for provisioning volumes from cross-namespace snapshots.

Goals

  • Provision of PVCs from VolumeSnapshots in other namespaces

Non-Goals

Proposal

Extend PersistentVolumeClaim API to handle a cross-namespace VolumeSnapshot as a data source and extend CSI external provisioner to handle it. To restrict only allowed VolumeSnapshot to be consumed from other namespaces, ReferenceGrant CRD (formerly ReferencePolicy) is used.

An initial discussion of this idea can be found here and PoC implementation can be found here .

User Stories (Optional)

Story 1

VolumeSnapshots for PVCs in prod namespace are taken on a regular basis, PVCs are created from the VolumeSnapshots in other test namespaces for testing.

Story 2

The same VolumeSnapshots are expected to be consumed as golden images from multiple namespaces. Using PVs as VM images for KubeVirt is one of the examples of this use case.

Notes/Constraints/Caveats (Optional)

Provisioning PVCs from cross-namespace PVCs

The conclusion of the original discussion (here and here ) on transfer feature was that we should avoid implementing transfer of PVCs, because there will be more race conditions for PVCs than snapshots. However, we might have a room to reconsider if this cross-namespace-provision approach can solve the issue of race for PVCs, although transfer approach can’t seem to resolve the issue easily.

Risks and Mitigations

Secret Handling

Unlike transfer feature, this idea doesn’t need to involve any transfers of Secret, therefore there will be no issue on Secret handling. From a populator, Secrets are only referenced through snapshots that exist in the same namespace (As commented here , depending on the driver implementation, there may be very little chance that some CSI drivers won’t work well in a very rare situation).

Security

By using ReferenceGrant , only allowed snapshots can be accessed beyond the namespace boundary (Please also see original discussion on security ). Therefore, no malicious user will be able to access to prohibited snapshots.

In addition, there will be cases that ReferenceGrant may be created/deleted/re-created while referencing PVC is being handled, however, no inconsistent behavior will be expected, as described below.

  1. No ReferenceGrant for a referencing PVC exists when the PVC is handled, and the ReferenceGrant is created later A controller won’t use the VolumeSnapshot until the ReferenceGrant that allows the access is created.

  2. ReferenceGrant for a referencing PVC exists when the PVC is handled, and the ReferenceGrant is deleted later A controller will use the PVC if the ReferenceGrant that allows the access exists when it checks. If all the processes succeed without any error, it succeeds even the ReferenceGrant is deleted in the middle of the processes. If any errors happened in the processes and the controller retries, it may detect that there is no ReferenceGrant . Then, it won’t use the PVC until the ReferenceGrant that allows the access is re-created.

ReferenceGrant API change

Currently, ReferenceGrant exists in gateway.networking.k8s.io API, so users may be confused by using networking API for storage purpose. To avoid it, ReferenceGrant is planned to be moved to more generic place. As a result, ReferenceGrant API will be changed in the future, so csi external provisioner implementation will also need to be changed.

In addition, it will be better that library for checking ReferenceGrant is provided in a generic place. Depending on the implementation of the library code, consuming API may need to be changed.

Therefore, changes in VolumeSnapshotLink API related to ReferenceGrant API changes will be a beta blocker.

Design Details

Example flow of how this proposal works

Let’s use Story 1 as an example and let’s assume the following:

  • There are two namespaces, prod and test,
  • Alice manages the prod namespace and Bob manages the test namespace,
  • Alice would like to allow VolumeSnapshot foo-backup in the prod namespace to be consumed in the test namespace for testing,
  • Bob would like to create a PV for PVC foo-testing in the test namespace from the VolumeSnapshot foo-backup in the prod namespace.

Once this proposal is implemented, it can be achieved by doing the following steps:

  1. In the prod namespace, Alice creates a ReferenceGrant bar that allows referencing to the VolumeSnapshot foo-backup in the prod namespace from any PersistentVolumeClaim in the test namespace,
    apiVersion: gateway.networking.k8s.io/v1alpha2
    kind: ReferenceGrant
    metadata:
      name: bar
      namespace: prod
    spec:
      from:
      - kind: PersistentVolumeClaim
        namespace: test
      to:
      - group: snapshot.storage.k8s.io/v1
        kind: VolumeSnapshot
        name: foo-backup
    
  2. In the test namespace, Bob creates a PersistentVolumeClaim foo-testing that references the VolumeSnapshot foo-backup in prod namespace as a data source,
    apiVersion: v1
    kind: PersistentVolumeClaim
    metadata:
      name: foo-testing
      namespace: test
    spec:
      accessModes:
      - ReadWriteOnce
      resources:
        requests:
          storage: 10Mi
      dataSourceRef2:
        apiGroup: snapshot.storage.k8s.io/v1
        kind: VolumeSnapshot
        name: foo-backup
        namespace: prod
      volumeMode: Filesystem
    
  3. Once the csi-provisioner finds a VolumeSnapshot is specified with non-empty namespace as dataSourceRef2, it checks all ReferenceGrants in PersistentVolumeClaim.spec.dataSourceRef2.namespace to see if access to the snapshot is allowed. If it is allowed, the csi-provisioner provisions a volume from the snapshot. Note that how ReferenceGrant is checked depends on the implementation, however controllers that are trying to use the VolumeSnapshot in another namespace must check ReferenceGrant if the access is allowed before it actually starts exposing the data and metadata from the VolumeSnapshot to the VolumeSnapshotLink’s namespace.

API

A new DataSourceRef2 field is added to PersistentVolumeClaimSpec:

type PersistentVolumeClaimSpec struct {

	.
	.
	.

	DataSourceRef2 *TypedCrossNamespaceObjectReference
}
type TypedCrossNamespaceObjectReference struct {
	// APIGroup is the group for the resource being referenced.
	// If APIGroup is not specified, the specified Kind must be in the core API group.
	// For any other third-party types, APIGroup is required.
	// +optional
	APIGroup *string
	// Kind is the type of resource being referenced
	Kind string
	// Namespace is the namespace of resource being referenced
	// Note that when a namespace is specified, a ReferenceGrant object is required in the referent namespace to allow that namespace's owner to accept the reference. See the ReferenceGrant documentation for details.
	// +optional
	Namespace string
	// Name is the name of resource being referenced
	Name string
}

Implementation

CSI external provisioner

  • If CrossNamespaceSourceProvisioning feature gate isn’t enabled: Do the existing provisioner behavior.
  • Otherwise: Datasource is handled in the below reconciler loop:
    • If data source is specified through DataSource or DataSourceRef: Do the existing provisioner behavior for the data source.
    • If data source is specified through DataSourceRef2:
      • If namepace isn’t specified: Provision a volume from the snapshot specified through DataSourceRef2`.
      • Otherwise:
        • If the access to the VolumeSnapshot referenced by the PersistentVolumeClaim is allowed by any ReferenceGrants: Provision a volume from the snapshot specified through DataSourceRef2`.
        • Otherwise: Do nothing (the ReferenceGrant check above will be repeated in the loop).

API server

To provide compatibility of PersistentVolumeClaim API with older versions, the API servers need to properly handle DataSourceRef2, DataSource, and DataSourceRef. There will be two approaches:

  1. Eventually allow only DataSourceRef2 fields

This is a similar approach to DataSourceRef and DataSource compatibility handling.

Deprecate the DataSourceRef field once DataSourceRef2 reaches GA, but continue to support it in a backwards-compatible way, never removing it.

In particular:

  1. If DataSourceRef and DataSourceRef2 are both unset; accept
  2. If DataSourceRef is set AND DataSourceRef2 is not set; set DataSourceRef2 from DataSourceRef.
  3. If DataSourceRef is set AND DataSourceRef2 is set the same (namespace is empty); accept
  4. If DataSourceRef is set AND DataSourceRef2 is set differently; reject
  5. If DataSourceRef is not set AND DataSourceRef2 is set (namespace is empty); set DataSourceRef from DataSourceRef2 and accept
  6. If DataSourceRef is not set AND DataSourceRef2 is set (namespace is not empty); accept

Note that we can’t set DataSourceRef from DataSourceRef2 in case 6, because namespace field can’t be set to DataSourceRef. Impact of not setting fields in this case, needs to be considered.

  1. Allow both DataSourceRef field and DataSourceRef2 fields to coexist, but only allow specifying either of them or not specifying at all

In this case, DataSourceRef2 is used for cross-namespace provisioning purpose and DataSourceRef is used for existing provisioing purpose.

  1. If DataSource and DataSourceRef are both set; reject

For this case, DataSourceRef2 could be renamed like CrossNamespaceDataSourceRef.

Test Plan

[x] I/we understand the owners of the involved components may require updates to existing tests to make this code solid enough prior to committing the changes necessary to implement this enhancement.

Prerequisite testing updates
Unit tests
  • external-provisioner/pkg/controller/: 2022/5/31 - 81.1%
Integration tests
  • No integration tests for csi external provisioner.
e2e tests
  • Verify that PV is provisioned from VS in other NS if specified through datasourceRef2 and allowed by ReferenceGrant:
  • Verify that PV isn’t provisioned from VS in other NS if specified through datasourceRef2 and not allowed by ReferenceGrant:
  • Verify that PV is provisioned from VS in the same NS if specified through datasourceRef2 with empty namspace and not allowed by ReferenceGrant:
  • Verify that PV isn’t provisioned from VS in the same NS if specified through datasourceRef2 with non-empty namspace and not allowed by ReferenceGrant:

Graduation Criteria

Alpha

  • Feature implemented behind a feature gate
  • Initial e2e tests completed and enabled

Beta

  • Gather feedback from developers and surveys
  • Add scale tests
  • Additional tests are in Testgrid and linked in KEP

GA

  • Allowing time for feedback

Note: Generally we also wait at least two releases between beta and GA/stable, because there’s no opportunity for user feedback, or even bug reports, in back-to-back releases.

For non-optional features moving to GA, the graduation criteria must include conformance tests .

Upgrade / Downgrade Strategy

  • Upgrade:
    • Method:
      • Enable CrossNamespaceSourceProvisioning feature gate.
    • Behavior:
      • Provisioning volumes from snapshots in other namespaces is enabled.
  • Downgrade:
    • Method:
      • Disable CrossNamespaceSourceProvisioning feature gate.
    • Behavior:
      • Provisioning volumes from snapshots in other namespaces is disabled.

Version Skew Strategy

Production Readiness Review Questionnaire

Feature Enablement and Rollback

How can this feature be enabled / disabled in a live cluster?
  • Feature gate (also fill in values in kep.yaml)
    • Feature gate name: CrossNamespaceSourceProvisioning
      • Components depending on the feature gate: kube-apiserver, csi-provisioner
Does enabling the feature change any default behavior?

No. Existing behavior of provisioning volumes from PVC and VolumeSnapshot remains unchanged. By enabling the feature, volumes can be provisioned from snapshots in another namespace.

Can the feature be disabled once it has been enabled (i.e. can we roll back the enablement)?

Yes, by diabling CrossNamespaceSourceProvisioning feature gate.

After the feature is disabled, provisioing volumes from snapshots in another namespace fails.

What happens if we reenable the feature if it was previously rolled back?

Volumes can be provisioned from snapshots in another namespace, again.

Volumes with snapshots in another namespace may be created while the feature is disabled (and such volumes aren’t provisioned). After the feature is reenabled, volumes will be provisioned retrospectively.

Are there any tests for feature enablement/disablement?

No. Unit tests that cover scenarios where the feature is disabled or enabled will be added.

Rollout, Upgrade and Rollback Planning

How can a rollout or rollback fail? Can it impact already running workloads?
What specific metrics should inform a rollback?
Were upgrade and rollback tested? Was the upgrade->downgrade->upgrade path tested?
Is the rollout accompanied by any deprecations and/or removals of features, APIs, fields of API types, flags, etc.?

Monitoring Requirements

How can an operator determine if the feature is in use by workloads?
How can someone using this feature know that it is working for their instance?
  • Events
    • Event Reason:
  • API .status
    • Condition name: Bound for a PV that is provisioned from a PVC referencing VolumeSnapshotLink
    • Other field:
  • Other (treat as last resort)
    • Details: Check if a VolumePopulator CRD to allow populating from VolumeSnapshotLink CRD exists.
What are the reasonable SLOs (Service Level Objectives) for the enhancement?
What are the SLIs (Service Level Indicators) an operator can use to determine the health of the service?
  • Metrics
    • Metric name:
      • cross_namespace_persistentvolumeclaim_provision_total: Total number of persistent volumes provisioned from cross namespace data source successfully. Broken down by storage class name.
      • cross_namespace_persistentvolumeclaim_provision_failed_total: Total number of persistent volume provision from cross namespace data source failed attempts. Broken down by storage class name.
    • [Optional] Aggregation method: prometheus
    • Components exposing the metric: CSI external-provisioner
Are there any missing metrics that would be useful to have to improve observability of this feature?

Existing metrics only provides number of claims remains in the queue and number of retries for the claims. To identify what kind of data source was specified for the claim with errors, per data source type metrics may be needed.

Dependencies

Does this feature depend on any specific services running in the cluster?
  • Features:

    • CSI feature (GA in v1.13)
  • Services:

    • CSI external provisioner and CSI plugins
      • Usage description:
        • Impact of its outage on the feature: The PV isn’t provisioned.
        • Impact of its degraded performance or high-error rates on the feature: Provision of PV becomes slow or error.
    • volume-data-source-validator
      • Usage description:
        • Impact of its outage on the feature: The PV isn’t populated.
        • Impact of its degraded performance or high-error rates on the feature: Populating PV becomes slow or error.

Scalability

Will enabling / using this feature result in any new API calls?
  • GET ReferenceGrant API call:
    • originating component(s): CSI external-provisioner
    • this API call is triggered once after matching ReferenceGrant is found in informer cache on Provision call in CSI external-provisioner when VolumeSnapshot in another namespace is referenced from PVC.
Will enabling / using this feature result in introducing new API types?
  • API type: VolumeSnapshotLink CRD
  • Supported number of objects per namespace (for namespace-scoped objects): TBD (Estimated maximum number is the number of VolumeSnapshots that should be shared across namespace or the number of PVs per namespace).
Will enabling / using this feature result in any new calls to the cloud provider?

No.

Will enabling / using this feature result in increasing size or count of the existing API objects?

No.

Will enabling / using this feature result in increasing time taken by any operations covered by existing SLIs/SLOs?

No.

Currently, no SLIs/SLOs are defined for PV provisioning, but no performance change is expected for existing PV provisioning by this feature. For a new provisioning from cross-namespace snapshot, it may take more time than existing PV provisioning due to the extra API calls.

Will enabling / using this feature result in non-negligible increase of resource usage (CPU, RAM, disk, IO, …) in any components?

No.

Troubleshooting

How does this feature react if the API server and/or etcd is unavailable?

Existing PV provisioning also fails.

What are other known failure modes?
What steps should be taken if SLOs are not being met to determine the problem?

Implementation History

Drawbacks

Alternatives

Implement transfer feature for VolumeSnapshot:

  • Pros:
    • Can have more control over the transferred VolumeSnapshot, like modifying and deleting
    • Can potentially be used to directly clone the VolumeSnapshot to other namespaces for backup use case
  • Cons:
  • Pros:
    • Doesn’t require a core API change
    • Doesn’t need to handle compatibility issues
  • Cons:
    • Provide a complex API with an extra object, which will provide a better UX
    • Conflict on installing VolumePopulator CR for VolumeSnapshotLink across CSI drivers

Let’s use Story 1 as an example and let’s assume the following:

  • There are two namespaces, prod and test,
  • Alice manages the prod namespace and Bob manages the test namespace,
  • Alice would like to allow VolumeSnapshot foo-backup in the prod namespace to be consumed in the test namespace for testing,
  • Bob would like to create a PV for PVC foo-testing in the test namespace from the VolumeSnapshot foo-backup in the prod namespace.

Once this proposal is implemented, it can be achieved by doing the following steps:

  1. In the prod namespace, Alice creates a ReferenceGrant bar that allows referencing to the VolumeSnapshot foo-backup in the prod namespace from any VolumeSnapshotLinks in the test namespace,
    apiVersion: gateway.networking.k8s.io/v1alpha2
    kind: ReferenceGrant
    metadata:
      name: bar
      namespace: prod
    spec:
      from:
      - group: snapshot.storage.k8s.io/v1alpha1
        kind: VolumeSnapshotLink
        namespace: test
      to:
      - group: snapshot.storage.k8s.io/v1
        kind: VolumeSnapshot
        name: foo-backup
    
  2. In the test namespace, Bob creates a VolumeSnapshotLink foo-link that references the VolumeSnapshot foo-backup in the prod namespace as a source,
    apiVersion: snapshot.storage.k8s.io/v1alpha1
    kind: VolumeSnapshotLink
    metadata:
      name: foo-link
      namespace: test
    spec:
      source:
        name: foo-backup
        namespace: prod
    
  3. In the test namespace, Bob creates a PersistentVolumeClaim foo-testing that references the VolumeSnapshotLink foo-link as a data source,
    apiVersion: v1
    kind: PersistentVolumeClaim
    metadata:
      name: foo-testing
      namespace: test
    spec:
      accessModes:
      - ReadWriteOnce
      resources:
        requests:
          storage: 10Mi
      dataSourceRef:
        apiGroup: snapshot.storage.k8s.io/v1alpha1
        kind: VolumeSnapshotLink
        name: foo-link
      volumeMode: Filesystem
    
  4. Once the populator finds a VolumeSnapshotLink is specified as dataSourceRef, it checks all ReferenceGrants in VolumeSnapshotLink.spec.source.namespace to see if populating the VolumeSnapshotLink.spec.source is allowed. If it is allowed, the populator populates the volume. Note that how ReferenceGrant is checked depends on the implementation, however controllers that are trying to use the VolumeSnapshot in another namespace must check ReferenceGrant if the access is allowed before it actually starts exposing the data and metadata from the VolumeSnapshot to the VolumeSnapshotLink’s namespace.

A new VolumeSnapshotLink CRD is introduced in snapshot.storage.k8s.io API group:

type VolumeSnapshotLink struct {
	metav1.TypeMeta `json:",inline"`
	// +optional
	metav1.ObjectMeta `json:"metadata,omitempty" protobuf:"bytes,1,opt,name=metadata"`

	Spec VolumeSnapshotLinkSpec `json:"spec" protobuf:"bytes,2,opt,name=spec"`
}

type VolumeSnapshotLinkList struct {
	metav1.TypeMeta `json:",inline"`
	// +optional
	metav1.ListMeta `json:"metadata,omitempty" protobuf:"bytes,1,opt,name=metadata"`

	Items []VolumeSnapshotLink `json:"items" protobuf:"bytes,2,rep,name=items"`
}

// VolumeSnapshotLinkSpec describes attributes of a volume snapshot link.
type VolumeSnapshotLinkSpec struct {
	// This field is immutable after creation.
	Source VolumeSnapshotLinkSource `json:"source" protobuf:"bytes,1,opt,name=source"`
}

// VolumeSnapshotLinkSource specifies a reference to VolumeSnapshot.
type VolumeSnapshotLinkSource struct {
	Name string `json:"name" protobuf:"bytes,1,opt,name=name"`
	// Namespace is the namespace of the snapshot. When unspecified, the local namespace is inferred.
	// Note that when a namespace is specified, a ReferenceGrant object is required in the referent namespace to allow that namespace's owner to accept the reference. See the ReferenceGrant documentation for details.
	// +optional
	Namespace string `json:"namespace" protobuf:"bytes,2,opt,name=namespace"`
}

Note that when a VolumeSnapshotLink.spec.source.namespace is specified, a ReferenceGrant object is required in the referent namespace to allow that namespace’s owner to accept the reference. See the ReferenceGrant documentation for details.

Populator implementation

The populator logic can be implemented either (a) inside the existing CSI external-provisioner or (b) as a separate populator . Cluster admins can choose which implementation to be used per CSI driver basis. As a reference implementation, only (a) will be implemented in the community.

Regardless of the implementation,

  • VolumeSnapshotLink CRD and ReferenceGrant CRD must exist in the cluster before the populator is deployed.
  • VolumePopulator CR to allow populating from VolumeSnapshotLink CRD needs to be created to enable this feature, as AnyVolumeDataSource feature defines. The VolumePopulator CR needed for this feature will be as follows:
kind: VolumePopulator
apiVersion: populator.storage.k8s.io/v1beta1
metadata:
  name: volumesnapshotlink
sourceKind:
  group: snapshot.storage.k8s.io
  kind: VolumeSnapshotLink
(a) inside the existing CSI external-provisioner

Once populator is implemented inside the existing CSI external-provisioner, the CSI external provisioner:

  1. Handles VolumeSnapshotLink CRD and ReferenceGrant CRD,
  2. Datasource is handled in the below reconciler loop:
  • If VolumeSnapshotLink isn’t specified as DataSourceRef: Do the existing provisioner behavior.
  • Otherwise:
    • If namespace isn’t specified in VolumeSnapshotLink: use the VolumeSnapshot referenced by the VolumeSnapshotLink as a SnapshotSource to pass to the CSI driver for provision.
    • Otherwise:
      • If the access to the VolumeSnapshot referenced by the VolumeSnapshotLink is allowed by any ReferenceGrants: Use the VolumeSnapshot as a SnapshotSource to pass to the CSI driver for provision.
      • Otherwise: Do nothing (the ReferenceGrant check above will be repeated in the loop).

To enable this feature in CSI external provisioner, --cross-namespace-snapshot=true command line flag needs to be passed to the provisioner for each CSI plugin.

The initial PoC implementation uses this approach, but it doesn’t implements the command line flag to enable/disable this feature.

(b) as a separate populator

There will be two approaches to implement as a separate populator:

(1) Populate data from snapshot to provisioned PV

This is a straightforward implementation that AnyVolumeDataSource feature defines. Developers will be able to utilize lib-volume-populator to implement this way. One of the challenges to achieve it will be how to copy the data from a snapshot in one namespace to an already provisioned PV that will need to be bound to a PVC in the other namespace.

A naive implementation will be:

  1. Create another PV from the snapshot in the snapshot’s namespace,
  2. Copy the data from the PV to somewhere accessible from any namespaces,
  3. Copy the data in step 2 to the originally intended PV,
  4. Delete the temporary data in step 1 and step 2.

If the naive implementation is used, unintended transient states, for example a temporary PVC in the snapshot namespace, may be visible to users. Also, there may be performance issues depending on where and how data is copied.

On the other hand, although it completely depends on the implementation, this approach can have advantages, like the ability to populate volumes from snapshot across different CSI drivers or the ability to efficiently copy data by using CSI driver specific way.

There will be no generic way to implement by using this approach, because the implementations rely too much on backup tools or CSI drivers. Therefore, no community implementation of this approach will be provided.

Note that a PoC implementation for this approach can be found here . It works only for csi-hostpath driver and is intended to be just for discussion purpose.

(2) Provision PV with data via CSI call

Current CSI external provisioners provision volume regardless of the data source, therefore populators need to populate data to already provisioned PVs. However, this behavior may be changed and provisioners may offload provisioning to populators for PV with VolumeSnapshotLink CRD data source.

The implementation of provisioner and populator of this approach will be as follows:

  • Provisioner:

    1. Handles VolumeSnapshotLink CRD,
    2. Checks if VolumeSnapshotLink is specified as DataSourceRef:
    • If specified, skip provisioning the volume
  • Populator:

    1. Handles VolumeSnapshotLink CRD and ReferenceGrant CRD,
    2. Datasource is handled in the below reconciler loop:
    • If VolumeSnapshotLink isn’t specified as DataSourceRef: Skip handling the Datasource.
    • Otherwise:
      • If namespace isn’t specified in VolumeSnapshotLink: use the VolumeSnapshot referenced by the VolumeSnapshotLink as a SnapshotSource to pass to the CSI driver for provision.
      • Otherwise:
        • If the access to the VolumeSnapshot referenced by the VolumeSnapshotLink is allowed by any ReferenceGrants: Use the VolumeSnapshot as a SnapshotSource to pass to the CSI driver for provision.
        • Otherwise: Do nothing (the ReferenceGrant check above will be repeated in the loop).

The above implementation is just separating the logics in approach (a) to two components, and it won’t help improve efficiency nor simplify implementations. Therefore, the description in this section is just for discussion purpose and won’t be implemented.

A PoC implementation for this approach, forking existing provisioner and modify it to only handle VolumeSnapshotLink, can be found here . Note that just to separate the containers for normal provision and provision from VolumeSnapshotLink, we don’t need to fork the codes, instead we can use a command line option. Fork is only needed if we need to keep the existing CSI external provisioner codes separated from this feature.

In addition, there may be an alternative way to populate to an already provisioned volume via CSI call from populator, if CSI spec is modified to allow populating an already provisioned volume.

Infrastructure Needed (Optional)