KEP-5573: Remove cgroup v1 support

KEP-5573: Remove cgroup v1

Release Signoff Checklist
Summary
Motivation
Proposal
- Risks and Mitigations
Design Details
Production Readiness Review Questionnaire
Implementation History
Drawbacks
Alternatives

Release Signoff Checklist

Items marked with (R) are required prior to targeting to a milestone / release.

(R) Enhancement issue in release milestone, which links to KEP dir in kubernetes/enhancements (not the initial KEP PR)
(R) KEP approvers have approved the KEP status as implementable
(R) Design details are appropriately documented
(R) Test plan is in place, giving consideration to SIG Architecture and SIG Testing input (including test refactors)
(R) Graduation criteria is in place
(R) Production readiness review completed
(R) Production readiness review approved
“Implementation History” section is up-to-date for milestone
User-facing documentation has been created in kubernetes/website , for publication to kubernetes.io
Supporting documentation—e.g., additional design documents, links to mailing list discussions/SIG meetings, relevant PRs/issues, release notes

Summary

Remove cgroup v1 support from Kubernetes codebase, building upon the maintenance mode introduced in KEP-4569.

This KEP will have a phased approach where beta will prevent kubelet from starting on a cgroup v1 node.

We will commit to removing cgroup v1 code but there is not yet a timeline for removal. The removal will be done no earlier than 1.38 to maintain the k8s deprecation policy.

Motivation

Following the transition of cgroup v1 support to maintenance mode in KEP-4569, the next logical step is to move cgroup v1 to a deprecated state. This aligns with the broader ecosystem’s migration to cgroup v2, including major Linux distributions and the Linux kernel community’s focus on cgroup v2 for new features and improvements.

The motivation builds on the rationale established in KEP-4569:

The Linux kernel community has made cgroup v2 the focus for new features
systemd announced deprecation 1.5 years ago in v256 and removal . Other critical components are moving beyond cgroup v1
cgroup v2 offers better functionality, more consistent interfaces, and improved scalability

The goal of this enhancement would be to eventually remove cgroup v1 related code once the timeline is figured out.

Linux Community Momentum

Systemd

Systemd has started phasing out cgroup v1 in v256 .

Support for cgroup v1 (’legacy’ and ‘hybrid’ hierarchies) is now considered obsolete and systemd by default will refuse to boot under it. To forcibly reenable cgroup v1 support, SYSTEMD_CGROUP_ENABLE_LEGACY_FORCE=1 must be set on kernel command line. The meson option ‘default-hierarchy=’ is also deprecated, i.e. only cgroup v2 (‘unified’ hierarchy) can be selected as build-time default.

Systemd in v258 announces the removal of cgroup v1.

Red Hat Enterprise Linux

In RHEL 9.4, Red Hat has announced the deprecation of cgroups v1 .

In Red Hat Enterprise Linux 9, the default mode is v2. In Red Hat Enterprise Linux 10, systemd will not support booting in the cgroups v1 mode and only cgroups v2 mode will be available.

Fedora

Fedora 41 and Fedora 42 inherit systemd v256 so they will not boot cgroup v1 by default.

Fedora 43 will inherit cgroup v1 removal so one will not be able to use cgroup v1 on latest Fedora version.

Debian

The latest release of debian “Trixie” is using systemd 257 so users of debian would need to set SYSTEMD_CGROUP_ENABLE_LEGACY_FORCE=1 to enable cgroup v1 support for systemd.

Amazon Linux

Amazon Linux 2023 announces that cgroup v1 is unsupported and not recommended.

Although AL2023 still includes code that can make the system run using cgroupv1, this is not a recommended or supported configuration, and will be completely removed in a future major release of Amazon Linux.

Goals

Disable cgroup v1 support by default: Set the kubelet flag FailCgroupV1 to true by default, effectively making cgroup v1 unsupported unless explicitly enabled.
Clear messaging: Update warning messages and events to reflect that cgroup v1 is now deprecated rather than in maintenance mode.
Documentation updates: Update all relevant documentation to reflect the deprecated status of cgroup v1 and provide migration guidance.
Preparation for removal: This change prepares the codebase for eventual removal of cgroup v1 support in future releases.
Community alignment: Provide clear signals to the Kubernetes community about the deprecation timeline and encourage adoption of cgroup v2.
Remove cgroup v1 code: Removing the code is the next logic step once we stop testing it. This will clean up the codebase.
Update system-validators: system-validator should react correctly to kubelet not starting on a cgroup v1 node. See issue for more details.

Non-Goals

In this KEP, we will focus on the kubelet related work to remove cgroup v1. Projects like minikube, kubeadm, kubespray and others may need work to support this flag going to false but that is out of scope for this KEP.

Proposal

This proposal builds upon the foundation laid by KEP-4569 (Moving cgroup v1 support into maintenance mode) and formally removes cgroup v1 code from kubelet.

Risks and Mitigations

The primary risks involve potential disruptions for users who have not yet migrated to cgroup v2:

Existing clusters running cgroup v1: Users running Kubernetes on hosts with cgroup v1 will need to either:
- Migrate their hosts to cgroup v2 (recommended)
- Explicitly set FailCgroupV1=false to continue using cgroup v1 (not recommended)
Workload compatibility: Users depending on technologies that require specific versions for cgroup v2 support:
- OpenJDK / HotSpot: jdk8u372, 11.0.16, 15 and later
- NodeJs 20.3.0 or later
- IBM Semeru Runtimes: jdk8u345-b01, 11.0.16.0, 17.0.4.0, 18.0.2.0 and later
- IBM SDK Java Technology Edition Version (IBM Java): 8.0.7.15 and later
- Third-party monitoring and security agents need to support cgroup v2
oom.group: In cgroup v2, the kernel community introduced a feature to allow the Out-of-Memory (OOM) killer to terminate an entire group of processes as an indivisible unit.
- Users who need the cgroup v1 behavior can toggle singleProcessOOMKill to true on the kubelet config.
- This will allow the kernel to kill the process that triggers OOMs without killing the rest of the processes.

Mitigations:

Provide comprehensive migration documentation and guidance
Change the KubeletConfig field FailCgroupV1 to false.
Clear warning messages when cgroup v1 is detected
Community support through migration period
Advance notice through multiple release cycles

Design Details

This enhancement primarily involves configuration changes and messaging updates, building on the infrastructure already implemented in KEP-4569.

Move to Deprecated as first phase

Enable FailCgroupV1 by default

The key technical change is to modify the default value of the kubelet config api FailCgroupV1 from false to true. This change will be implemented in the kubelet configuration types.

Current behavior:

// Default: false (cgroup v1 support enabled by default)
FailCgroupV1: false,

Proposed behavior:

// Default: true (cgroup v1 support disabled by default)
FailCgroupV1: true,

Update warning messages and events

Update the warning messages and events introduced in KEP-4569 to reflect the new unsupported status:

From (maintenance mode):

klog.Warning("cgroup v1 detected. cgroup v1 support has been transitioned into maintenance mode, please plan for the migration towards cgroup v2. More information at https://git.k8s.io/enhancements/keps/sig-node/4569-cgroup-v1-maintenance-mode")

To (deprecated):

klog.Warning("cgroup v2 detected. cgroup v1 support is deprecated and will be removed in a future release. Please migrate to cgroup v2. More information at https://git.k8s.io/enhancements/keps/sig-node/5573-remove-cgroup-v1")

Similar updates will be made to corresponding events.

Documentation updates

Update all relevant documentation across the Kubernetes ecosystem:

Kubernetes.io documentation
Kubelet configuration documentation
Migration guides
Release notes
Blog posts about the transition

Testing of cgroup v1

There is still a lane in Kubernetes upstream testing cgroup v1.

Other lanes will be removed but we will continue supporting this lane until we fully remove cgroup v1 from the code base.

Removal of cgroup v1

<UNRESOLVED @haircommander> Once all supported releases of Kubernetes have FailCgroupV1 set to true, we can begin the removal of the cgroup v1 support.

In this section, we should call the places where we are going to remove cgroup v1.

Meaning of deprecation

Kubernetes will continue to test and verify no regressions appear in the following job .

Until cgroup v1 removal happens in the code base, this job will continue to be maintained and supported.

If there are any regressions caused by code in this lane, the Kubernetes community will treat that as a regression and this would be a candidate for cherry-picks.

CGroup v1 related features or UX improvements are not in scope since cgroup v1 is deprecated.

Test Plan

[X] I/we understand the owners of the involved components may require updates to existing tests to make this code solid enough prior to committing the changes necessary to implement this enhancement.

Prerequisite testing updates

All existing cgroup v2 test jobs must continue to pass. Tests should verify that:

The default behavior correctly disables cgroup v1 support
Appropriate warning messages are displayed when cgroup v1 is detected

Unit tests

Unit tests should cover:

Default configuration values
Warning message generation
Event creation for cgroup v1 detection
Configuration override behavior

Integration tests

Integration tests will verify the end-to-end behavior of the configuration changes and ensure proper interaction between kubelet components.

e2e tests

Continue monitoring cgroup v2 CI jobs to ensure stability
Add specific tests for the new default behavior
Ensure all new tests use cgroup v2 hosts
Remove all cgroup v1 test lanes.

Graduation Criteria

Beta

Default value for FailCgroupV1 kubelet config will be changed to true
Updated warning messages and events for unsupported status
Documentation updates in kubernetes/enhancements repository
Subset of tests will run on cgroupv1 offering sanity check of cgroupv1 functioning. Since the cgroupv1 was moved to the maintenance mode, only older features are supported on cgroupv1.

Stable

Remove all cgroup v1 related code.

Upgrade / Downgrade Strategy

Upgrade considerations:

Clusters upgrading to Kubernetes v1.35+ on cgroup v1 hosts will fail to start kubelet unless FailCgroupV1 is set to false
Administrators should migrate to cgroup v2 before upgrading or explicitly set the override flag
Clear documentation and communication about this breaking change

Downgrade strategy:

Downgrading to versions prior to this change will restore the previous default behavior
No additional configuration changes needed for downgrade

Version Skew Strategy

This change only affects kubelet behavior and does not involve coordination with other control plane components. The change is backward compatible for users who explicitly configure cgroup v1 support.

Production Readiness Review Questionnaire

Feature Enablement and Rollback

How can this feature be enabled / disabled in a live cluster?

This is a default configuration change. The feature can be controlled via the kubelet config:

To disable cgroup v1 support (default): FailCgroupV1=true
To enable cgroup v1 support (override): FailCgroupV1=false

Does enabling the feature change any default behavior?

Yes, this change modifies the default behavior. Previously, cgroup v1 support was enabled by default. After this change, cgroup v1 support will be disabled by default.

Can the feature be disabled once it has been enabled (i.e. can we roll back the enablement)?

Yes, users can set FailCgroupV1=false to re-enable cgroup v1 support.

What happens if we reenable the feature if it was previously rolled back?

Re-enabling cgroup v1 support will restore the previous behavior, allowing kubelet to run on cgroup v1 hosts.

Are there any tests for feature enablement/disablement?

Yes, unit and integration tests will cover both the default behavior and the override scenarios.

Rollout, Upgrade and Rollback Planning

How can a rollout fail? Can it impact already running workloads?

Potential failure scenarios:

Kubelet fails to start on cgroup v1 hosts without the override flag
Existing clusters running on cgroup v1 experience service disruption during upgrade

Impact on running workloads:

Existing workloads on upgraded nodes will be impacted if the node uses cgroup v1 and the override flag is not set
The kubelet will fail to start, causing the node to become unavailable

What specific metrics should inform a rollback?

kubelet_cgroup_version metric showing unexpected cgroup version distribution
Increased node failures or unavailability
Kubelet startup failures indicating cgroup v1 detection

Were upgrade and rollback tested? Was the upgrade->downgrade->upgrade path tested?

Testing will include:

Upgrade scenarios on both cgroup v1 and cgroup v2 hosts
Rollback to previous versions
Override flag functionality

Is the rollout accompanied by any deprecations and/or removals of features, APIs, fields of API types, flags, etc.?

This change moves cgroup v1 from maintenance mode to unsupported status but does not remove any APIs or flags. The FailCgroupV1 flag remains available for override purposes.

Monitoring Requirements

How can someone using this feature know that it is working for their instance?

Users can monitor:

Kubelet logs for warnings about cgroup v1 detection
Events related to cgroup v1 unsupported status
The kubelet_cgroup_version metric to verify cgroup version usage

How can an operator determine if the feature is in use by workloads?

Operators can use the kubelet_cgroup_version metric to determine cgroup version distribution across their cluster and monitor logs/events for cgroup v1 warnings.

What are the SLIs (Service Level Indicators) an operator can use to determine the health of the service?

Node availability and kubelet health status
kubelet_cgroup_version metric distribution
Absence of cgroup v1 related warning messages

What are the reasonable SLOs (Service Level Objectives) for the above SLIs?

99.9% of nodes should be running cgroup v2
Zero cgroup v1 related warnings in production clusters
Node availability should remain consistent after migration

Are there any missing metrics that would be useful to have to improve observability of this feature?

The existing kubelet_cgroup_version metric from KEP-4569 provides sufficient observability for this change.

Dependencies

Does this feature depend on any specific services running in the cluster?

No external dependencies. This change only affects kubelet configuration defaults.

Scalability

Can enabling / using this feature result in resource exhaustion of some node resources (PIDs, sockets, inodes, etc.)?

No, this is a configuration default change that does not impact resource usage.

Will enabling / using this feature result in any new API calls?

No new API calls are introduced.

Will enabling / using this feature result in introducing new API types?

No new API types are introduced.

Will enabling / using this feature result in any new calls to the cloud provider?

No new cloud provider calls are made.

Will enabling / using this feature result in increasing size or count of the existing API objects?

No impact on existing API objects.

Will enabling / using this feature result in increasing time taken by any operations covered by existing SLIs/SLOs?

No impact on existing operation timing.

Will enabling / using this feature result in non-negligible increase of resource usage (CPU, RAM, disk, IO, …) in any components?

No increase in resource usage. The change may actually improve performance by defaulting to the more efficient cgroup v2.

Troubleshooting

How does this feature react if the API server and/or etcd is unavailable?

This feature operates at the kubelet level and does not depend on API server or etcd availability.

What are other known failure modes?

Failure mode: Kubelet fails to start on cgroup v1 hosts

Detection: Kubelet startup logs and node status
Mitigation: Set FailCgroupV1=false or migrate to cgroup v2
Diagnostics: Kubelet logs will clearly indicate cgroup v1 detection and unsupported status
Testing: Covered in unit and integration tests

What steps should be taken if SLOs are not being met to determine the problem?

Check kubelet logs for cgroup-related error messages
Verify cgroup version on affected nodes using kubelet_cgroup_version metric
If cgroup v1 is detected, either migrate to cgroup v2 or set override flag
Monitor node availability and kubelet health status

Implementation History

2025-09-26: KEP for removing cgroup v1 created

Drawbacks

Breaking change: This represents a breaking change for clusters running on cgroup v1 hosts that upgrade without preparation.
Migration burden: Users who have not yet migrated to cgroup v2 will be forced to either migrate or explicitly override the default behavior.
Ecosystem readiness: Some users may still rely on environments or workloads that are not fully ready for cgroup v2.
Support burden: Increased support requests from users who encounter issues during the transition.

Alternatives

Continue maintenance mode longer: Keep cgroup v1 in maintenance mode for additional releases to provide more migration time. This was ruled out because it delays the necessary ecosystem transition and maintains technical debt.
Immediate removal: Completely remove cgroup v1 support without an unsupported phase. This was ruled out as too aggressive and would break existing clusters without providing a migration path.
Opt-in cgroup v2: Require explicit configuration to enable cgroup v2 instead of disabling cgroup v1 by default. This was ruled out because it doesn’t provide clear signals about the deprecation path and slows adoption of the preferred technology.
Feature gate approach: Use a feature gate instead of a kubelet flag. This was ruled out because kubelet flags provide more direct control over the behavior and are more appropriate for this type of configuration change.