nvmeof: treat "connecting" state as valid in path detection #5974
Conversation
When checking if a path to a gateway already exists, treat both "live" and "connecting" states as valid connections that should not be re-attempted.

The "connecting" state indicates the NVMe kernel is actively trying to establish or re-establish a connection, which occurs in scenarios like:
- Initial connection establishment
- Gateway temporarily unavailable and kernel retrying
- Subsystem deleted and recreated on the gateway

The kernel's ctrl_loss_tmo mechanism will continue retry attempts for up to 30 minutes (set by the -l param in the nvme connect command). Attempting nvme connect while a path is in "connecting" state results in "already connected" errors and can cause volume attachment failures during create/delete cycles.

By treating "connecting" as a valid state, we allow the kernel's retry logic to handle reconnection automatically without interference.

Signed-off-by: gadi-didi <gadi.didi@ibm.com>
Force-pushed from ad52f60 to 4dc16ff
```diff
 			path.Address.Trsvcid == gatewayPort &&
-			path.State == "live" {
+			(path.State == "live" ||
+				path.State == "connecting") {
```
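To make the review context concrete, here is a minimal, self-contained sketch of a path-existence check of this shape. The type, function name, and fields are illustrative assumptions, not the actual ceph-csi code:

```go
// Illustrative sketch only: decide whether a usable path to the gateway
// already exists, so "nvme connect" is not re-attempted. Type and field
// names are assumptions, not the real ceph-csi implementation.
package nvmeofutil

type nvmePath struct {
	Traddr  string // gateway address
	Trsvcid string // gateway port (service ID)
	State   string // "live", "connecting", ...
}

func gatewayPathExists(paths []nvmePath, gatewayAddr, gatewayPort string) bool {
	for _, p := range paths {
		if p.Traddr == gatewayAddr &&
			p.Trsvcid == gatewayPort &&
			// "connecting" means the kernel is already retrying this path;
			// rerunning "nvme connect" would fail with "already connected".
			(p.State == "live" || p.State == "connecting") {
			return true
		}
	}
	return false
}
```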
Do we need to retry if it's in the connecting state until it goes to live?
No, we don't need to retry. Here's why:
The kernel is already doing the retrying for us. When a path is in "connecting" state, the NVMe kernel driver is actively attempting to establish the connection and will keep retrying automatically for up to 30 minutes (based on the ctrl_loss_tmo we set with -l 1800 in the nvme connect command).
From the CSI driver's perspective, our job is just to ensure a connection attempt has been initiated before the next step: mounting.
Once we see a path in either "live" or "connecting" state for our subsystem, we know the kernel has it under control.
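For reference, a rough sketch of how a connect invocation with that 30-minute loss timeout could be assembled; the exact set of flags the driver passes is an assumption here, not taken from the PR:

```go
// Illustrative sketch only: build an "nvme connect" invocation with a
// 30-minute ctrl_loss_tmo (-l 1800). Which flags the driver really uses
// is an assumption for illustration.
package nvmeofutil

import "os/exec"

func nvmeConnectCmd(transport, traddr, trsvcid, nqn string) *exec.Cmd {
	return exec.Command("nvme", "connect",
		"-t", transport, // e.g. "tcp"
		"-a", traddr, // gateway address
		"-s", trsvcid, // gateway port
		"-n", nqn, // subsystem NQN
		"-l", "1800", // ctrl_loss_tmo: kernel keeps retrying for 30 minutes
	)
}
```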
What happens in practice:
This function is called from the nodeserver in NodeStageVolume(), right after ControllerPublishVolume() has added the host (if it needed to be added) via a gRPC AddHost call to the NVMe-oF gateway.
If the path is "live" --> great, the connection is working.
If the path is "connecting" --> there is a path, but it is not yet connected (probably because the last NVMe-oF namespace was removed, so ControllerUnpublishVolume() removed the host and the subsystem). The kernel is retrying, and once ControllerPublishVolume() adds the host again the target becomes available; we simply reach this check before the transition completes. The path will automatically transition to "live" and the namespace will appear.
Another example:
- There was a connection with a live path for some NVMe-oF namespace. Then the admin deleted that pod and the PVC, so the host and the subsystem were both deleted.
- 30 minutes passed.
- The admin wants to create a new PVC and then a new pod.
- nvme subsys-list will not see "connecting" or "live", so nvme connect will rerun.
Actually, I could rerun connect anyway, without this check. If the path is "live" or "connecting" I would get an "already connected" error, so I could parse the error code and stderr, detect this case, move on, and not raise an error.
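A rough sketch of that alternative, for illustration only; the "already connected" stderr text is an assumption about nvme-cli's wording, which is exactly the fragility discussed further down:

```go
// Illustrative sketch only of the alternative approach: always run
// "nvme connect" and tolerate the "already connected" failure. The exact
// stderr text is a version-dependent assumption about nvme-cli.
package nvmeofutil

import (
	"bytes"
	"fmt"
	"os/exec"
	"strings"
)

func connectIgnoringAlreadyConnected(args ...string) error {
	cmd := exec.Command("nvme", append([]string{"connect"}, args...)...)
	var stderr bytes.Buffer
	cmd.Stderr = &stderr
	if err := cmd.Run(); err != nil {
		// Treat "already connected" as success: the kernel already owns the
		// connection and will keep retrying until ctrl_loss_tmo expires.
		if strings.Contains(strings.ToLower(stderr.String()), "already connected") {
			return nil
		}
		return fmt.Errorf("nvme connect failed: %w: %s", err, stderr.String())
	}
	return nil
}
```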
@Madhu-1 what do you think?
A couple of questions
- If it's in the connecting state, will the application be able to read/write? I assume no. What is the use of mounting such a PVC if it cannot be used?
- Will there be any problem during NodeUnstage?
> Actually, I could rerun connect anyway, without this check. If the path is "live" or "connecting" I would get an "already connected" error, so I could parse the error code and stderr, detect this case, move on, and not raise an error.

If the nvme subsys-list is not reliable, we should rerun the connect and discard known errors.
> If it's in the connecting state, will the application be able to read/write? I assume no. What is the use of mounting such a PVC if it cannot be used?

The "connecting" state we're checking happens right after the ControllerPublishVolume stage and before the mount stage.
Here's the timeline:
- CreateVolume - the volume is created on the gateway.
- ControllerPublishVolume - the host is added to the subsystem on the gateway.
- NodeStageVolume - this is where we check for existing connections. At this point we call nvme list-subsys to see what's already connected. If we see "connecting", it means a previous operation already initiated the connection (some previous PVC that's already deleted) and the kernel continues establishing the connection in the background.
- Before we actually mount, we wait for the namespace device to appear using GetNamespaceDeviceByUUID(), which retries until the device is accessible.
- Only when the device exists and is accessible do we proceed to format and mount it.
So, the key point is: We don't mount a PVC that's stuck in "connecting" state. We just avoid running nvme connect again (which would fail with "already connected"). We still wait for the device to become available before mounting.
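To illustrate that "wait for the device" step, here is a hedged sketch of a polling helper; the name echoes GetNamespaceDeviceByUUID() mentioned above, but the signature and the sysfs layout it assumes are illustrative, not the driver's real code:

```go
// Illustrative sketch only: poll until a block device exposing the given
// namespace UUID appears, or the context expires. The sysfs layout assumed
// here (/sys/block/nvme*/uuid) and the helper's signature are assumptions.
package nvmeofutil

import (
	"context"
	"fmt"
	"os"
	"path/filepath"
	"strings"
	"time"
)

func waitForNamespaceDevice(ctx context.Context, uuid string) (string, error) {
	ticker := time.NewTicker(2 * time.Second)
	defer ticker.Stop()
	for {
		// Look for a namespace whose reported UUID matches the one we expect.
		matches, _ := filepath.Glob("/sys/block/nvme*/uuid")
		for _, m := range matches {
			data, err := os.ReadFile(m)
			if err == nil && strings.TrimSpace(string(data)) == uuid {
				// /sys/block/<dev>/uuid -> /dev/<dev>
				return "/dev/" + filepath.Base(filepath.Dir(m)), nil
			}
		}
		select {
		case <-ctx.Done():
			return "", fmt.Errorf("device with uuid %s did not appear: %w", uuid, ctx.Err())
		case <-ticker.C:
			// keep polling until the kernel finishes connecting
		}
	}
}
```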
> If the nvme subsys-list is not reliable, we should rerun the connect and discard known errors
I don't think it is unreliable.
In some runs, nvme subsys-list already shows "live"; I think it is a matter of timing. Parsing the stderr is certainly a fitting (and easier) solution.
But I would avoid parsing the stderr string, in case the next nvme-cli version returns a different string.
I'm happy with this approach for now. If there is a need to always try to connect, we can do that at a later point in time too.
@Mergifyio queue
✅ The pull request has been merged automatically at 8df46e3
Merge Queue Status: ✅ The pull request has been merged at 4dc16ff. This pull request spent 35 minutes 10 seconds in the queue, including 34 minutes 52 seconds running CI.
Related issues
Fixes: #5964
Checklist:
- Commit titles and messages follow the guidelines in the developer guide.
- Reviewed the developer guide on Submitting a Pull Request.
- Pending release notes updated with breaking and/or notable changes for the next major release.
Show available bot commands
These commands are normally not required, but in case of issues, leave any of
the following bot commands in an otherwise empty comment in this PR:
/retest ci/centos/<job-name>: retest the <job-name> after an unrelated failure (please report the failure too!)