@garlick (Member) commented Jan 29, 2026

Problem: on a system with an Nvidia GPU and hwloc 2.7.0, some tests in t3218-resource-verify.t are failing.

The failing tests check cores, not GPUs, so just ignore the GPU counts by setting resource.verify.gpu=ignore.

Works around #7332
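
For illustration only, the two overrides used by the affected tests could be assembled as below. This is a minimal sketch: `verify_conf_opts` is a hypothetical helper that is not part of the flux test suite, and only the `allow-extra` and `ignore` values that appear in this PR are assumed to be valid.

```sh
#!/bin/sh
# Hypothetical helper, not part of flux-core: print the --conf overrides
# for core and GPU verification in the form used by the test in this PR.
verify_conf_opts() {
    # $1: mode for resource.verify.core, $2: mode for resource.verify.gpu
    printf '%s %s\n' "--conf=resource.verify.core=$1" \
                     "--conf=resource.verify.gpu=$2"
}

# Modes taken from the test shown in this PR
verify_conf_opts allow-extra ignore
```

The printed options would then be appended to a `flux alloc` command line like the one in the test under review.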

@grondo (Contributor) left a comment

Thanks! Just made a note to remove the previous fix for the same problem on some RSMI systems from the top of the file.

test_expect_success MULTICORE 'fewer cores: allow-extra allows extra cores' '
flux alloc -n1 -o cpu-affinity=off \
--conf=resource.verify.core=allow-extra \
--conf=resource.verify.gpu=ignore \

Attempts to work around this same issue on RSMI systems were made in commit 56dfdd7 and then in #7296. This fix supersedes those, so the block at the top of this file that sets HWLOC_COMPONENTS=-rsmi when necessary can be removed in this commit.

codecov bot commented Jan 30, 2026

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 83.53%. Comparing base (8010804) to head (83f0a8b).
⚠️ Report is 4 commits behind head on master.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #7333      +/-   ##
==========================================
- Coverage   83.55%   83.53%   -0.02%     
==========================================
  Files         562      562              
  Lines       93782    93782              
==========================================
- Hits        78361    78344      -17     
- Misses      15421    15438      +17     

see 13 files with indirect coverage changes


Problem: on a system with an Nvidia GPU and hwloc 2.7.0, some tests
in t3218-resource-verify.t are failing.

The failing tests check cores, not GPUs, so just ignore the GPU
counts by setting resource.verify.gpu=ignore.

Drop the previous work-around for RSMI systems, added in
flux-framework#7288 (56dfdd7) and tweaked in flux-framework#7296,
which is superseded by this change.

Works around flux-framework#7332

@garlick (Member, Author) commented Jan 30, 2026

Thanks - I added that and will set MWP.
