
Split out updates to model config into its own function and call it during setup#12663

Merged
ananthsub merged 16 commits into NVIDIA-NeMo:mlm-pretrain-loop from
ananthsub:setup-cleanup
Mar 19, 2025

Conversation

@ananthsub
Collaborator

Important

The Update branch button must only be pressed on very rare occasions.
An outdated branch is never blocking the merge of a PR.
Please reach out to the automation team before pressing that button.

What does this PR do?

Add a one line overview of what this PR aims to accomplish.

Collection: [Note which collection this PR will affect]

Changelog

  • Add specific line by line info of high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 
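As a rough illustration of the refactor named in the PR title, the sketch below moves the model-config updates into one function and calls it from setup. It is not the code from this PR: `ModelConfig`, `FakeDDPModel`, `FakeOptimizer`, `update_model_config_funcs`, `setup`, and the field names `no_sync_func` and `grad_scale_func` are all hypothetical stand-ins chosen for the example, and no NeMo or Megatron APIs are used.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ModelConfig:
    """Hypothetical stand-in for a model config; field names are illustrative."""
    no_sync_func: Optional[Callable] = None
    grad_scale_func: Optional[Callable] = None


class FakeDDPModel:
    """Hypothetical stand-in for a DDP-wrapped model that owns a config."""

    def __init__(self, config: ModelConfig) -> None:
        self.config = config

    def no_sync(self) -> str:
        # A real wrapper would return a context manager; a string keeps this runnable.
        return "no_sync context"


class FakeOptimizer:
    """Hypothetical stand-in for an optimizer that can scale the loss."""

    def scale_loss(self, loss: float) -> float:
        return loss * 2.0


def update_model_config_funcs(model, optimizer, use_ddp: bool = True) -> None:
    """Attach the callables the training loop expects to find on the model config."""
    if use_ddp:
        model.config.no_sync_func = model.no_sync
    if optimizer is not None:
        model.config.grad_scale_func = optimizer.scale_loss


def setup(use_ddp: bool = True):
    """Build the model and optimizer, then update the config in one dedicated call."""
    model = FakeDDPModel(ModelConfig())
    optimizer = FakeOptimizer()
    update_model_config_funcs(model, optimizer, use_ddp=use_ddp)
    return model, optimizer
```

Under these assumptions, calling `setup()` returns a model whose config already carries both callables, so the training loop does not need to patch the config itself.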

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI remove and add the label again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

  • Make sure you read and followed Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (Ex: Numba, Pynini, Apex etc)
    • Reviewer: Does the PR have correct import guards for all optional libraries?

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items you can still open "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
Contributor guidelines contains specific people who can review PRs to various areas.

Additional Information

  • Related to # (issue)

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>
Collaborator

@maanug-nv maanug-nv left a comment

LGTM, thanks!

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>
@ananthsub ananthsub merged commit 752edac into NVIDIA-NeMo:mlm-pretrain-loop Mar 19, 2025
7 of 8 checks passed
hemildesai pushed a commit that referenced this pull request Mar 20, 2025
…uring setup (#12663)

* move ddp config updates to setup

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* split into own function

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* names

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* add copyright for example

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* address comments

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* remove changes to get_model_from_config

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* update model config sync funcs as part of get_model_from_config

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* move back to setup, include grad scale func from optimizer

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* lint

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* undo

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* right check

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* keep order

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* keep comment

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* lints

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* remove arg

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

* updates

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>

---------

Signed-off-by: Ananth Subramaniam <ansubramania@nvidia.com>
hemildesai pushed a commit that referenced this pull request Apr 15, 2025
jiemingz pushed a commit that referenced this pull request Jul 10, 2025
2 participants