Codebase for ICML 2025 Paper
[ Preprint ] | [Embeddings]]
We have a series of work focusing on reward models in RLHF:
- Part I. Reward Model Foundation ICLR'2025 Oral, Code Repo
- Part II. Active Reward Modeling (This paper/repo) ICML 2025
- Part III. Accelerating Reward Model Research with our Infra. Preprint, Code Repo
The algorithms we tested were implemented in model, there are two algorithms from other authors, namely coreset (Huggins et al. 2016) in lrcoresets and batchBALD (Kirsch et al 2019) in batchbald_redux, we did minimal modification to make sure then can be compatible with our computation environment.
The experiment code to be released soon after we remove unnecessary parts due to our specific computation environment.