Skip to content

Actions: natolambert/rlhf-book

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
216 workflow runs
216 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Fix: Include code redirect in build (#225)
Deploy static content to Pages #254: Commit adc78b2 pushed by natolambert
7m 38s main
Add rlhfbook.com/code redirect (#224)
Deploy static content to Pages #253: Commit 4896870 pushed by natolambert
6m 2s main
Fix diagrams Makefile and remove unused diagrams (#223)
Deploy static content to Pages #252: Commit c81b2df pushed by natolambert
6m 30s main
Fix directory structure in README to match actual layout
Deploy static content to Pages #251: Commit c37db8a pushed by natolambert
10m 58s main
Clean up root files: gitignore, favicon, pyproject (#222)
Deploy static content to Pages #250: Commit 6643bf9 pushed by natolambert
5m 18s main
[WIP] Add Code Library (#219)
Deploy static content to Pages #248: Commit d54f445 pushed by natolambert
6m 30s main
Add equation labels to chapters 4 and 10 (#218)
Deploy static content to Pages #247: Commit 1944b65 pushed by natolambert
13m 10s main
Fix PPO implementation bugs: returns and KL masking (#216)
Deploy static content to Pages #245: Commit 5a1f43b pushed by natolambert
11m 58s main
Clarify DPO paper quote in chapter 12 (#215)
Deploy static content to Pages #244: Commit 9137ab2 pushed by natolambert
11m 40s main
Add CartPole diagram to chapter 4 (#213)
Deploy static content to Pages #243: Commit aa515bf pushed by natolambert
8m 23s main
Delete images/timeline-v2.png
Deploy static content to Pages #242: Commit 6c341f0 pushed by natolambert
13m 11s main
Fix bibliography citation errors and hallucinations (#211)
Deploy static content to Pages #241: Commit 32726e4 pushed by natolambert
8m 11s main
Add distillation diagrams to synthetic data chapter (#209)
Deploy static content to Pages #239: Commit 9cbdebf pushed by natolambert
6m 36s main
Add TikZ diagram comparing PPO, GRPO, and RLOO (#208)
Deploy static content to Pages #238: Commit a7d8736 pushed by natolambert
14m 0s main
Add PPO vs GRPO architecture comparison diagram (#207)
Deploy static content to Pages #237: Commit 33e6b6f pushed by natolambert
8m 38s main
Update copyright to include 2026 (#205)
Deploy static content to Pages #235: Commit 92e656c pushed by natolambert
8m 15s main
explicitly state the dpo loss from bt model (#202)
Deploy static content to Pages #233: Commit 993e7da pushed by natolambert
9m 53s main
Uppercase the ratio term R_t in PPO objective (#201)
Deploy static content to Pages #231: Commit b4a47aa pushed by natolambert
8m 19s main
Wrap up RL chapter & tooling stability improvements (#200)
Deploy static content to Pages #230: Commit f2adb7c pushed by natolambert
6m 28s main