Actions: sierra-research/tau2-bench


Showing runs from all workflows
38 workflow runs

Qwen3 max thinking submission (#149)
Deploy Leaderboard to GitHub Pages #17: Commit 70e700c pushed by benshi34
1m 6s main
Copilot code review
Copilot code review #1: by Copilot AI
3m 7s
Fix toolorchestrator trajectory loading (#120)
Deploy Leaderboard to GitHub Pages #16: Commit 337326e pushed by benshi34
54s main
Submit Toolorchestra to leaderboard - Revised (#119)
Deploy Leaderboard to GitHub Pages #15: Commit 1b67be7 pushed by benshi34
57s main
Leaderboard/qwen 3 max thinking preview (#117)
Deploy Leaderboard to GitHub Pages #14: Commit 5704c21 pushed by benshi34
1m 27s main
support custom scaffold on leaderboard view (#101)
Deploy Leaderboard to GitHub Pages #13: Commit b00cd42 pushed by benshi34
1m 42s main
Deploy Leaderboard to GitHub Pages
Deploy Leaderboard to GitHub Pages #10: Manually run by benshi34
1m 25s main
Deploy Leaderboard to GitHub Pages
Deploy Leaderboard to GitHub Pages #9: Manually run by benshi34
5m 38s main