Hi there! I was wondering how you generated this figure in the Sable paper.
I'd like to generate a similar "aggregated evaluator win rate" plot for SMAX, i.e. the win rate averaged across all of the SMAX tasks rather than for a single task. Did you use Marl-eval to generate your aggregated plots? I can't find a similar option there, and I'd like my plot to match yours as closely as possible (which is why I haven't just put my own function together yet).
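For reference, the fallback I have in mind looks roughly like the sketch below. It is not taken from the paper or from Marl-eval: the function name `aggregate_win_rate`, the input layout (one array of shape `(num_seeds, num_eval_steps)` per task) and the choice to average over tasks first and then over seeds are all my own assumptions, so it may not match how the figure was actually produced.

```python
import numpy as np


def aggregate_win_rate(win_rates_by_task):
    """Average win rate across tasks, then summarise across seeds.

    Hypothetical helper, not part of Marl-eval.
    win_rates_by_task: dict mapping a task name to an array of shape
    (num_seeds, num_eval_steps) holding win rates in [0, 1]. All tasks are
    assumed to share the same number of seeds and evaluation steps.
    """
    # Stack into shape (num_tasks, num_seeds, num_eval_steps).
    stacked = np.stack(
        [np.asarray(v, dtype=float) for v in win_rates_by_task.values()], axis=0
    )
    # Assumption: each seed's aggregated score is the plain mean over tasks.
    per_seed = stacked.mean(axis=0)  # (num_seeds, num_eval_steps)
    mean = per_seed.mean(axis=0)     # mean curve across seeds
    std = per_seed.std(axis=0)       # spread across seeds
    return mean, std
```

Plotting `mean` against the evaluation steps with a band of `std` would give one aggregated curve per algorithm, but I'd rather reuse whatever you did if it differs from this.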