Skip to content

Pull requests: stanford-crfm/helm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Added COT Metric and Adapter to MMLU Pro
#3162 opened Nov 15, 2024 by siyagoel Loading…
Added Metric for COT
#3159 opened Nov 14, 2024 by siyagoel Loading…
Build frontend
#3155 opened Nov 12, 2024 by github-actions bot Loading…
Adding WildBench
#3150 opened Nov 12, 2024 by liamjxu Loading…
2 tasks done
IBM Enterprise Scenarios
#3064 opened Oct 16, 2024 by yifanmai Draft
Medhelm
#3038 opened Oct 2, 2024 by aunell Loading…
New safety scenario: HarmBench GCG-T
#3035 opened Oct 1, 2024 by farzaank Loading…
Documentation: Evaluation run lifecycle
#2506 opened Mar 25, 2024 by yifanmai Loading…
Remove AdapterSpec from metrics
#2244 opened Jan 17, 2024 by yifanmai Draft
Numeracy scenario update
#1978 opened Nov 2, 2023 by friedeggs Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.