Pull requests: EleutherAI/lm-evaluation-harness
Pull requests list
Multiple Choice Questions and Large Languages Models: A Case Study with Fictional Medical Data
#1867
opened May 21, 2024 by
maximegmd
Rename lm_eval.logging -> lm_eval.loggers
bug
#1858
opened May 19, 2024 by
haileyschoelkopf
Force BOS token usage in 'gemma' models for VLLM
#1857
opened May 19, 2024 by
haileyschoelkopf
Add option in TaskManager to not index library default tasks ; Tests for include_path
feature request
#1856
opened May 19, 2024 by
haileyschoelkopf
Fix --gen_kwargs and VLLM (temperature not respected)
bug
#1800
opened May 7, 2024 by
haileyschoelkopf
Make scripts.write_out error out when no splits match
#1796
opened May 7, 2024 by
haileyschoelkopf
Create task dharma2 - a small (300 qs) & wide (many topics) dataset
#1753
opened Apr 26, 2024 by
UmerHA