Skip to content

Pull requests: EleutherAI/lm-evaluation-harness

Author
Filter by author
Label
Filter by label
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Milestones
Filter by milestone
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Updated vllm imports in vllm_causallms.py
#1890 opened May 25, 2024 by mgoin Loading…
[HFLM]Add support for Ascend NPU
#1886 opened May 25, 2024 by statelesshz Loading…
Add LegalBench tasks
#1878 opened May 23, 2024 by zafstojano Loading…
Add chat template
#1873 opened May 22, 2024 by KonradSzafer Loading…
Test coverage for optimum_lm.py
#1872 opened May 22, 2024 by zafstojano Loading…
Added tests for Anthropic LLMs
#1868 opened May 21, 2024 by zafstojano Loading…
Draft - Support ov models via genai
#1862 opened May 20, 2024 by sstrehlk Loading…
mmlu-pro for the Italian language
#1860 opened May 19, 2024 by giux78 Loading…
[WIP] Fix NeuralMagic tests
#1859 opened May 19, 2024 by haileyschoelkopf Loading…
Rename lm_eval.logging -> lm_eval.loggers bug Something isn't working.
#1858 opened May 19, 2024 by haileyschoelkopf Loading…
Fix m_mmlu target
#1853 opened May 18, 2024 by jordane95 Loading…
Implement Exams benchmark
#1852 opened May 17, 2024 by snova-zoltanc Loading…
Fix self.max_tokens in anthropic_llms.py
#1848 opened May 16, 2024 by lozhn Loading…
Adding LLaVa support
#1832 opened May 13, 2024 by ashvinnihalani Loading…
Financial PhraseBank (FPB) Eval Metric
#1815 opened May 9, 2024 by bcicc Loading…
Fix cost_estimate.py
#1810 opened May 8, 2024 by xksteven Loading…
Fix --gen_kwargs and VLLM (temperature not respected) bug Something isn't working.
#1800 opened May 7, 2024 by haileyschoelkopf Loading…
Vllm get tokenizer
#1794 opened May 6, 2024 by AguirreNicolas Loading…
add NPU support for huggingface.py
#1787 opened May 6, 2024 by jiaqiw09 Loading…
Group agg rework
#1741 opened Apr 23, 2024 by lintangsutawika Loading…
ProTip! Updated in the last three days: updated:>2024-05-22.