A simple template module for evaluating user/runtime-unknown value expressions in a safe manner, using Python's 'eval'.
Updated Oct 25, 2018
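The safe-evaluation idea the first entry describes can be sketched as follows. This is a minimal illustration of the general technique (whitelisting AST nodes and stripping builtins before calling `eval`), not that module's actual API; `safe_eval` and the allowed-node set here are my own assumptions:

```python
import ast

# Only plain arithmetic expressions and variable lookups are permitted.
ALLOWED_NODES = (
    ast.Expression, ast.Constant, ast.Name, ast.Load,
    ast.BinOp, ast.UnaryOp,
    ast.Add, ast.Sub, ast.Mult, ast.Div, ast.Mod, ast.Pow,
    ast.USub, ast.UAdd,
)

def safe_eval(expr, variables=None):
    """Evaluate a user-supplied arithmetic expression, rejecting any
    syntax outside the whitelist (calls, attribute access, imports...)."""
    tree = ast.parse(expr, mode="eval")
    for node in ast.walk(tree):
        if not isinstance(node, ALLOWED_NODES):
            raise ValueError(f"disallowed syntax: {type(node).__name__}")
    # An empty __builtins__ dict blocks access to open(), __import__(), etc.
    return eval(compile(tree, "<expr>", "eval"),
                {"__builtins__": {}}, variables or {})

print(safe_eval("2 * (3 + 4)"))          # plain arithmetic is allowed
print(safe_eval("x + 1", {"x": 2}))      # named values can be passed in
```

An expression such as `__import__('os')` parses to a `Call` node, fails the whitelist check, and is rejected before `eval` ever runs.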
LLM evaluation framework
Benchmark for assessing contextual-semantic sentence models in the Brazilian legal domain.
A tool to perform functional testing and performance testing of the Dhruva Platform
CHECKLIST-style test cases for testing three Hungarian Named Entity Recognition tools.
Flight delay prediction using machine learning.
Most popular metrics used to evaluate object detection algorithms.
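The object-detection metrics in that entry (precision, recall, average precision) are built on Intersection over Union. As a quick illustration of that building block (my own sketch, not code from the repository; the `[x1, y1, x2, y2]` box convention is an assumption):

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes [x1, y1, x2, y2]."""
    # Overlap rectangle: rightmost left edge to leftmost right edge, etc.
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter) if inter else 0.0

print(iou([0, 0, 2, 2], [1, 1, 3, 3]))  # partial overlap
```

A detection typically counts as a true positive when its IoU with a ground-truth box exceeds a threshold such as 0.5.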
This repository contains code for the NLP project "In-context Learning of Pre-trained Language Models for Controlled Dialogue Summarization: A Holistic Benchmark and Empirical Analysis".
Integrated Evaluation Framework - Front-End Web Application
Official repository for the paper *Are Models Biased on Text Without Gender-related Language?*, published at ICLR 2024.
Calculate the calibration of a model on DataSHIELD servers.
Evaluation of drug repositioning methods.
N-Compariw: End-to-End Workflow for Neural Networks Comparison
A hybrid search engine based on the BM25 and VSM retrieval models.
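The BM25 side of such a hybrid engine can be sketched in a few lines. This is a generic Okapi BM25 scorer under the standard formula (with default parameters k1=1.5, b=0.75), not the repository's implementation; `bm25_scores` is my own illustrative name:

```python
import math
from collections import Counter

def bm25_scores(query_terms, docs, k1=1.5, b=0.75):
    """Score each tokenized document in `docs` against `query_terms`
    using Okapi BM25."""
    N = len(docs)
    avgdl = sum(len(d) for d in docs) / N          # average document length
    df = Counter(t for d in docs for t in set(d))  # document frequency per term
    scores = []
    for d in docs:
        tf = Counter(d)
        s = 0.0
        for t in query_terms:
            if t not in tf:
                continue
            # Smoothed inverse document frequency.
            idf = math.log(1 + (N - df[t] + 0.5) / (df[t] + 0.5))
            # Term frequency saturation (k1) and length normalization (b).
            s += idf * tf[t] * (k1 + 1) / (
                tf[t] + k1 * (1 - b + b * len(d) / avgdl))
        scores.append(s)
    return scores

docs = [["the", "cat", "sat"], ["the", "dog", "ran"], ["cat", "cat", "cat"]]
print(bm25_scores(["cat"], docs))
```

A hybrid engine would typically combine these scores with VSM cosine similarities, e.g. by a weighted sum or rank fusion.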
Evaluation Framework for Graph Sample Clustering
REgistration PErformance Assessment Tools
A Visual Dashboard for Fundamental Benchmarking of LLMs