Python SDK for running evaluations on LLM generated responses
Open-source Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks, such as CrewAI, LangChain, and AutoGen.
Harmonizing clinical and genetic data to enhance the precision and efficiency of glioma diagnosis.
Python client for Kolena's machine learning testing platform
A collection of color and style transfer algorithms and objective evaluation metrics.
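One of the simplest objective metrics used in this setting is PSNR (peak signal-to-noise ratio), which scores how closely a stylized or color-transferred image matches a reference. A minimal NumPy sketch (the function name and array inputs are illustrative, not any listed library's API):

```python
import numpy as np

def psnr(img_a, img_b, max_val=255.0):
    """Peak signal-to-noise ratio between two same-shaped images, in dB."""
    mse = np.mean((img_a.astype(np.float64) - img_b.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_val ** 2 / mse)
```

Higher is better; identical images give infinite PSNR, and a maximally wrong 8-bit image gives 0 dB.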
This repository contains a Jupyter Notebook exploring the adult income dataset. The notebook performs Exploratory Data Analysis (EDA), including visualizations with charts and graphs. Additionally, it implements various classification models to predict income and analyzes their accuracy.
The LLM Evaluation Framework
[CVPR 2024] On the Content Bias in Fréchet Video Distance
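For context, Fréchet Video Distance (like FID) compares Gaussian fits to two sets of feature vectors using the closed-form Fréchet distance between Gaussians. A minimal NumPy/SciPy sketch of that distance (function name and inputs are illustrative; a real FVD pipeline would first extract features with a pretrained video network):

```python
import numpy as np
from scipy.linalg import sqrtm

def frechet_distance(feats_a, feats_b):
    """Fréchet distance between Gaussians fit to two (n_samples, dim) feature sets."""
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    cov_a = np.cov(feats_a, rowvar=False)
    cov_b = np.cov(feats_b, rowvar=False)
    covmean = sqrtm(cov_a @ cov_b)
    if np.iscomplexobj(covmean):
        covmean = covmean.real  # discard tiny imaginary parts from numerics
    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(cov_a + cov_b - 2.0 * covmean))
```

The distance is zero for identical distributions and grows as the means and covariances of the two feature sets diverge.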
This is a collection of all the machine learning techniques required in any machine learning project. It contains detailed descriptions, videos, book recommendations, and additional material to properly grasp all the concepts.
Project page for our paper "REALY: Rethinking the Evaluation of 3D Face Reconstruction".
LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally, alongside its recently released LLM data processing library datatrove and LLM training library nanotron.
Counting-Stars (★)
This repository houses my solutions to a diverse range of machine learning assignments and projects completed during ML Zoomcamp and MLOps Zoomcamp, two comprehensive machine learning boot camps.
Open-Source Evaluation for GenAI Application Pipelines
A library for evaluating Retrieval-Augmented Generation (RAG) systems
Official repository for “PATE: Proximity-Aware Time series anomaly Evaluation”.
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
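A common metric in this family is context precision: the fraction of retrieved passages that are actually relevant to the query. A minimal sketch of the idea in plain Python (this is a generic illustration, not the API of any library listed here; real RAG evaluators typically judge relevance with an LLM rather than an exact-match set):

```python
def context_precision(retrieved, relevant):
    """Fraction of retrieved passages that appear in the relevant set (precision@k)."""
    if not retrieved:
        return 0.0
    relevant = set(relevant)
    return sum(1 for passage in retrieved if passage in relevant) / len(retrieved)
```

A retriever that returns four passages of which two are relevant scores 0.5; perfect retrieval scores 1.0.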
Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.
Evaluate the quality of SRT files using the multilingual multimodal SONAR model.
Valor is a centralized evaluation store which makes it easy to measure, explore, and rank model performance.