Composition of Multimodal Language Models From Scratch
Large Chinese Language-and-Vision Assistant for BioMedicine (a Chinese medical multimodal large language model)
Code for the MultipanelVQA benchmark "Muffin or Chihuahua? Challenging Large Vision-Language Models with Multipanel VQA"
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions
Awesome list for attacks on large language models.
Example code for fine-tuning multimodal large language models with LLaMA-Factory (Demo of Finetuning Multimodal LLM with LLaMA-Factory); a minimal invocation sketch follows this list.
MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery
Datasets, case studies and benchmarks for extracting structured information from PDFs, HTML files or images, created by the Parsee.ai team. Datasets also on Hugging Face: https://huggingface.co/parsee-ai
Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"
A Video Chat Agent with Temporal Prior
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigation
Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning
This repository includes the official implementation of our paper "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"
Unified Multi-modal Image Aesthetic Assessment (IAA) Baseline and Benchmark
A collection of visual instruction tuning datasets.
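As a companion to the LLaMA-Factory demo listed above, here is a minimal sketch of driving a multimodal LoRA fine-tuning run from Python. The YAML keys follow LLaMA-Factory's published LoRA SFT examples, but the exact key names, the mllm_demo dataset, and the vicuna template are assumptions that vary by LLaMA-Factory version; check the repository's examples/ directory before relying on them.

# Minimal sketch: write a LLaMA-Factory config and launch training.
# Key names, dataset, and template below are assumptions taken from the
# project's example configs and may differ across versions.
import subprocess

import yaml  # pip install pyyaml

config = {
    # Any LLaVA-style checkpoint from the Hugging Face Hub (assumed choice).
    "model_name_or_path": "llava-hf/llava-1.5-7b-hf",
    "visual_inputs": True,           # enable the vision tower (assumed flag)
    "stage": "sft",                  # supervised fine-tuning
    "do_train": True,
    "finetuning_type": "lora",
    "lora_target": "q_proj,v_proj",
    "dataset": "mllm_demo",          # bundled multimodal demo dataset (assumed name)
    "template": "vicuna",
    "cutoff_len": 1024,
    "output_dir": "saves/llava1_5-7b/lora/sft",
    "per_device_train_batch_size": 1,
    "gradient_accumulation_steps": 8,
    "learning_rate": 1.0e-4,
    "num_train_epochs": 3.0,
    "fp16": True,
}

with open("mllm_sft.yaml", "w") as f:
    yaml.safe_dump(config, f)

# llamafactory-cli ships with LLaMA-Factory; its `train` subcommand
# consumes a YAML config like the one written above.
subprocess.run(["llamafactory-cli", "train", "mllm_sft.yaml"], check=True)

The same YAML can be passed to llamafactory-cli train directly from the shell; generating it from Python simply makes it easy to sweep hyperparameters across runs.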