Official code for Paper "Mantis: Multi-Image Instruction Tuning"
-
Updated
May 23, 2024 - Python
Official code for Paper "Mantis: Multi-Image Instruction Tuning"
React component library for crafting user-friendly and engaging conversational experiences
The easiest way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Multi-model Inference Graph/Pipelines, LLM/RAG apps, and more!
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
ALICE and its prior work, implementation of paper and Unity Package "Voice2Action: Language Models as Agent for Efficient Real-Time Interaction in VR".
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
Orchestrate Swarms of Agents From Any Framework Like OpenAI, Langchain, and Etc for Business Operation Automation. Join our Community: https://discord.gg/DbjBMJTSWD
Beyond Entities: A Large-Scale Multi-Modal Knowledge Graph with Triplet Fact Grounding
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
autoupdate paper list
Implementation for the different ML tasks on Kaggle platform with GPUs.
VisualWebArena is a benchmark for multimodal agents.
Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.
Stable Diffusion and LLMs offline on your own hardware
A Benchmark Dataset for Multimodal Scientific Fact Checking
Build real-time multimodal AI applications 🤖🎙️📹
Images to inference with no labeling (use foundation models to train supervised models).
The World's Largest Decentralized AGI Multimodal Dataset
This repository is used to collect papers and code in the field of AI.
Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Audio, Image, Video, Music and 3D content. 🔥
Add a description, image, and links to the multimodal topic page so that developers can more easily learn about it.
To associate your repository with the multimodal topic, visit your repo's landing page and select "manage topics."