Video demo of the ECCV 2022 paper "Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection"
Public repository of our work assessing the impact of missing views in EO applications
Official Repo for "To Find Waldo You Need Contextual Cues: Debiasing Who’s Waldo", ACL 2022 (Short)
Implementation of the CLIP model with reduced capacity, for self-educational purposes only. (A minimal sketch of the contrastive objective behind CLIP-style training appears after this list.)
Implementation of the paper "InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4" (https://arxiv.org/abs/2308.12067)
PyTorch implementation of "Multi-domain translation between single-cell imaging and sequencing data using autoencoders" (https://www.nature.com/articles/s41467-020-20249-2) with custom models.
Learning Latent Semantic Representations of Paintings for Personalized Recommendation
Solution to one of the problems in the 2021 NeurIPS Competition: a self-supervised contrastive learning model that learns matched cell-modality embeddings from 10X Multiome data.
🐘 Uncovering social interests in wildlife
Modality Translation through Conditional Encoder-Decoder (2023-1 Machine Learning for Visual Understanding Team project)
Research repository: Disruption Prediction and Analysis through Multimodal Deep Learning in KSTAR
[AAAI24] Learning Invariant Inter-pixel Correlations for Superpixel Generation
🌈 Official code for "Spatio-Temporal Fuzzy-oriented Multi-modal Meta-learning for Fine-grained Emotion Recognition"
[IROS 2024] PGA: Personalizing Grasping Agents with Single Human-Robot Interaction (under review)
PyTorch implementation of the paper "All For One: Multi-modal Multi-Task Learning"
PyTorch implementation of a multimodal entailment baseline
(BMVC23) Paper on 3D visual question answering from the lab of Prof. Dr. Niessner at the Technical University of Munich.
Deep Symbolic Regression with Multimodal Pretraining
Contrastive-VisionVAE-Follower is a model for the multi-modal task of Vision-and-Language Navigation (VLN).
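Several entries in this list (the reduced-capacity CLIP implementation, the 10X Multiome model, and Contrastive-VisionVAE-Follower) revolve around the same building block: a symmetric contrastive (InfoNCE) objective that aligns paired embeddings produced by two modality encoders. Below is a minimal PyTorch sketch of that objective, not code taken from any repository above; the function name, embedding dimension, and the 0.07 temperature are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def clip_style_loss(a_emb: torch.Tensor, b_emb: torch.Tensor,
                    temperature: float = 0.07) -> torch.Tensor:
    """Symmetric InfoNCE loss over a batch of paired embeddings.

    a_emb, b_emb: (batch, dim) outputs of two modality encoders,
    where matched pairs share a row index. The 0.07 temperature is
    an illustrative default, not a value from any repo above.
    """
    # L2-normalize so the dot product equals cosine similarity.
    a_emb = F.normalize(a_emb, dim=-1)
    b_emb = F.normalize(b_emb, dim=-1)

    # (batch, batch) similarity matrix; the diagonal holds true pairs.
    logits = a_emb @ b_emb.t() / temperature
    targets = torch.arange(logits.size(0), device=logits.device)

    # Cross-entropy in both directions: A-to-B and B-to-A retrieval.
    loss_ab = F.cross_entropy(logits, targets)
    loss_ba = F.cross_entropy(logits.t(), targets)
    return (loss_ab + loss_ba) / 2

# Usage with random tensors standing in for encoder outputs
# (e.g. image and text, or paired cell-modality embeddings).
if __name__ == "__main__":
    a = torch.randn(8, 256)  # modality A: batch of 8, dim 256
    b = torch.randn(8, 256)  # modality B: paired row-for-row with A
    print(clip_style_loss(a, b).item())
```

The same loss applies whenever two encoders produce row-aligned pairs, which is why it recurs across the image-text, cell-modality, and navigation settings collected under this topic.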