#

instruction-tuning

Here are 113 public repositories matching this topic...

Nips20262 / Nips20262

Language Models Resist Alignment

alignment theory llm rlhf instruction-tuning unalignment

Updated Jun 7, 2024
Python

davidandym / Multitask-Transfer-Instruction-Tuning

This is the official code repository for the ACL Findings Paper "Multi-Task Transfer Matters During Instruction-Tuning"

transfer-learning multi-task-learning in-context-learning large-language-models instruction-tuning

Updated May 28, 2024

aitor-alvarez / llm-evaluation

Evaluating Large Language Models with Instructions and Prompts

t5-model large-language-models prompt-engineering instruction-tuning

Updated Jan 28, 2024
Python

byunsj / KoTox-Korean-Toxic-Instruction-Dataset

KoTox is an automatically generated instruction dataset in Korean. The instruction set is used to mitigate the toxicity of the LLMs.

korean instruction-set toxicity large-language-models instruction-tuning

Updated Jan 6, 2024

timothyckl / self-instruct

an instruction-tuning dataset generation script

python language-model instruction-tuning

Updated Apr 7, 2024
Python

devashish-gupta / Instruct-Nav

A multimodal model for language-guided socially compliant robot navigation.

robotics navigation vlm vision-and-language omniverse instruction-tuning

Updated Apr 27, 2024
Jupyter Notebook

minseok0809 / question-answering-with-lora

Basline: google/flan-t5 Finetuning: LMQG , LoRA

text-generation question-answering lora fine-tuning huggingface t5 instruction-tuning

Updated Apr 24, 2024
Jupyter Notebook

smendes2901 / movie_chat_bot

This repository contains the implementation of a fine-tuned Llama2 chatbot using QLoRA, tailored to provide detailed information and recommendations about movies. The model is fine-tuned on the IMDB dataset, enabling it to generate informed and contextually relevant responses.

vector chatbot pandas pytorch embeddings sentence recommender-system fine-tuning huggingface llm instruction-tuning llama2

Updated May 19, 2024
Jupyter Notebook

YutongWang1216 / ReflectionLLMMT

natural-language-processing machine-translation self-reflection large-language-models instruction-tuning

Updated May 30, 2024
Python

ymnseol / weekly-paper-reading-group

Summaries of papers related to the alignment problem in NLP

nlp natural-language-processing rlhf instruction-tuning reinforcement-learning-from-human-feedback

Updated May 29, 2023

BoutainaELYAZIJI / Universal-NER

Implementation of the models of the Universal-NER Paper 2024 using a Streamlit-based web application that is designed to process PDF documents for Named Entity Recognition tasks. It allows users to upload PDF files, from which the application extracts text, images, and tables to identify entities based on a user-specific user-specified entity type.

nlp named-entity-recognition huggingface streamlit langchain instruction-tuning

Updated Feb 18, 2024
Jupyter Notebook

NoisyStudents / NoisyABSA

Domain generalization on Aspect Based Sentiment Analysis (ABSA) task via utilizing noisy student architecture.

nlp machine-learning cross-domain absa aspect-based-sentiment-analysis domain-generalization noisy-student large-language-models instruction-tuning

Updated Jun 28, 2023
Jupyter Notebook

jackfsuia / chats-crawler

Discourse chat data crawling and on-the-way parsing straight for LLM instruction finetuning. 论坛数据爬取和解析，直接用于对话微调。

nlp parser crawler gpt nlp-parsing html-css-javascript fine-tuning llm llms instruction-tuning llm-training finetune-llm

Updated May 6, 2024
TypeScript

KomeijiForce / Incubator

This repo is the official implementation for Incubating Text Classifiers Following User Instruction with Nothing but LLM. We allow users to get a personalized classifier with only the instruction as input. The incubation is based on a llama-2-7b fine-tuned on Huggingface Meta Data and Self-Diversification.

text-classification zero-shot-learning diversification instruction-tuning model-incubation

Updated May 26, 2024
Python

Danitilahun / LLM_Projects

This repository has a lot of LLM projects done. It is the best place to start learning LLM.

transformer gemini llama gpt fine-tuning gpt-3 large-language-models llm langchain instruction-tuning vllm retrieval-augmented-generation

Updated May 3, 2024
Python

Reason-Wang / InstructLLM

The official implementation of paper "Demystifying Instruction Mixing for Fine-tuning Large Language Models"

nlp transformers fine-tuning deepspeed llm instruction-tuning llama2

Updated Jan 5, 2024
Python

LLMath-QLoRA

Logisx / LLMath-QLoRA

End-to-end MLOps LLM instruction finetuning based on PEFT & QLoRA to solve math problems.

nlp docker aws machine-learning transformers llama dvc fastapi llm llmops instruction-tuning

Updated Jan 14, 2024
Jupyter Notebook

ParthaPRay / LLM-Learning-Sources

This repo contains a list of channels and sources from where LLMs should be learned

lora fine-tuning huggingface instruction-following prompt-engineering generative-ai instruction-tuning large-language-model retrieval-augmented-generation multi-modal-llms pervasive-generative-ai iot-generative-ai

Updated Jun 7, 2024

tamlhp / awesome-instruction-editing

Awesome Instruction Editing. Image and Media Editing with Human Instructions. Instruction-Guided Image and Media Editing.

image-editing audio-editing video-editing media-editing image-edit instruction-following text-guided-image-editing text-guidance instruction-tuning instruction-learning text-guided music-editing instruction-editing instruction-guided instructional-edit instructional-editing

Updated Apr 14, 2024

crux82 / BISS-2024

This repository hosts materials from the Bertinoro International Spring School 2024 course

nlp transformers llama distributional-semantics large-language-models instruction-tuning llama2

Updated Mar 12, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the instruction-tuning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the instruction-tuning topic, visit your repo's landing page and select "manage topics."