Skip to content

Latest commit

 

History

History
217 lines (193 loc) · 12.7 KB

LINKS.md

File metadata and controls

217 lines (193 loc) · 12.7 KB

Code to consider including:

flan-alpaca
text-generation-webui
minimal-llama
finetune GPT-NeoX
GPTQ-for_LLaMa
OpenChatKit on multi-GPU
Non-Causal LLM
OpenChatKit_Offload
Flan-alpaca training.py

Some open source models:

GPT-NeoXT-Chat-Base-20B
GPT-NeoX
GPT-NeoX-20B
Pythia-6.9B
Pythia-12B
Flan-T5-XXL
GPT-J-Moderation-6B
OIG safety models
BigScience-mT0
BigScience-XP3
BigScience-Bloomz

Some create commons models that would be interesting to use:

Galactica-120B
LLaMa-small-pt
LLaMa-64b-4bit

Papers/Repos

Self-improve
Coding
self-reflection
RLHF
DERA
HAI Index Report 2023
LLaMa
GLM-130B
RWKV RNN
Toolformer
GPTQ
Retro
Clinical_outperforms
Chain-Of-Thought
scaling law1
Big-bench
Natural-Instructions

Other projects:

StackLLaMa
Alpaca-CoT
ColossalAIChat
EasyLM
Koala
Vicuna
Flan-Alpaca
FastChat
alpaca-lora
alpaca.http
chatgpt-retrieval-pllugin
subtl.ai docs search on private docs
gretel
alpaca_lora_4bit
alpaca_lora_4bit_readme
code alpaca
serge
BlinkDL
RWKV-LM
MosaicCM
OpenAI Plugins
GPT3.5-Turbo-PGVector
LLaMa-Adapter
llama-index
minimal-llama
llama.cpp
ggml
mmap
llama.cpp more
TargetedSummarization
OpenFlamingo
Auto-GPT

Apache2/etc. Data

OIG 43M instructions direct HF link
More on OIG
DataSet Viewer
Anthropic RLHF
WebGPT_Comparisons
Self_instruct
20BChatModelData

Apache2/MIT/BSD-3 Summarization Data

xsum for Summarization
Apache2 Summarization
MIT summarization
BSD-3 summarization
OpenRail
Summarize_from_feedback

Ambiguous License Data

GPT-4-LLM
GPT4All
LinkGPT4
ShareGPT52K
ShareGPT_Vicuna
ChatLogs
Alpaca-CoT
LaMini-LM

Non-commercial Data

GPT-3 based Alpaca Cleaned

Prompt ENGR

Prompt/P-tuning
Prompt/P-tuing Nemo/NVIDIA
Info
Info2
Prompt-Tuning
P-tuning v2
babyagi
APE

Validation

Bleu/Rouge/Meteor/Bert-Score

Generate Hyperparameters

hot-to-generate
Notes_on_Transformers Chpt5
Notes_on_Transformers_Chpt10

Embeddings

OpenAI Expensive?
Leaderboard

Commercial products

OpenAI
OpenAI Tokenizer
OpenAI Playground
OpenAI Chat
OpenAI GPT-4 Chat
cohere
coherefinetune
DocsBotAI
Perplexity
VoiceFlow
NLPCloud

Multinode inference

FasterTransformer
Kubernetes Triton

Faster inference

text-generation-inference
Optimum

Semi-Open source Semi-Commercial products

OpenAssistant
OpenAssistant Repo
OpenChatKit
OpenChatKit2
OpenChatKit3
OpenChatKit4
OpenChatKitPreview
langchain
langchain+pinecone

Q/A docs

HUMATA
OSSCHat
NeuralSearchCohere
ue5

AutoGPT type projects

AgentGPT
Self-DEBUG
BabyAGI
AutoPR

Cloud fine-tune

AWS
AWS2

Chatbots:

GPT4ALL Chat
GLT4ALL
OASSST
FastChat
Dolly
HF Instructions
DeepSpeed Chat
LoraChat
Tabby
TalkToModel
You.com

LangChain or Agent related

Gradio Tools
LLM Agents
Meta Prompt
HF Agents HF Agents Collab Einstein GPT SMOL-AI Pandas-AI

Summaries

LLMs

Deployment

MLC-LLM

Evaluations

LMSYS (check for latest glob)
LMSYS Chatbot Arena
LMSYS Add model
NLL
HackAPrompt