Skip to content

Joint speech-language model - respond directly to audio!

License

Notifications You must be signed in to change notification settings

fixie-ai/gazelle

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Gazelle - Joint Speech Language Model

This repository contains modeling code for the Gazelle Joint Speech Language Model.

gazelle wearing headphones, cartoon style

For some more details, read our blog post or update.

This inference code is heavily derived from Huggingface's Llava implementation. It is not particularly optimized, but we welcome community contributions.

Checkpoints:

Links:

Disclaimer

We are backproppin' on a budget and these initial checkpoints are not very robust to many real-world considerations. We make no representation of warranty or liability.

The v0.2 model responds to jailbreaks and adversarial attacks. We do not suggest using this in production.

License

This modeling code is licensed under Apache 2.0. The v0.2 release is derived from Mistral 7B and is released under the same Apache 2.0 license.

The v0.1 pretrained checkpoints derive from Llama 2, which is governed by the Llama 2 license. You must agree to these terms if you use the pretrained weights!

About

Joint speech-language model - respond directly to audio!

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 95.4%
  • Shell 4.6%