New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How does OpenFunction fit into the context of the OG Gorilla paper? #388
Comments
Hey @sreenivasmrpivot, thanks for your interest. The OG Gorilla paper looked at how do you train (fine-tine) an LLM that can retrieve the right API with the right arguments. This can be used in 0-shot (ChatGPT Style) and with retriever setting! Now Openfunctions series of models are models that we train and you don't need to train any further. Here, the input is basically a prompt and a set of functions, the output from the LLM is a properly formatted json of the right arguments filled in. Hope this helps. Another way to understand this is to look at the two colab examples: |
@ShishirPatil Thanks for your kind response. This really helps in our journey of adopting Gorilla at work. I have a followup query to you and would like to get your expert advice on it. Scenario: So based on your above response, what is the best path for this scenario? OpenFunctions or OG Gorilla with Retriever setting or RAFT? If we opt to go with OpenFunctions, is my understanding with required steps correct?
If we opt to go with RAFT, is my understanding with required steps correct?
Given the above options, can we safely say that the OG Gorilla paper instructions can be ignored by any average users of Gorilla repo for the above mentioned scenarios? If my understanding above is correct, then how does anyone adopt or train a new Gorilla LLM based on some new OpenSource model such as Llama 3 or Mixtral etc? Is there a training procedure doc or .py script for that available in the github repo? I sincerely appreciate your efforts and time here. Clarifying my above queries would help a lot more new users of Gorilla and increase adoption in-turn. Thanks a lot in advance. |
The OG Gorilla paper talks about multiple steps listed below.
How does OpenFunctions V2 fit in the above context?
How can we fine tune the Gorilla Model for our custom API? Is this (https://gorilla.cs.berkeley.edu/blogs/5_how_to_gorilla.html) the way to train?
The text was updated successfully, but these errors were encountered: