How does OpenFunction fit into the context of the OG Gorilla paper? #388

sreenivasmrpivot · 2024-04-26T22:39:45Z

The OG Gorilla paper talks about multiple steps listed below.

API Collection
Hand-generate Instruction-API pairs
Choose Instruction-API pairs
Generate Instruction-API pairs corpus
Convert Instruction-API pairs to user-agent chat style conversations
Instruction Fine-tune LLaMA 2

How does OpenFunctions V2 fit in the above context?

How can we fine-tune the Gorilla model for our custom API? Is this (https://gorilla.cs.berkeley.edu/blogs/5_how_to_gorilla.html) the way to train?

ShishirPatil · 2024-04-27T20:48:47Z

Hey @sreenivasmrpivot, thanks for your interest.

The OG Gorilla paper looked at how to train (fine-tune) an LLM that can retrieve the right API with the right arguments. This can be used in a 0-shot (ChatGPT-style) setting as well as with a retriever!

The OpenFunctions series, by contrast, are models that we train, so you don't need to train them any further. Here, the input is a prompt plus a set of functions, and the output from the LLM is properly formatted JSON with the right arguments filled in.
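To make the input/output shape concrete, here is a minimal sketch in Python. It assumes the common JSON-schema style of function description; the `get_weather` function, its parameters, and the sample output string are hypothetical and not taken from the OpenFunctions repo. No model is called, and the exact output format may differ by model version and hosting.

```python
import json

# Hypothetical function description (illustrative only).
functions = [
    {
        "name": "get_weather",
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
                "unit": {"type": "string", "enum": ["celsius", "fahrenheit"]},
            },
            "required": ["city"],
        },
    }
]

prompt = "What's the weather like in Berkeley in celsius?"

# The input side: a prompt plus the set of available functions.
request = {"prompt": prompt, "functions": functions}

# Assumed example of what a function-calling model could return.
sample_output = (
    '{"name": "get_weather", '
    '"arguments": {"city": "Berkeley", "unit": "celsius"}}'
)


def validate_call(raw, functions):
    """Parse the model output and check it against the declared functions."""
    call = json.loads(raw)
    spec = next(f for f in functions if f["name"] == call["name"])
    params = spec["parameters"]
    missing = [p for p in params["required"] if p not in call["arguments"]]
    unknown = [a for a in call["arguments"] if a not in params["properties"]]
    if missing or unknown:
        raise ValueError(f"missing={missing}, unknown={unknown}")
    return call


call = validate_call(sample_output, functions)
print(call["name"], call["arguments"])
```

Validating the returned call against the declared schema before executing anything is a cheap guard when the functions map to real APIs.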

Hope this helps. Another way to understand this is to look at the two colab examples:
Gorilla demo: https://colab.research.google.com/drive/1DEBPsccVLF_aUnmD0FwPeHFrtdC0QIUP?usp=sharing
OpenFunctions demo: https://gorilla.cs.berkeley.edu/leaderboard.html#api-explorer

sreenivasmrpivot · 2024-04-28T15:14:18Z

@ShishirPatil Thanks for your kind response. This really helps in our journey of adopting Gorilla at work.

I have a follow-up question and would like your expert advice on it.

Scenario:
We have a lot of internal enterprise REST APIs that we want to integrate with a Gorilla-style LLM (either the classic OG Gorilla implementation or OpenFunctions).

So based on your response above, what is the best path for this scenario? OpenFunctions, OG Gorilla with the retriever setting, or RAFT?

If we opt to go with OpenFunctions, is my understanding of the required steps correct?

Add all your enterprise REST APIs to the apizoo folder in the required format (is this step required for using OpenFunctions, or should we just inject all possible functions in each invocation?)
Then run one of the inference modes from the repo (gorilla/openfunctions), depending on your hosting
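For the "inject per invocation" option, one possible approach is to describe each REST endpoint as a function spec and pass only the most relevant ones with each request. The sketch below is an assumption, not the Gorilla repo's method: the endpoint names, paths, and the naive keyword-overlap selection are all hypothetical.

```python
import re

# Hypothetical internal endpoints (illustrative only).
ENDPOINTS = [
    {
        "name": "create_ticket",
        "method": "POST",
        "path": "/tickets",
        "description": "Create a support ticket",
        "params": {"title": "string", "priority": "string"},
    },
    {
        "name": "get_invoice",
        "method": "GET",
        "path": "/invoices/{id}",
        "description": "Fetch an invoice by id",
        "params": {"id": "string"},
    },
]


def to_function_spec(endpoint):
    """Convert an endpoint description into a JSON-schema style function spec."""
    return {
        "name": endpoint["name"],
        "description": (
            f'{endpoint["description"]} '
            f'({endpoint["method"]} {endpoint["path"]})'
        ),
        "parameters": {
            "type": "object",
            "properties": {
                name: {"type": kind} for name, kind in endpoint["params"].items()
            },
            "required": list(endpoint["params"]),
        },
    }


def tokens(text):
    """Lowercase word set; underscores and punctuation act as separators."""
    return set(re.findall(r"[a-z]+", text.lower()))


def select_functions(prompt, endpoints, k=1):
    """Pick the k endpoints whose name/description overlap most with the prompt."""
    words = tokens(prompt)
    ranked = sorted(
        endpoints,
        key=lambda ep: len(words & tokens(ep["name"] + " " + ep["description"])),
        reverse=True,
    )
    return [to_function_spec(ep) for ep in ranked[:k]]


selected = select_functions(
    "Please create a ticket for the broken printer", ENDPOINTS
)
print([spec["name"] for spec in selected])
```

With many endpoints, the keyword overlap here would likely be replaced by a proper retriever; the point is only that the function list sent to the model can be a per-request subset.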

If we opt to go with RAFT, is my understanding of the required steps correct?

Add all your enterprise REST APIs to the apizoo folder in the required format
Then run RAFT as per the instructions in the repo (gorilla/raft/README.md)
This fine-tunes the OpenFunctions-based model and helps it perform better for a specific domain, as per (https://gorilla.cs.berkeley.edu/blogs/9_raft.html)

Given the above options, can we safely say that average users of the Gorilla repo can ignore the OG Gorilla paper instructions for the scenarios mentioned above?

If my understanding above is correct, then how does anyone adopt or train a new Gorilla LLM based on a new open-source model such as Llama 3 or Mixtral? Is there a training procedure doc or .py script for that available in the GitHub repo?

I sincerely appreciate your efforts and time here. Clarifying my queries above would help many more new users of Gorilla and, in turn, increase adoption.

Thanks a lot in advance.
