
Gemma in Model Garden Deployment - confusing section on Chat Applications #2800

Open
afirstenberg opened this issue Mar 25, 2024 · 1 comment

Comments

@afirstenberg

Environment

  • Deployed a gemma-7b-it model on Vertex AI Model Garden using the "Deploy" button from the Gemma card. No additional tuning was done.
  • I have an instance running on a g2-standard-12 machine with an NVIDIA L4 GPU. It is visible in the Online Prediction section of my Cloud Console.
  • I am able to reach the endpoint without any issues.

Since I was unable to find any good documentation on what needs to be sent to the model and what comes back, I used the "Model Garden Gemma Deployment on Vertex" notebook to try to get an idea. (See #2799 for a related issue.)
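For context, here is a minimal sketch of how I am calling the deployed endpoint with the Vertex AI Python SDK. The `prompt`/`max_tokens` instance schema is an assumption taken from the deployment notebook, not from any official documentation (which is part of what this issue is about):

```python
# Sketch: querying a Gemma endpoint deployed from Model Garden.
# ASSUMPTION: the serving container accepts vLLM-style instances with
# "prompt", "max_tokens", and "temperature" keys, as the notebook suggests.

def build_instance(prompt: str, max_tokens: int = 256,
                   temperature: float = 0.7) -> dict:
    """Build a single prediction instance for the deployed endpoint."""
    return {
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    }


def predict(project: str, location: str, endpoint_id: str, prompt: str):
    """Send one instance to the endpoint (needs google-cloud-aiplatform
    installed and application-default credentials configured)."""
    from google.cloud import aiplatform

    endpoint = aiplatform.Endpoint(
        endpoint_name=(
            f"projects/{project}/locations/{location}/endpoints/{endpoint_id}"
        )
    )
    response = endpoint.predict(instances=[build_instance(prompt)])
    return response.predictions
```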

Description:

  • The instructions in the section "Build chat applications with Gemma" indicate that there are templates that define the structure of a conversation.
  • The code shows how to use the template to create a prompt.
  • However, the prompt that is created isn't sent to the model. Instead, the text from a previous example is.
  • Furthermore, even if I do send the text to the model, it doesn't seem to respond any differently.
  • As with Model Garden Gemma Deployment on Vertex - incomplete documentation about prediction response format #2799, the expected response format isn't documented.

See starting
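For reference, this is my understanding of the turn format the notebook's chat template appears to produce. The `<start_of_turn>`/`<end_of_turn>` markers match Gemma's published instruction-tuned convention; how (or whether) the deployed endpoint actually interprets them is exactly what's unclear:

```python
# Sketch of Gemma's instruction-tuned chat format. Each exchange is
# wrapped in <start_of_turn>user / <start_of_turn>model blocks, and the
# prompt ends with an open model turn to cue the next generation.

def build_chat_prompt(history: list[tuple[str, str]],
                      user_message: str) -> str:
    """history: prior (user, model) exchanges; returns a prompt string
    ending with an open model turn."""
    parts = []
    for user_turn, model_turn in history:
        parts.append(f"<start_of_turn>user\n{user_turn}<end_of_turn>\n")
        parts.append(f"<start_of_turn>model\n{model_turn}<end_of_turn>\n")
    parts.append(f"<start_of_turn>user\n{user_message}<end_of_turn>\n")
    parts.append("<start_of_turn>model\n")
    return "".join(parts)
```

As described above, though, sending a prompt built this way did not seem to change the model's responses compared with sending plain text.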

@gericdong
Contributor

@kathyyu-google: could you please assist with this? Thanks.
