You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Instead of Beginner, Intermediate, Advanced in the current hierarchy of:
Beginner
Serve ML Models
Serve a Stable Diffusion Model
Serve a Text Classification Model
Serve an Object Detection Model
Intermediate
Serve an Inference Model on AWS NeuronCores Using FastAPI
Serve an Inference with Stable Diffusion Model on AWS NeuronCores Using FastAPI
Serve a model on Intel Habana Gaudi
Scale a Gradio App with Ray Serve
Serve a Text Generator with Request Batching
Serve a Chatbot with Request and Response Streaming
Serving models with Triton Server in Ray Serve
Advanced
Serve a Java App
We want to organize it into:
ML Applications
Serve ML Models
Serve a Stable Diffusion Model
Serve a Text Classification Model
Serve an Object Detection Model
Serve a Chatbot with Request and Response Streaming
Integrations
Scale a Gradio App with Ray Serve
Serve a Text Generator with Request Batching
Serving models with Triton Server in Ray Serve
Serve a Java App
AI Accelerators
Serve an Inference Model on AWS NeuronCores Using FastAPI
Serve an Inference with Stable Diffusion Model on AWS NeuronCores Using FastAPI
Serve a model on Intel Habana Gaudi
*Eventual AMD examples and further examples for different AI Accelerator Types
The text was updated successfully, but these errors were encountered:
anyscalesam
added
docs
An issue or change related to documentation
P1.5
Issues that will be fixed in a couple releases. It will be bumped once all P1s are cleared
labels
Apr 3, 2024
Maybe instead of Model types we call it ML Applications .
"Serve a Text Generator with Request Batching
Serve a Chatbot with Request and Response Streaming"
should go under that
anyscalesam
added
P2
Important issue, but not time-critical
and removed
P1.5
Issues that will be fixed in a couple releases. It will be bumped once all P1s are cleared
labels
Apr 23, 2024
Description
@peytondmurray, @akshay-anyscale @angelinalg and I had a discussion on how better to organize the Ray Serve examples page.
Instead of Beginner, Intermediate, Advanced in the current hierarchy of:
Beginner
Serve ML Models
Serve a Stable Diffusion Model
Serve a Text Classification Model
Serve an Object Detection Model
Intermediate
Serve an Inference Model on AWS NeuronCores Using FastAPI
Serve an Inference with Stable Diffusion Model on AWS NeuronCores Using FastAPI
Serve a model on Intel Habana Gaudi
Scale a Gradio App with Ray Serve
Serve a Text Generator with Request Batching
Serve a Chatbot with Request and Response Streaming
Serving models with Triton Server in Ray Serve
Advanced
Serve a Java App
We want to organize it into:
ML Applications
Serve ML Models
Serve a Stable Diffusion Model
Serve a Text Classification Model
Serve an Object Detection Model
Serve a Chatbot with Request and Response Streaming
Integrations
Scale a Gradio App with Ray Serve
Serve a Text Generator with Request Batching
Serving models with Triton Server in Ray Serve
Serve a Java App
AI Accelerators
Serve an Inference Model on AWS NeuronCores Using FastAPI
Serve an Inference with Stable Diffusion Model on AWS NeuronCores Using FastAPI
Serve a model on Intel Habana Gaudi
*Eventual AMD examples and further examples for different AI Accelerator Types
Link
https://docs.ray.io/en/master/serve/examples.html
The text was updated successfully, but these errors were encountered: