Is it possible to fine-tune without Azure AI studio? #286

Open
mark86v1 opened this issue Mar 25, 2024 · 2 comments

Comments

@mark86v1

Is it possible to implement RAFT without Azure AI studio? I am planning to use open-source models.

@Danielskry
Contributor

Danielskry commented Mar 26, 2024

Yes, you can fine-tune or implement RAFT without Azure AI Studio, provided you have access to suitable hardware. The main challenge is determining whether the model you want to fine-tune fits in the available GPU memory (VRAM), e.g., when loading it onto a CUDA device. I suggest using a memory calculator (available from Azure, Hugging Face, etc.) to help select a suitable model and GPU. Fine-tuning can quickly become expensive and may exceed the capabilities of standard consumer GPUs, which is why many people opt for platforms like Azure AI Studio for these tasks.

For example, per the Microsoft blog post, Tianjun and Shishir implemented RAFT on a Llama 2 7B model, noting that "Llama 2 7B is also a perfect model for training on 4 A100-40G GPUs and serving on a single GPU". The price of one A100 40GB GPU is now roughly $8,200 USD.
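
To make the "does it fit in VRAM" question concrete, here is a rough back-of-the-envelope sketch, not a replacement for a proper calculator. The bytes-per-parameter figures are common rules of thumb and are assumptions on my part: about 2 bytes per parameter for fp16 inference, and about 16 bytes per parameter for full fine-tuning with Adam in mixed precision. The sketch ignores activations, batch size, and sequence length, and it assumes training state is sharded evenly across GPUs, so real usage will be higher. The function names are hypothetical.

```python
# Rough VRAM estimate from parameter count alone.
# ASSUMPTION: bytes-per-parameter values are rules of thumb, not measurements.
BYTES_PER_PARAM = {
    "inference_fp16": 2,             # fp16 weights only
    "full_finetune_adam_mixed": 16,  # fp16 weights + grads, fp32 master copy + Adam states
}


def estimate_vram_gb(num_params: float, mode: str) -> float:
    """Estimated memory in GB (1 GB = 1e9 bytes), excluding activations."""
    return num_params * BYTES_PER_PARAM[mode] / 1e9


def fits(num_params: float, mode: str, gpu_gb: float, num_gpus: int = 1) -> bool:
    """True if the estimate fits in the combined memory of the GPUs.

    ASSUMPTION: memory is sharded perfectly across GPUs, which is optimistic.
    """
    return estimate_vram_gb(num_params, mode) <= gpu_gb * num_gpus


if __name__ == "__main__":
    llama2_7b = 7e9  # approximate parameter count
    for mode in BYTES_PER_PARAM:
        print(f"{mode}: ~{estimate_vram_gb(llama2_7b, mode):.0f} GB")
    print("train on 4x A100-40G:", fits(llama2_7b, "full_finetune_adam_mixed", 40, 4))
    print("train on 1x A100-40G:", fits(llama2_7b, "full_finetune_adam_mixed", 40, 1))
    print("serve on 1x A100-40G:", fits(llama2_7b, "inference_fp16", 40, 1))
    # Hardware cost at the per-GPU price quoted above (USD).
    print("4x A100-40G purchase cost:", 4 * 8200)
```

Under these assumptions the estimate lines up with the quoted setup: full fine-tuning of a 7B model needs several GPUs, while serving fits on one. Parameter-efficient methods such as LoRA or quantization would lower the training figure considerably, but they are not covered by this sketch.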

@songole

songole commented Mar 28, 2024

Is there a SkyPilot recipe to fine-tune this on any cloud?
