Issues: huggingface/accelerate
Issues list
GPU Memory Imbalance and OOM Errors During Training
#2789
opened May 17, 2024 by
DONGRYEOLLEE1
2 of 4 tasks
[DeepSpeed] Asking for feedback when training with zero2 with accelerate and diffusers
#2787
opened May 16, 2024 by
sayakpaul
AcceleratorState object has no attribute distributed_type.
#2786
opened May 16, 2024 by
evelinamorim
2 of 4 tasks
Unable to launch DeepSpeed multinode training with a heterogenous mix of # devices per node.
#2780
opened May 14, 2024 by
iantbutler01
2 of 4 tasks
Unable to load mistralai/Mixtral-8x7B-Instruct-v0.1 using mps
#2778
opened May 14, 2024 by
chimezie
2 of 4 tasks
Accelerate FSDP RuntimeError: Tensors of the same index must be on the same device and the same dtype
#2764
opened May 10, 2024 by
yaswanthchittepu
Cuda Out of memory while loading PEFT weights using accelerate on multi gpu
#2760
opened May 10, 2024 by
sidtandon2014
2 of 4 tasks
Performance on single GPU is much better than on Multi-GPUs
#2754
opened May 8, 2024 by
baicenxiao
3 of 4 tasks
PicklingError: Can't pickle <function Embedding.forward at XXXXXXX> it's not the same object as torch.nn.modules.sparse.Embedding.forward
#2749
opened May 7, 2024 by
arpit2665
1 of 4 tasks
4-bit quantization cannot load weights to meta device for bias terms of the linear layer: NotImplementedError: Cannot copy out of meta tensor; no data!
#2742
opened May 5, 2024 by
MuhammedHasan
2 of 4 tasks
[Feature Request] Allows registering custom trackers to internal tracker type registry
Labels: enhancement (New feature or request), feature request (Request for a new feature to be added to Accelerate)
#2734
opened May 2, 2024 by
luowyang
You are trying to offload the whole model to the disk. Please use the disk_offload function instead.
#2726
opened Apr 30, 2024 by
Moazzamnamal
2 of 4 tasks
Training with PEFT + Accelerate randomly gets stuck with DeepSpeed after the first epoch
#2724
opened Apr 29, 2024 by
vikram71198