-
Notifications
You must be signed in to change notification settings - Fork 27.2k
Issues: huggingface/transformers
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Training config that worked with transformers v4.4.6.3 results in OOM error with v4.47.0 (using SFTTrainer)
bug
#35108
opened Dec 5, 2024 by
jjbuck
2 of 4 tasks
Error during training: "Expected dtype float for end but got dtype c10::BFloat16"
bug
#35106
opened Dec 5, 2024 by
jjbuck
2 of 4 tasks
Running AG and SD when assistant and target models are on different devices
bug
Generation
#35099
opened Dec 5, 2024 by
jmamou
2 of 4 tasks
TextIteratorStreamer unable to create generator_kwargs
bug
#35098
opened Dec 5, 2024 by
ivanhe123
2 of 4 tasks
Is there a way to find the earliest version of transformers that has a certain model?
Feature request
Request for a new feature
#35097
opened Dec 5, 2024 by
ZihaoZheng98
Documentation for SWAG contradicts itself when constructing the first sentence.
bug
#35095
opened Dec 5, 2024 by
bauwenst
2 of 4 tasks
CausalLM loss function throws runtime error in multi-gpu setup
bug
#35086
opened Dec 4, 2024 by
xspirus
2 of 4 tasks
The dot in the model name when using auto_map will cause a path parsing error.
bug
#35082
opened Dec 4, 2024 by
hvlgo
2 of 4 tasks
Deprecated
shard_checkpoint
's replacement save_torch_state_dict
does not save tied embeddings
bug
#35080
opened Dec 4, 2024 by
casper-hansen
4 tasks
Bug of self.accelerator.gather(num_items_in_batch) with enabling average_tokens_across_devices
#35076
opened Dec 4, 2024 by
Snowdar
When extending embeddings, multivariate distribution isn't correctly estimated even when the calculated sigma matrix is symmetric and positive definite
bug
#35075
opened Dec 4, 2024 by
MayStepanyan
1 of 4 tasks
trainer.evaluate
always creates a new MLFlow run, separate from the one used during train()
bug
#35074
opened Dec 4, 2024 by
nathan-az
2 of 4 tasks
Multiple training runs not working with deepspeed
bug
#35073
opened Dec 4, 2024 by
H-Simpson123
2 of 4 tasks
issues when i change the lm_head to a 32 node layer
bug
#35071
opened Dec 4, 2024 by
SoSongzhi
2 of 4 tasks
Make it possible to only save best model (not last checkpoint)
Feature request
Request for a new feature
#35070
opened Dec 4, 2024 by
umarbutler
Bug in running facebook/wav2vec2-xlsr-53-espeak-cv-ft
bug
#35064
opened Dec 3, 2024 by
aldazero
2 of 4 tasks
SequenceClassification for all Model types should have the option to add weights in Cross Entropy loss
Feature request
Request for a new feature
#35061
opened Dec 3, 2024 by
SimonStanley1
Get "NotImplementedError: Cannot copy out of meta tensor; no data!" error while deploying model
bug
#35057
opened Dec 3, 2024 by
hawkiyc
2 of 4 tasks
ImportError: cannot import name 'HfApiEngine' from 'transformers'
bug
#35051
opened Dec 3, 2024 by
cluiverto
4 tasks
Incorrect hardcoded consolidated.pth path for Llama 3.2 11B Vision+Instruct Model
#35049
opened Dec 3, 2024 by
strangiato
Enable Quantize KV Cache for Mistral Model
Feature request
Request for a new feature
#35041
opened Dec 2, 2024 by
Bojun-Feng
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.