Configuration issue with tie_word_embeddings when using trl GRPOTrainer with vLLM
#9 opened 12 days ago
by
DimensionSTP
new predict
#8 opened about 1 month ago
by
tangyu12143
Best non-thinking model qwen ever released
๐
1
#7 opened 3 months ago
by
BigBlueWhale
AWQ version
#6 opened 3 months ago
by
cormak
Recommended model parameters
#5 opened 4 months ago
by
BigBlueWhale
GGUFs have buggy Japanese OCR performance
9
#4 opened 4 months ago
by
TPH441
'Qwen3VLConfig' object has no attribute 'vocab_size'
๐
2
1
#3 opened 4 months ago
by
vladciocan88
How about running by llama.cpp
2
#1 opened 4 months ago
by
rosspanda0