How to use Alibaba-NLP/GVE-3B with Transformers:
    # Load model directly
    from transformers import AutoProcessor, AutoModel

    processor = AutoProcessor.from_pretrained("Alibaba-NLP/GVE-3B", trust_remote_code=True)
    model = AutoModel.from_pretrained("Alibaba-NLP/GVE-3B", trust_remote_code=True)
Are there any current methods to speed up inference?
#5
by zhouchongqin - opened
At present, the model doesn't support vLLM. Are there any current methods to speed up inference?
Thanks for your interest. Unfortunately, GVE does not currently support any inference acceleration framework. You can still run it with Transformers and parallelize inference across multiple GPUs.
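One way to read the multi-GPU suggestion is plain data parallelism: load one copy of the model per GPU and give each copy its own slice of the inputs. The following is only a minimal sketch under that assumption, not an official GVE recipe. The helper names (`shard`, `merge`, `worker`, `run_parallel`) and the `embed_batch` callable are hypothetical. `embed_batch` stands in for whatever embedding code you already run on a single GPU, since the model's custom-code API is not described in this thread.

```python
from typing import Callable, List, Sequence

MODEL_ID = "Alibaba-NLP/GVE-3B"


def shard(items: Sequence, num_shards: int) -> List[list]:
    """Round-robin split: shard i receives items i, i + n, i + 2n, ..."""
    if num_shards < 1:
        raise ValueError("num_shards must be at least 1")
    return [list(items[i::num_shards]) for i in range(num_shards)]


def merge(shards: Sequence[Sequence]) -> list:
    """Inverse of shard(): restore the original input order."""
    n = len(shards)
    total = sum(len(s) for s in shards)
    return [shards[j % n][j // n] for j in range(total)]


def worker(rank: int, world_size: int, inputs: list,
           embed_batch: Callable, out_dir: str) -> None:
    """Runs in one process per GPU; `rank` is supplied by mp.spawn."""
    # Heavy imports stay inside the worker so the helpers above
    # can be used without torch installed.
    import torch
    from transformers import AutoModel, AutoProcessor

    device = torch.device(f"cuda:{rank}")
    processor = AutoProcessor.from_pretrained(MODEL_ID, trust_remote_code=True)
    model = AutoModel.from_pretrained(MODEL_ID, trust_remote_code=True)
    model = model.to(device).eval()

    my_inputs = shard(inputs, world_size)[rank]
    with torch.inference_mode():
        # Hypothetical: your own single-GPU embedding routine goes here.
        embeddings = embed_batch(model, processor, my_inputs, device)
    torch.save(embeddings, f"{out_dir}/embeddings_rank{rank}.pt")


def run_parallel(inputs: Sequence, embed_batch: Callable, out_dir: str) -> None:
    """Start one worker process per visible GPU and wait for all of them."""
    import torch
    import torch.multiprocessing as mp

    world_size = torch.cuda.device_count()
    if world_size < 1:
        raise RuntimeError("no CUDA devices are visible")
    mp.spawn(worker,
             args=(world_size, list(inputs), embed_batch, out_dir),
             nprocs=world_size, join=True)
```

Because `mp.spawn` pickles its arguments, `embed_batch` needs to be a top-level function in an importable module, and `run_parallel` should be called from under an `if __name__ == "__main__":` guard. Each rank writes its own file. Once the per-rank results are loaded and converted to per-item lists, `merge` puts them back in the original input order.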