Safetensors
qwen3

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

SFT Model for the paper "Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards"

GitHub arXiv Dataset & Model

Downloads last month
49
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for THU-KEG/DeepDive-4B-SFT

Quantizations
1 model

Collection including THU-KEG/DeepDive-4B-SFT

Paper for THU-KEG/DeepDive-4B-SFT