Improve model card: Add library, links, and usage example
#1
by
nielsr
HF Staff
- opened
This PR significantly enhances the model card for sunblaze-ucb/Qwen3-14B-GRPO-MATH-1EPOCH by:
- Updating the main heading to reflect the full model ID:
sunblaze-ucb/Qwen3-14B-GRPO-MATH-1EPOCH. - Adding
library_name: transformersto the metadata, enabling the "how to use" widget and improving discoverability. - Including descriptive
tagssuch asreinforcement-learning,llm,reasoning, andmathfor better categorization. - Providing an expanded model description based on the paper abstract and project details, giving users a better understanding of the model and the underlying "Intuitor" and "RLIF" frameworks.
- Adding explicit links to the paper, the project page, and the GitHub repository for easy access to source materials and code.
- Including a clear Python code snippet for sample usage with the
transformerslibrary, making it easier for users to get started with inference.
Please review and merge this PR to improve the model's visibility and usability on the Hugging Face Hub.
Xuandong
changed pull request status to
merged