Stabilizing Reinforcement Learning with LLMs: Formulation and Practices Paper • 2512.01374 • Published Dec 1, 2025 • 94
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025 • 195
LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods Paper • 2412.05579 • Published Dec 7, 2024 • 2
Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions Paper • 2503.00501 • Published Mar 1, 2025 • 12