DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_0a0458a3 Viewer • Updated about 5 hours ago • 764
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_0a0458a3 Viewer • Updated about 5 hours ago • 764
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_c267e2e6 Viewer • Updated about 9 hours ago • 509
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_c267e2e6 Viewer • Updated about 9 hours ago • 509
DCAgent/eval-terminal-bench-2.0-claude-haiku-4-5-20251001-20260115_165217 Viewer • Updated about 10 hours ago • 272
DCAgent/eval-terminal-bench-2.0-claude-haiku-4-5-20251001-20260115_165217 Viewer • Updated about 10 hours ago • 272
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_a60f4588 Viewer • Updated about 11 hours ago • 371
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_learning-rate_a60f4588 Viewer • Updated about 11 hours ago • 371
penfever/Kimi-K2T-neulab-agenttuning-mind2web-sandboxes-maxeps-32k_neulab-agenttuning-db-sandboxes Viewer • Updated about 16 hours ago • 15.8k • 5
penfever/Kimi-K2T-neulab-agenttuning-mind2web-sandboxes-maxeps-32k_neulab-agenttuning-db-sandboxes Viewer • Updated about 16 hours ago • 15.8k • 5
penfever/Kimi-K2T-neulab-agenttuning-kg-sandboxes-maxeps-32k_neulab-agenttuning-kg-sandboxes Viewer • Updated about 16 hours ago • 7.95k • 5
penfever/Kimi-K2T-neulab-agenttuning-kg-sandboxes-maxeps-32k_neulab-agenttuning-kg-sandboxes Viewer • Updated about 16 hours ago • 7.95k • 5