DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-gemini-2.5-flash-20260114_200318 Updated 13 minutes ago
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-mini-2025-08-07-20260114_203811 Updated 13 minutes ago
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-claude-haiku-4-5-20251001-20260114_164534 Updated about 2 hours ago
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gemini-2.5-flash-20260114_175612 Updated about 3 hours ago
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocf7b91126 Updated about 4 hours ago
DCAgent/eval-0d54f719f34dca712c8d6ef0f51df4670a2a287a-gpt-5-nano-2025-08-07-20260114_152435 Updated about 4 hours ago
DCAgent/eval-Qwen3-Coder-30B-A3B-Instruct_swebench-verified-random-100-folders Updated about 5 hours ago
DCAgent/eval-a2e51e9e0e8029156ed340719eb8cc7ceee3ed1a-claude-haiku-4-5-20251001-20260114_133343 Updated about 6 hours ago