Spaces:
Running
Running
zhimin-z commited on
Commit ·
2c7ae76
1
Parent(s): d2d0d14
add
Browse files
README.md
CHANGED
|
@@ -14,7 +14,7 @@ short_description: Model arena for software engineering tasks
|
|
| 14 |
|
| 15 |
# SWE-Model-Arena
|
| 16 |
|
| 17 |
-
An open-source platform for evaluating
|
| 18 |
|
| 19 |
**[Try it on Hugging Face Spaces](https://huggingface.co/spaces/SWE-Arena/SWE-Model-Arena)**
|
| 20 |
|
|
|
|
| 14 |
|
| 15 |
# SWE-Model-Arena
|
| 16 |
|
| 17 |
+
An open-source platform for evaluating tool-calling models head-to-head. Both sides share the **same scaffolding** ([opencode](https://opencode.ai)) with identical tools, prompts, and sandboxed environments — the **only variable** is the underlying tool-calling model.
|
| 18 |
|
| 19 |
**[Try it on Hugging Face Spaces](https://huggingface.co/spaces/SWE-Arena/SWE-Model-Arena)**
|
| 20 |
|
app.py
CHANGED
|
@@ -2053,7 +2053,7 @@ with gr.Blocks(title="SWE-Model-Arena", theme=gr.themes.Soft()) as app:
|
|
| 2053 |
|
| 2054 |
with gr.Tab("⚔️Arena"):
|
| 2055 |
gr.Markdown("# ⚔️ SWE-Model-Arena")
|
| 2056 |
-
gr.Markdown("Blind head-to-head
|
| 2057 |
|
| 2058 |
gr.Markdown("### 📜 How It Works")
|
| 2059 |
gr.Markdown(
|
|
|
|
| 2053 |
|
| 2054 |
with gr.Tab("⚔️Arena"):
|
| 2055 |
gr.Markdown("# ⚔️ SWE-Model-Arena")
|
| 2056 |
+
gr.Markdown("Blind head-to-head tool-calling model comparison — same scaffold (opencode), different tool-calling LLM")
|
| 2057 |
|
| 2058 |
gr.Markdown("### 📜 How It Works")
|
| 2059 |
gr.Markdown(
|