Paper: ChessArena: A Chess Testbed for Evaluating Strategic Reasoning Capabilities of Large Language Models (arXiv:2509.24239)
This model is the stage-two supervised fine-tuning (SFT) model from the paper https://arxiv.org/abs/2509.24239.
Training dataset: https://huggingface.co/datasets/ljcnju/ChessArena_Training_Dataset
Training results (training and validation loss logged every 500 steps):
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.5441 | 0.5667 | 500 | 0.5742 |
| 0.462 | 1.1326 | 1000 | 0.5241 |
| 0.4573 | 1.6993 | 1500 | 0.4992 |
| 0.4061 | 2.2652 | 2000 | 0.4926 |
| 0.3979 | 2.8320 | 2500 | 0.4887 |
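
The logged values above can be summarized with a little arithmetic. The following is a minimal sketch, not part of the original training code: the helper names `steps_per_epoch`, `validation_deltas`, and `train_val_gaps` are hypothetical, and the rows are copied by hand from the table.

```python
# Rows copied from the results table: (training loss, epoch, step, validation loss).
rows = [
    (0.5441, 0.5667, 500, 0.5742),
    (0.4620, 1.1326, 1000, 0.5241),
    (0.4573, 1.6993, 1500, 0.4992),
    (0.4061, 2.2652, 2000, 0.4926),
    (0.3979, 2.8320, 2500, 0.4887),
]


def steps_per_epoch(rows):
    """Estimate optimizer steps per epoch from the last logged (epoch, step) pair."""
    _, epoch, step, _ = rows[-1]
    return step / epoch


def validation_deltas(rows):
    """Change in validation loss between consecutive evaluations (negative = improvement)."""
    losses = [val for _, _, _, val in rows]
    return [round(later - earlier, 4) for earlier, later in zip(losses, losses[1:])]


def train_val_gaps(rows):
    """Validation loss minus training loss at each logged step."""
    return [round(val - train, 4) for train, _, _, val in rows]


print("approx. steps per epoch:", round(steps_per_epoch(rows)))
print("validation loss deltas:", validation_deltas(rows))
print("train/validation gaps:", train_val_gaps(rows))
```

Read this way, the table suggests roughly 883 steps per epoch, a validation loss that keeps decreasing but by smaller amounts at each evaluation, and a gap between validation and training loss that is larger at step 2500 than at step 500. These are observations about the logged numbers only, not claims about downstream chess performance.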