bknyaz committed on
Commit
281e4ef
·
verified ·
1 Parent(s): 8b31005

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -20,7 +20,7 @@ This reduction is achieved by the REAM method described in https://bknyaz.github
 
  **Compared to other models obtained in this collection, more code data is used in the calibration data during pruning/merging
  to better preserve original's model coding abilities. Specifically, the ratio between c4, math and coding data (see https://bknyaz.github.io/blog/2026/moe/) is 0.0, 0.3, 0.7.
- The calibration data used here is the same as in our [Qwen3-Coder-Next-REAP](https://huggingface.co/SamsungSAILMontreal/Qwen3-Coder-Next-REAP).
+ The calibration data used here is the same as in [Qwen3-Coder-Next-REAP](https://huggingface.co/SamsungSAILMontreal/Qwen3-Coder-Next-REAP).
  Compared to other REAM models, here we used C=32 (number of experts in groups) instead of C=16, which we found to work better.**
 
  The compressed model has 60B params (120GB) instead of 80B (160GB) of the original model,