Add Artificial Analysis evaluations for deepseek-v3-2

#1
Files changed (1) hide show
  1. README.md +54 -3
README.md CHANGED
@@ -1,3 +1,54 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ model-index:
4
+ - name: MackenzieTest
5
+ results:
6
+ - task:
7
+ type: evaluation
8
+ dataset:
9
+ name: Artificial Analysis Benchmarks
10
+ type: artificial_analysis
11
+ metrics:
12
+ - name: Artificial Analysis Intelligence Index
13
+ type: artificial_analysis_intelligence_index
14
+ value: 52.4
15
+ - name: Artificial Analysis Coding Index
16
+ type: artificial_analysis_coding_index
17
+ value: 42.8
18
+ - name: Artificial Analysis Math Index
19
+ type: artificial_analysis_math_index
20
+ value: 59
21
+ - name: Mmlu Pro
22
+ type: mmlu_pro
23
+ value: 0.837
24
+ - name: Gpqa
25
+ type: gpqa
26
+ value: 0.751
27
+ - name: Hle
28
+ type: hle
29
+ value: 0.105
30
+ - name: Livecodebench
31
+ type: livecodebench
32
+ value: 0.593
33
+ - name: Scicode
34
+ type: scicode
35
+ value: 0.387
36
+ - name: Aime 25
37
+ type: aime_25
38
+ value: 0.59
39
+ - name: Ifbench
40
+ type: ifbench
41
+ value: 0.49
42
+ - name: Lcr
43
+ type: lcr
44
+ value: 0.39
45
+ - name: Terminalbench Hard
46
+ type: terminalbench_hard
47
+ value: 0.305
48
+ - name: Tau2
49
+ type: tau2
50
+ value: 0.789
51
+ source:
52
+ name: Artificial Analysis API
53
+ url: https://artificialanalysis.ai
54
+ ---