INDEX
Explanations
numerical values and monetary amounts
New Auto-Interp
Negative Logits
707
-0.15
397
-0.14
tring
-0.14
ãĥĨãĥ«
-0.14
603
-0.14
696
-0.14
enser
-0.14
ernes
-0.14
971
-0.13
اÛĮÙĩ
-0.13
POSITIVE LOGITS
850
0.28
84
0.27
82
0.27
855
0.27
81
0.27
800
0.26
844
0.25
85
0.25
877
0.24
83
0.24
Activations Density 0.037%