INDEX
Explanations
numerical data or quantitative metrics
New Auto-Interp
Negative Logits
Forty
-0.26
Fifty
-0.26
forty
-0.23
sixty
-0.22
seventy
-0.20
64
-0.20
fifty
-0.20
.ZERO
-0.19
<quote
-0.18
вÑĸз
-0.18
POSITIVE LOGITS
02
0.45
03
0.45
04
0.45
06
0.45
09
0.44
07
0.44
05
0.44
08
0.43
01
0.42
2
0.41
Activations Density 0.116%