INDEX
Explanations
technical specifications and measurements
New Auto-Interp
Negative Logits
plin
-0.67
atem
-0.64
amaru
-0.64
qqa
-0.64
orc
-0.63
endum
-0.62
ramid
-0.62
£ı
-0.60
rolet
-0.60
orem
-0.58
POSITIVE LOGITS
th
1.12
00
1.10
%
0.98
%"
0.93
nm
0.91
%-
0.91
nd
0.91
50
0.90
81
0.89
87
0.88
Activations Density 0.175%