Negative Logits
anager     -0.17
pies       -0.16
xeb        -0.16
datable    -0.16
SES        -0.15
incare     -0.15
metavar    -0.14
arest      -0.14
elian      -0.14
ses        -0.14
Positive Logits
optera     0.28
gio        0.21
sterol     0.21
cción      0.20
lla        0.20
opt        0.20
fax        0.20
brook      0.19
ção        0.19
ccion      0.19
Activations Density: 0.005%