INDEX
Negative Logits
astered
-0.07
appealed
-0.06
atención
-0.06
�
-0.06
("-0.06
ornament
-0.06
Property
-0.06
Unsigned
-0.06
=val
-0.06
_corr
-0.06
POSITIVE LOGITS
inating
0.07
نب
0.07
면
0.07
cel
0.06
meinen
0.06
schop
0.06
dread
0.06
صح
0.06
Ç
0.06
б
0.06
Activations Density 0.005%