INDEX
Negative Logits
raud
-0.19
ij
-0.15
acades
-0.15
roundup
-0.14
rokes
-0.14
commons
-0.14
že
-0.14
adır
-0.14
hell
-0.14
kad
-0.13
POSITIVE LOGITS
IFE
0.18
ife
0.17
ucer
0.16
Aph
0.15
Hang
0.15
omba
0.15
Gesture
0.14
_cpus
0.14
Section
0.14
ayan
0.14
Activations Density 0.000%