INDEX
Negative Logits
icked
-0.08
Dona
-0.08
them
-0.07
said
-0.07
acclaimed
-0.07
ologically
-0.07
uted
-0.07
antly
-0.07
considerado
-0.07
outed
-0.07
POSITIVE LOGITS
Optim
0.07
ордин
0.07
geben
0.07
�
0.07
ర
0.07
antis
0.07
scroll
0.07
lässt
0.07
corrosion
0.07
underground
0.07
Activations Density 0.012%