INDEX
Negative Logits
wears
0.58
pacif
0.52
Piero
0.52
Фи
0.52
shops
0.50
कॉलेज
0.50
Primeiro
0.50
ර
0.49
Medic
0.49
verwenden
0.48
POSITIVE LOGITS
pronged
0.49
<0x80>
0.47
visual
0.46
вна
0.45
flavor
0.45
position
0.45
fishing
0.43
skipped
0.43
text
0.42
vacancy
0.42
Activations Density 0.000%