INDEX
Negative Logits
Nir
-0.07
ejected
-0.07
flowering
-0.06
influence
-0.06
On
-0.06
Thus
-0.06
ICE
-0.06
труда
-0.06
.program
-0.06
bilingual
-0.06
POSITIVE LOGITS
desperate
0.16
desperately
0.13
desperation
0.12
urgency
0.07
crackdown
0.07
dpi
0.07
dying
0.07
.RESULT
0.07
stack
0.07
urgent
0.07
Activations Density 0.004%