INDEX
Negative Logits
Gets
-0.08
prison
-0.08
procur
-0.08
,两
-0.07
撃
-0.07
hil
-0.07
/raw
-0.07
Able
-0.07
empate
-0.07
primero
-0.07
POSITIVE LOGITS
WAS
0.08
ado
0.08
сер
0.07
(over
0.07
blatt
0.07
отв
0.07
िकल
0.07
keny
0.07
Christensen
0.07
.apple
0.07
Activations Density 0.002%