INDEX
Negative Logits
knock
-0.08
_MUT
-0.08
sm
-0.07
cumbersome
-0.07
kinetics
-0.07
Knock
-0.07
astr
-0.07
mutations
-0.07
�
-0.07
Knights
-0.07
POSITIVE LOGITS
politely
0.09
.Unsupported
0.09
/not
0.08
/Error
0.08
dym
0.08
apology
0.08
Sorry
0.08
કહી
0.08
wegens
0.08
مني
0.08
Activations Density 0.016%