INDEX
Negative Logits
smashing
-0.08
醉
-0.08
Kapoor
-0.08
drunken
-0.08
Wal
-0.08
pornography
-0.08
mood
-0.08
Tumblr
-0.08
psychiatric
-0.08
petty
-0.08
POSITIVE LOGITS
boundary
0.10
контакт
0.09
संपर्क
0.09
kontakt
0.09
liner
0.09
airflow
0.09
Boundary
0.09
_boundary
0.08
negotiated
0.08
transfer
0.08
Activations Density 0.010%