INDEX
Negative Logits
Tad
-0.08
aange
-0.08
�
-0.08
touted
-0.08
frequ
-0.08
방송
-0.07
outed
-0.07
coord
-0.07
Duft
-0.07
engineering
-0.07
POSITIVE LOGITS
helpless
0.12
impotence
0.11
impot
0.10
powerless
0.10
无法
0.09
catastrophic
0.09
挣
0.08
immobil
0.08
inability
0.08
hope
0.08
Activations Density 0.011%