Negative Logits
(Sender     -0.08
(process    -0.08
TEXT        -0.08
estates     -0.08
ibet        -0.07
Dell        -0.07
Ler         -0.07
인          -0.07
(resolve    -0.07
Blocking    -0.07
Positive Logits
kow         0.09
gp          0.08
美容        0.08
gay         0.07
wedding     0.07
illness     0.07
ಬೆಳ         0.07
morally     0.07
sangre      0.07
gland       0.07
Activations Density 0.000%