INDEX
Negative Logits
efficiently
-0.08
strategically
-0.08
estratégia
-0.07
lista
-0.07
Horr
-0.07
estratégica
-0.07
trucks
-0.07
Lic
-0.07
gana
-0.07
戰
-0.07
POSITIVE LOGITS
attribut
0.11
speculation
0.11
stereotypes
0.11
conclusions
0.10
sensational
0.10
blaming
0.10
interpretations
0.10
accusations
0.10
caution
0.09
prejudice
0.09
Activations Density 0.084%