INDEX
Negative Logits
MIN
-0.07
(shell
-0.07
nationalism
-0.07
-0.06
manned
-0.06
CAF
-0.06
translation
-0.06
Michigan
-0.06
Social
-0.06
(Table
-0.06
POSITIVE LOGITS
oro
0.30
оро
0.14
Toro
0.13
oro
0.12
Oro
0.11
Soros
0.09
boro
0.08
oso
0.07
orro
0.07
Corona
0.07
Activations Density 0.005%