INDEX
Explanations
references to morality in contexts of competition or rivalry
Negative Logits
earing    -0.14
such      -0.14
latter    -0.14
iesel     -0.14
ilo       -0.14
ALSO      -0.13
ë²Į       -0.13
ãĢIJ       -0.13
è²        -0.13
braco     -0.13
Positive Logits
period    0.68
Period    0.64
period    0.60
Period    0.57
-period   0.51
.period   0.49
periods   0.46
_period   0.43
plain     0.42
(period   0.42
Activations Density 0.302%