Explanations
mentions of political figures and leaders
Negative Logits
token        logit
76561        -0.69
ighters      -0.67
utenberg     -0.64
aves         -0.61
items        -0.58
contents     -0.57
Consumers    -0.57
Items        -0.57
bins         -0.56
Adren        -0.56
Positive Logits
token        logit
whom         1.06
who          1.00
named        0.97
willing      0.93
whose        0.85
persona      0.85
who          0.84
capable      0.83
overseeing   0.83
hunt         0.79
Activations Density 0.307%