Explanations
organizations and groups involved in societal or political issues
references to various stakeholders in social or political contexts
Negative Logits
Token        Logit
lasts        -0.70
fuse         -0.69
shalt        -0.68
utilizes     -0.67
achieves     -0.65
solves       -0.64
suffice      -0.64
prototype    -0.63
behaves      -0.60
exhibit      -0.60
Positive Logits
Token        Logit
worried      1.14
alarmed      1.10
outraged     1.07
concerned    1.04
frustrated   1.01
wary         1.01
who          1.00
angered      0.97
skeptical    0.97
unhappy      0.96
Activations Density: 0.335%