INDEX
Explanations
criticisms of political figures and their actions regarding war and foreign policy
New Auto-Interp
Negative Logits
artz
-0.16
communist
-0.15
rink
-0.14
706
-0.14
burst
-0.14
Ada
-0.14
laus
-0.14
Babe
-0.14
underground
-0.13
Aub
-0.13
POSITIVE LOGITS
intervention
0.26
Empire
0.26
warm
0.25
empire
0.25
imperial
0.24
Intervention
0.24
imperialism
0.23
interventions
0.23
Imperial
0.23
neo
0.23
Activations Density 0.103%