INDEX
Explanations
words related to international relations and political events, particularly involving the United States
references to the United States and its actions
New Auto-Interp
Negative Logits
Virt
-0.77
Dickinson
-0.74
ãĥ£
-0.68
ersion
-0.67
andra
-0.66
iott
-0.64
brakes
-0.63
TAMADRA
-0.62
Cassidy
-0.60
xes
-0.60
POSITIVE LOGITS
taboola
0.98
division
0.81
webkit
0.80
_-
0.78
san
0.78
hap
0.77
alien
0.77
entimes
0.77
based
0.76
themed
0.76
Activations Density 0.010%