INDEX
Explanations
references to armed forces, military activities, and related terms
instances of the word "armed."
New Auto-Interp
Negative Logits
rx
-0.96
article
-0.78
MAL
-0.78
rb
-0.78
RGB
-0.72
HUD
-0.71
Remastered
-0.71
arget
-0.70
Woodward
-0.69
MRI
-0.68
POSITIVE LOGITS
robbery
0.96
guards
0.95
armed
0.94
insurrection
0.92
uprising
0.84
ament
0.84
robbers
0.82
robber
0.80
robberies
0.80
revolt
0.80
Activations Density 0.021%