INDEX
Explanations
words and phrases related to physical violence and human rights violations
terms related to severe violence and human rights abuses
New Auto-Interp
Negative Logits
inventoryQuantity
-0.80
natureconservancy
-0.78
hindsight
-0.74
Investor
-0.74
Regulation
-0.73
predecessor
-0.73
disclaimer
-0.73
optimism
-0.72
Collider
-0.72
ãĤ´ãĥ³
-0.72
POSITIVE LOGITS
raped
1.06
rapes
1.05
prostitutes
1.01
raping
0.99
indiscrim
0.98
rape
0.93
starvation
0.90
tortured
0.90
prisoners
0.88
corpses
0.88
Activations Density 0.484%