INDEX
Explanations
phrases related to war, conflict, and atrocities
references to historical atrocities related to concentration camps and forced labor
New Auto-Interp
Negative Logits
wcsstore
-0.84
soDeliveryDate
-0.75
WS
-0.71
Toast
-0.69
RC
-0.67
VP
-0.67
Prediction
-0.67
ASH
-0.67
Shares
-0.66
Drivers
-0.66
POSITIVE LOGITS
Auschwitz
1.24
inmates
1.07
detainees
1.05
chwitz
1.04
torture
1.03
confinement
0.99
dehuman
0.99
Holocaust
0.98
ocaust
0.97
gul
0.96
Activations Density 0.139%