INDEX
Explanations
references to the Holocaust and related historical events
historical atrocities and extremism
New Auto-Interp
Negative Logits
fromnode
-0.54
Ringo
-0.44
SourceChecksum
-0.42
parson
-0.42
baseman
-0.40
Kyr
-0.39
Pixie
-0.38
fir
-0.38
Rango
-0.37
jdons
-0.37
POSITIVE LOGITS
Holocaust
2.36
locaust
2.05
holocaust
2.02
genocide
0.88
Genocide
0.75
Auschwitz
0.69
Jewish
0.68
genoc
0.68
Chernobyl
0.67
printStackTrace
0.65
Activations Density 0.003%