INDEX
Explanations
references to concentration camps, particularly the term "ghetto"
names associated with historical events or figures, specifically related to the ghettos and concentration camps during World War II
New Auto-Interp
Negative Logits
utherford
-0.73
VALUE
-0.72
elusive
-0.69
uyomi
-0.66
DEN
-0.66
foremost
-0.65
drm
-0.64
leaf
-0.63
investig
-0.63
deaf
-0.63
POSITIVE LOGITS
obl
0.91
oes
0.85
oad
0.83
icators
0.83
oaded
0.83
hett
0.81
angelo
0.78
odox
0.77
ographies
0.77
ode
0.76
Activations Density 0.008%