INDEX
Explanations
historical dates and significant events related to Jewish persecution
New Auto-Interp
Negative Logits
erate
-0.20
ering
-0.19
eral
-0.17
abo
-0.17
oring
-0.17
arm
-0.16
ural
-0.16
ough
-0.16
ergy
-0.16
Ã
-0.15
POSITIVE LOGITS
nable
0.19
-cols
0.16
LOAT
0.15
리카
0.15
³
0.14
AGING
0.14
Kelly
0.14
adium
0.14
ingleton
0.14
onia
0.14
Activations Density 0.008%