INDEX
Explanations
references to historical events and conditions affecting Jewish populations during and after World War II
Negative Logits
arya          -0.13
ève           -0.13
?)↵           -0.13
ï¼ı↵          -0.12
è³Ģ           -0.12
_triggered    -0.12
ayer          -0.12
cÃŃt          -0.12
sume          -0.12
?")↵          -0.12
Positive Logits
mais          0.24
et            0.24
car           0.23
tand          0.23
afin          0.21
;             0.21
.             0.20
puis          0.19
quand         0.17
;             0.17
Activations Density: 0.057%