INDEX
Explanations
references to significant events or statistics related to Jewish history and persecution
New Auto-Interp
Negative Logits
rome
-0.18
anc
-0.17
lua
-0.15
495
-0.15
Tato
-0.15
Ø´ÙħاÙĦÛĮ
-0.14
lak
-0.14
sei
-0.14
orton
-0.14
spat
-0.14
POSITIVE LOGITS
Gain
0.17
grade
0.16
andom
0.15
å¥Ĺ
0.15
agos
0.14
konkrét
0.14
_PENDING
0.14
gain
0.14
اÛĮاÙĨ
0.14
айÑĤ
0.13
Activations Density 0.022%