INDEX
Explanations
references to historical events and figures associated with World War II and the Holocaust
New Auto-Interp
Negative Logits
ilon
-0.17
onium
-0.16
ιαÏĤ
-0.15
leniyor
-0.15
ofday
-0.14
edio
-0.14
GuidId
-0.14
regnum
-0.14
pter
-0.14
tica
-0.14
POSITIVE LOGITS
previous
0.22
Previous
0.21
former
0.21
earlier
0.21
Previous
0.20
previous
0.19
erst
0.19
æĽ¾
0.18
سابÙĤ
0.17
original
0.16
Activations Density 0.386%