INDEX
Explanations
references to the Shabbat or related Jewish rituals and symbols
New Auto-Interp
Negative Logits
oca
-0.18
zin
-0.16
ropolis
-0.16
anova
-0.15
zsche
-0.15
HLT
-0.15
yp
-0.15
çŃĴ
-0.15
oure
-0.15
lanc
-0.14
POSITIVE LOGITS
abb
0.23
lish
0.23
алом
0.20
mini
0.19
mary
0.19
ABB
0.19
lich
0.19
aul
0.18
ñana
0.18
vat
0.18
Activations Density 0.008%