INDEX
Explanations
passages discussing religious practices and beliefs
Negative Logits
scripture       -0.07
jist            -0.07
aks             -0.06
mina            -0.06
reon            -0.06
PropertyValue   -0.06
ække            -0.06
humble          -0.06
zyst            -0.06
alan            -0.06
Positive Logits
Masc            0.07
nice            0.07
Klein           0.07
انگ             0.07
cz              0.06
удоб            0.06
Nice            0.06
NET             0.06
Independent     0.06
lv              0.06
Activations Density 0.060%