INDEX
Explanations
references to religious doctrine and church teachings
Negative Logits
ONO       -0.16
          -0.15
è©        -0.14
Äĥng      -0.14
eren      -0.14
onden     -0.14
VIP       -0.14
Thá»§     -0.14
å±Ĭ       -0.13
ĺìĿ´      -0.13
Positive Logits
405       0.16
zza       0.16
alat      0.15
اÛĮÙĦ    0.15
pins      0.14
mrt       0.14
lam       0.14
itive     0.14
kola      0.13
alive     0.13
Activations Density 0.069%
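
Several tokens in the logit lists above (for example `Thá»§` and `Äĥng`) look like byte-level BPE notation, where each raw byte of a token is displayed as one printable character. The sketch below assumes the GPT-2 style byte-to-character table, which this page does not confirm; the names `byte_to_char_table` and `decode_token` are hypothetical helpers introduced only for illustration. It maps each displayed character back to its byte and decodes the result as UTF-8.

```python
def byte_to_char_table():
    """Build the byte -> printable character table used by GPT-2 style
    byte-level BPE (assumed here, not confirmed by the dashboard)."""
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    printable_set = set(printable)
    table = {b: chr(b) for b in printable}
    n = 0
    # Every remaining byte is shifted to a code point at or above U+0100.
    for b in range(256):
        if b not in printable_set:
            table[b] = chr(256 + n)
            n += 1
    return table


# Inverse table: displayed character -> original byte value.
CHAR_TO_BYTE = {c: b for b, c in byte_to_char_table().items()}


def decode_token(token: str) -> str:
    """Turn a displayed token string back into readable text.

    Raises KeyError if the token contains a character outside the table
    (for example a literal space). Incomplete UTF-8 sequences become the
    Unicode replacement character instead of raising.
    """
    raw = bytes(CHAR_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for tok in ["eren", "Th\u00e1\u00bb\u00a7", "\u00c4\u0125ng"]:
        print(repr(tok), "->", repr(decode_token(tok)))
```

Plain ASCII tokens such as `eren` pass through unchanged. Tokens that hold only part of a multi-byte character, which `è©` appears to be under this assumption, cannot be fully recovered and show up with a replacement character.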