INDEX
Explanations
references to exorcisms or related religious practices
Negative Logits
ãģłãģª      -0.15
еÑĢеж      -0.14
ç®          -0.14
ãĥ¼ãĥ©      -0.14
ãĤ¤ãĥ¤      -0.14
oug         -0.14
cuckold     -0.14
gue         -0.14
husbands    -0.13
妻          -0.13
Positive Logits
Pope        0.39
pap         0.39
pope        0.37
pont        0.35
пап         0.33
Benedict    0.31
Vatican     0.28
Pap         0.28
Pont        0.27
Francis     0.27
Activations Density 0.141%
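
The density figure above is presumably the fraction of sampled token positions on which the feature fires. That reading is an assumption about this dashboard, not something the page states. Under that assumption, a minimal sketch of the calculation follows; the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumes "density" means the share of positions where the feature is active;
    the dashboard itself does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 100,000 token positions, 141 of which activate the feature.
sample = [0.0] * 99_859 + [1.5] * 141
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the feature is sparse: it stays silent on the vast majority of tokens and fires only in a narrow set of contexts.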