INDEX
Explanations
mentions of religious figures, especially popes and their activities
references to different Popes and the Vatican
New Auto-Interp
Negative Logits
ript
-0.80
nesota
-0.79
selves
-0.71
Brawl
-0.69
Wonderland
-0.69
ÑĮ
-0.67
uld
-0.66
aze
-0.65
mental
-0.64
hooting
-0.64
POSITIVE LOGITS
Francis
1.37
Benedict
1.18
Franc
0.94
Pope
0.91
XVI
0.89
Pope
0.88
pont
0.87
pope
0.86
Clement
0.84
Archbishop
0.83
Activations Density 0.025%