INDEX
Explanations
mentions of the Pope
occurrences of the word "Pope."
New Auto-Interp
Negative Logits
nesota
-0.82
ript
-0.81
yrinth
-0.79
awks
-0.79
ership
-0.70
sbm
-0.70
ties
-0.68
stract
-0.67
rg
-0.66
awk
-0.66
POSITIVE LOGITS
Francis
1.15
Benedict
0.94
Pope
0.82
Pablo
0.80
Pope
0.77
pope
0.74
Father
0.74
Clement
0.74
otle
0.73
Patriarch
0.71
Activations Density 0.010%