INDEX
Explanations
mentions of the word "pap" or related words like "papal" in various contexts
references to papal topics or the Pope
New Auto-Interp
Negative Logits
hirt
-0.82
UCT
-0.82
VE
-0.71
IDES
-0.70
IGHTS
-0.70
everal
-0.69
terday
-0.68
Wonderland
-0.66
é¾
-0.66
CONCLUS
-0.64
POSITIVE LOGITS
yrus
1.47
rika
1.19
ular
1.03
aya
1.03
ilion
1.02
acy
1.02
arella
1.01
oleon
1.01
illion
0.99
illon
0.97
Activations Density 0.042%