INDEX
Explanations
proper nouns related to historical figures and geographic locations
references to the concept of a pontiff or pope
New Auto-Interp
Negative Logits
ITNESS
-0.87
76561
-0.70
encers
-0.70
EStream
-0.70
encer
-0.69
berman
-0.67
ARE
-0.66
ggies
-0.66
oulder
-0.66
VIDE
-0.64
POSITIVE LOGITS
Pont
1.21
Pont
1.19
ificate
1.10
pont
0.93
ified
0.92
unia
0.86
rovers
0.83
iac
0.83
ific
0.79
anus
0.79
Activations Density 0.005%