INDEX
Explanations
mentions of or references to the word "Saint"
references to saint-related terms or entities
New Auto-Interp
Negative Logits
HO
-0.72
PT
-0.72
ORN
-0.67
ERG
-0.64
BIP
-0.63
sled
-0.62
clips
-0.61
itsch
-0.60
mishand
-0.60
mable
-0.59
POSITIVE LOGITS
Laurent
1.05
Lucia
0.99
Clair
0.94
Petersburg
0.93
clair
0.91
Augustine
0.90
Louis
0.90
Francis
0.84
Kitt
0.81
ctuary
0.80
Activations Density 0.016%