INDEX
Explanations
proper nouns of people named "Paul" or similar variations
the name "Paul."
New Auto-Interp
Negative Logits
2048
-0.70
Democr
-0.66
Goddess
-0.65
Reincarn
-0.62
åĤ
-0.62
RFC
-0.62
Stim
-0.61
NX
-0.61
AUTH
-0.60
Fiscal
-0.60
POSITIVE LOGITS
iflower
1.46
aul
1.23
ascus
1.04
oard
0.98
iott
0.95
anth
0.94
inations
0.92
eson
0.90
terday
0.89
inator
0.89
Activations Density 0.007%