Explanations
instances of the name "Paul."
Negative Logits
pleaſure   -1.06
Anſ        -1.04
iſt        -1.03
ſche       -1.00
Monfieur   -0.99
Houſe      -0.96
preſent    -0.96
Diſ        -0.95
purpoſe    -0.93
Reſ        -0.93
Positive Logits
Paul       3.05
Paul       2.80
PAUL       2.38
paul       2.23
PAUL       2.16
paul       2.08
Paulus     1.39
Paulson    1.33
ポール      1.24
Pau        1.19
Activations Density 0.050%
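
The density figure above can be read as the share of token positions on which the feature fires. The page does not say how it is computed, so the following is only a minimal sketch under that assumption: the function name `activation_density`, the `threshold` parameter, and the sample activations are all hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of sampled tokens on which the
    feature is active. The dashboard itself does not define the term.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 1 token out of 2000.
acts = [0.0] * 1999 + [4.2]
print(f"{activation_density(acts):.3%}")
```

Under this reading, a density of 0.050% would correspond to roughly one active token per 2,000 sampled, which fits a feature that responds only to a specific name.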