Explanations
mentions of the name "Paul."
Negative Logits
Token       Logit
pleaſure    -1.23
Anſ         -1.23
iſt         -1.19
preſent     -1.15
Diſ         -1.14
purpoſe     -1.12
Monfieur    -1.12
Reſ         -1.11
ſche        -1.11
ſta         -1.11
Positive Logits
Token       Logit
Paul        2.14
Paul        1.90
PAUL        1.56
paul        1.45
paul        1.39
PAUL        1.38
Paulson     1.14
Paulus      1.05
Paulina     1.05
ポール      1.00
Activations Density 0.055%
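
The density figure can be read as the share of token positions on which the feature fires at all. The sketch below is a minimal illustration under that assumption; the function name, the zero threshold, and the sample data are hypothetical and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature
    is active (activation > threshold). The dashboard may define it differently.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: the feature fires on 11 of 20,000 token positions.
sample = [0.0] * 19989 + [1.5] * 11
density = activation_density(sample)
print(f"{density:.3%}")
```

A density this low would mean the feature is active on roughly one token in every 1,800, which fits a feature tied to a single name.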