INDEX
Explanations
the name "Paul" in various contexts
New Auto-Interp
Negative Logits
Anſ
-1.25
Monfieur
-1.18
pleaſure
-1.13
Reſ
-1.12
Bertie
-1.05
itſelf
-1.05
myſelf
-1.02
+#+#
-1.01
Houſe
-1.01
Diſ
-1.00
POSITIVE LOGITS
Paul
1.23
s
1.03
Paul
0.97
paul
0.78
van
0.77
Van
0.76
ς
0.75
Paulus
0.74
PAUL
0.69
(
0.66
Activations Density 0.106%