INDEX
Explanations
instances of the word "wonderful."
New Auto-Interp
Negative Logits
houſe
-1.09
pleaſure
-1.08
Houſe
-1.06
greateſt
-1.06
faſt
-1.06
Reſ
-1.05
Efq
-1.05
Anſ
-1.05
ſever
-1.04
Majefty
-1.02
POSITIVE LOGITS
L
0.69
0.64
I
0.61
He
0.60
El
0.60
(
0.59
T
0.57
is
0.56
addComponent
0.55
did
0.54
Activations Density 0.143%