Explanations
HTML heading tags and other structural markers in the text
Negative Logits
Token    Logit
.        -0.57
her      -0.50
-        -0.49
AND      -0.48
I        -0.48
Pe       -0.48
T        -0.48
Brock    -0.48
S        -0.47
’        -0.46
Positive Logits
Token        Logit
ſche         1.15
Houſe        1.10
itſelf       1.10
myſelf       1.07
greateſt     1.03
houſe        1.01
Chriftian    1.00
Anſ          1.00
ſta          1.00
pleaſure     1.00
Activations Density 0.141%