INDEX
Explanations
the word "that" and references to individuals
New Auto-Interp
Negative Logits
itſelf
-1.96
myſelf
-1.89
Efq
-1.77
purpoſe
-1.74
―――――
-1.74
Majefty
-1.69
Monfieur
-1.68
doubtnut
-1.68
himſelf
-1.63
themſelves
-1.61
POSITIVE LOGITS
1.07
That
1.07
That
1.06
<eos>
0.97
(
0.95
что
0.93
I
0.91
that
0.85
.
0.85
,
0.85
Activations Density 0.118%