INDEX
Explanations
the preposition "of"
"One of" followed by a possessive pronoun
New Auto-Interp
Negative Logits
Theſe
-0.88
myſelf
-0.87
itſelf
-0.86
Monfieur
-0.86
Efq
-0.86
RSSSF
-0.84
fometimes
-0.84
Anſ
-0.82
himſelf
-0.82
becauſe
-0.82
POSITIVE LOGITS
those
0.65
many
0.62
few
0.61
wenigen
0.60
my
0.53
them
0.52
tanti
0.51
few
0.49
our
0.48
these
0.48
Activations Density 0.084%