Explanations
instances of articles and pronouns that indicate possession or connection
Negative Logits
abox        -0.17
abee        -0.16
otto        -0.15
BV          -0.15
Magic       -0.14
ÑģоÑĢ       -0.14
han         -0.14
orgen       -0.14
ger         -0.13
antro       -0.13
Positive Logits
986         0.16
Trace       0.15
³           0.15
rippling    0.14
.trace      0.14
Stevenson   0.14
otherwise   0.14
šek         0.14
stead       0.14
official    0.14
Activations Density 0.169%