Explanations
references to the pronoun "you" in various contexts
Negative Logits
Token      Logit
рож        -0.16
ashtra     -0.14
uet        -0.14
amage      -0.14
akter      -0.14
-linear    -0.14
idente     -0.14
pdata      -0.13
stoff      -0.13
udder      -0.13
Positive Logits
Token      Logit
/us        0.23
-même      0.18
zelf       0.18
488        0.15
anz        0.15
ados       0.15
hib        0.15
Sphinx     0.15
self       0.14
ocard      0.14
Activations Density 0.104%
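
For readers unfamiliar with the density figure, here is a minimal sketch of how such a number could be computed. It assumes density means the fraction of sampled token positions on which the feature's activation is above zero; the page itself does not state the definition. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and used only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is the share of sampled tokens on which the
    feature fires at all; the source page does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 1,000 token positions, exactly one of which fires.
sample = [0.0] * 999 + [2.5]
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.104% would mean the feature fires on roughly one token in a thousand.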