INDEX
Explanations
pronouns and their usage in dialogue or indirect speech
Negative Logits
quate       -0.15
usto        -0.15
.datas      -0.15
evin        -0.15
ulfill      -0.14
quip        -0.14
uve         -0.14
_MAXIMUM    -0.14
ihn         -0.14
lements     -0.13
Positive Logits
how          0.32
about        0.29
what         0.28
why          0.28
it           0.27
everything   0.27
something    0.25
they         0.25
stories      0.25
exactly      0.24
Activations Density: 0.088%