INDEX
Explanations
phrases related to emotional investment and character development in narratives
New Auto-Interp
Negative Logits
oad
-0.16
agon
-0.16
chw
-0.14
辺
-0.14
teri
-0.14
LS
-0.14
oter
-0.13
ame
-0.13
ìłģ
-0.13
essler
-0.13
POSITIVE LOGITS
wanting
0.16
how
0.16
being
0.16
things
0.15
how
0.15
neler
0.15
why
0.14
384
0.14
itol
0.14
AMPL
0.14
Activations Density 0.447%