INDEX
Explanations
character names and interactions in narratives
New Auto-Interp
Negative Logits
ãĤ¤ãĥ³ãĥĪ
-0.17
访
-0.16
ednou
-0.16
ãĤ¸ãĤª
-0.16
_traits
-0.15
ãĥĨãĥ«
-0.15
utdown
-0.15
omik
-0.15
theon
-0.14
gger
-0.14
POSITIVE LOGITS
ind
0.15
961
0.15
Jones
0.15
panorama
0.14
arg
0.14
expend
0.14
180
0.14
as
0.14
[
0.14
in
0.14
Activations Density 0.136%