INDEX
Explanations
character names and relational dynamics in stories
New Auto-Interp
Negative Logits
also
-0.14
blink
-0.14
anche
-0.14
mote
-0.14
edback
-0.14
Also
-0.14
subsequent
-0.14
ãģ«ãĤĤ
-0.13
itol
-0.13
ynet
-0.13
POSITIVE LOGITS
wake
0.23
decide
0.22
Wake
0.22
wakes
0.21
woke
0.21
suddenly
0.21
woke
0.20
decides
0.20
decided
0.20
innoc
0.19
Activations Density 0.291%