INDEX
Explanations
instances of significant transitions or contrasts in narratives
New Auto-Interp
Negative Logits
ivent
-0.19
ombo
-0.18
nar
-0.15
gh
-0.15
ouver
-0.15
amon
-0.15
mute
-0.15
omen
-0.15
abel
-0.14
Overnight
-0.14
POSITIVE LOGITS
.internet
0.15
podob
0.15
oppins
0.14
iping
0.14
peter
0.14
chie
0.14
ammable
0.13
RL
0.13
igy
0.13
pip
0.13
Activations Density 0.001%