INDEX
Explanations
elements associated with storytelling and narrative structure
New Auto-Interp
Negative Logits
غاÙĦ
-0.16
živ
-0.14
anness
-0.14
вок
-0.14
osomal
-0.14
rita
-0.13
stderr
-0.13
è¼Ŀ
-0.13
utz
-0.13
atown
-0.13
POSITIVE LOGITS
die
0.36
discrim
0.28
fans
0.28
discern
0.28
lovers
0.27
devoted
0.25
lover
0.25
die
0.25
devote
0.24
Die
0.24
Activations Density 0.340%