INDEX
Explanations
interactions between characters and their actions in a narrative context
New Auto-Interp
Negative Logits
asaki
-0.16
pari
-0.16
vary
-0.15
pez
-0.15
chet
-0.15
elon
-0.14
imson
-0.14
ostel
-0.14
acz
-0.14
Guerrero
-0.14
POSITIVE LOGITS
Orc
0.15
anh
0.15
dou
0.14
icorn
0.14
Dou
0.14
OWL
0.14
celed
0.14
-aos
0.14
ldb
0.14
гл
0.14
Activations Density 0.351%