INDEX
Explanations
dialogue and conversational interactions between characters
New Auto-Interp
Negative Logits
undy
-0.18
ód
-0.16
esser
-0.16
оном
-0.15
Compliance
-0.15
aph
-0.15
è£
-0.14
ÙģØ§Øª
-0.14
appa
-0.14
IGO
-0.14
POSITIVE LOGITS
opc
0.14
vac
0.14
opo
0.14
upe
0.14
ipeg
0.13
Roc
0.13
arent
0.13
Jac
0.13
á»įc
0.13
éĽ
0.13
Activations Density 0.212%