INDEX
Explanations
dialogues and conversational exchanges between characters
Negative Logits
è¶        -0.14
ymb       -0.14
habi      -0.14
mada      -0.14
pedo      -0.14
умов      -0.13
gesi      -0.13
-css      -0.13
enberg    -0.13
(()       -0.13
Positive Logits
indeed           0.94
Indeed           0.69
Indeed           0.68
inde             0.63
действительно    0.50
确               0.49
yes              0.44
skutečně         0.35
Yes              0.35
yeah             0.35
Activations Density 0.351%