Explanations
conversational structures and dialogue markers
Negative Logits
Damn        -0.16
akat        -0.15
inis        -0.14
uche        -0.14
seam        -0.14
олее        -0.14
ingers      -0.13
èĤ¥         -0.13
Yep         -0.13
distinct    -0.13
Positive Logits
come        0.20
Listen      0.20
Exc         0.20
Come        0.20
Listen      0.20
You         0.19
look        0.19
listen      0.19
Look        0.19
listen      0.18
Activations Density 0.303%