INDEX
Explanations
conversational elements and expressions of gratitude
New Auto-Interp
Negative Logits
awns
-0.15
Leigh
-0.14
ypi
-0.13
VOKE
-0.13
.sf
-0.13
igon
-0.13
lobs
-0.13
KY
-0.13
hey
-0.13
orr
-0.13
POSITIVE LOGITS
interview
0.19
oux
0.15
nom
0.15
anger
0.15
Interview
0.15
подв
0.14
Nom
0.14
ãĥĨãĥ«
0.14
entrev
0.14
onom
0.14
Activations Density 0.639%