INDEX
Explanations
expressions of dialogue and conversational interactions
New Auto-Interp
Negative Logits
basically
-0.17
Basically
-0.17
Basically
-0.16
nay
-0.15
Fuck
-0.15
éĥ
-0.15
fuck
-0.14
huge
-0.14
gm
-0.14
Fuck
-0.14
POSITIVE LOGITS
sorter
0.21
queer
0.18
arter
0.18
iglia
0.16
fellows
0.16
couldn
0.15
positively
0.15
Jest
0.15
.want
0.14
Que
0.14
Activations Density 0.342%