INDEX
Explanations
conversational expressions conveying uncertainty or reluctance
New Auto-Interp
Negative Logits
erval
-0.15
èIJ
-0.15
Interval
-0.15
Interval
-0.15
ivar
-0.14
.bt
-0.14
anel
-0.14
wow
-0.14
wow
-0.14
alth
-0.13
POSITIVE LOGITS
who
0.24
beg
0.24
screw
0.23
who
0.22
fine
0.22
thems
0.20
shrugged
0.20
fine
0.19
Meh
0.19
Fine
0.18
Activations Density 0.298%