INDEX
Explanations
discourse markers and conversational turn-taking cues
New Auto-Interp
Negative Logits
while
-0.21
since
-0.19
though
-0.19
after
-0.17
moreover
-0.17
plus
-0.17
however
-0.16
æŃ¤
-0.16
heck
-0.16
WHILE
-0.16
POSITIVE LOGITS
And
0.28
And
0.25
Cause
0.21
Cause
0.21
So
0.20
Number
0.19
So
0.19
So
0.19
Number
0.18
Again
0.17
Activations Density 0.141%