INDEX
Explanations
instances of conversational transitions and rhetorical questions
New Auto-Interp
Negative Logits
èĩº
-0.16
uns
-0.15
ork
-0.15
Dough
-0.14
220
-0.14
lava
-0.14
berg
-0.14
avig
-0.14
Housing
-0.14
arn
-0.13
POSITIVE LOGITS
_gap
0.19
ãĤıãģĽ
0.16
ãĥ¥
0.15
-gap
0.15
DISCLAIM
0.14
ì±ħ
0.14
Gap
0.14
chy
0.14
EPROM
0.14
eka
0.14
Activations Density 0.061%