Explanations
phrases indicating a transition or continuation in a speech or text
phrases indicating a preface or introduction to content
Negative Logits
Token   Logit
cow     -0.81
knit    -0.78
rod     -0.76
pless   -0.70
scl     -0.66
pee     -0.66
Tens    -0.65
nian    -0.65
tek     -0.65
pak     -0.65
Positive Logits
Token           Logit
doubt           1.30
ado             1.29
hesitation      1.25
provocation     1.08
exaggeration    1.06
explanation     1.02
delay           1.00
contradiction   0.92
qualification   0.92
irony           0.90
Activations Density: 0.107%
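
The page does not define the density figure. As a minimal sketch, assuming it denotes the fraction of token positions on which the feature's activation is above zero, it could be computed as follows. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and chosen only for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of token positions on which the
    feature fires at all; the source page does not state its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: 1,000 token positions, exactly one of which fires.
acts = [0.0] * 999 + [2.5]
print(f"{activation_density(acts):.3%}")  # prints 0.100%
```

Under this reading, a density of 0.107% would correspond to the feature firing on roughly one token position in every 935.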