INDEX
Explanations
interrogative sentences that prompt for an explanation or response
questions or inquiries posed in the text
New Auto-Interp
Negative Logits
athe
-0.72
threaded
-0.68
weld
-0.68
agos
-0.67
knit
-0.65
background
-0.64
iannopoulos
-0.63
ema
-0.63
brim
-0.63
imer
-0.63
POSITIVE LOGITS
Well
1.52
Firstly
1.21
Probably
1.19
Well
1.16
Quite
1.14
Plenty
1.11
Certainly
1.09
Turns
1.09
Surely
1.08
Perhaps
1.08
Activations Density 0.099%