INDEX
Explanations
expressions of uncertainty and questions about choices or opinions
New Auto-Interp
Negative Logits
Attempting
-0.16
Granted
-0.15
attempting
-0.15
ostensibly
-0.15
Granted
-0.15
Odds
-0.14
Contained
-0.14
predomin
-0.14
ie
-0.13
essler
-0.13
POSITIVE LOGITS
Bes
0.23
Bes
0.23
anyway
0.22
Anyway
0.21
Anyway
0.21
Moreover
0.21
Nevertheless
0.21
bt
0.20
nevertheless
0.20
beside
0.20
Activations Density 0.221%