INDEX
Explanations
phrases suggesting doubt or uncertainty
phrases indicating uncertainty or skepticism about future events
New Auto-Interp
Negative Logits
ngth
-0.73
raq
-0.68
alde
-0.67
verbs
-0.65
speculated
-0.65
ription
-0.64
clamation
-0.62
aina
-0.61
reminds
-0.60
culated
-0.60
POSITIVE LOGITS
anytime
1.46
anymore
1.34
ever
1.19
anything
1.16
EVER
1.15
anywhere
1.11
any
1.10
ever
1.00
nor
0.99
whatsoever
0.96
Activations Density 0.353%