INDEX
Explanations
pronouns related to future actions or aspirations
Negative Logits
ãĤ»            -0.65
ĸļ             -0.60
iameter        -0.58
reminder       -0.57
puzzling       -0.56
Rumble         -0.55
emphasis       -0.55
ĨĴ             -0.55
unexplained    -0.54
questioning    -0.54
Positive Logits
will           1.45
'll            1.41
will           1.29
WILL           1.20
would          1.16
would          1.11
someday        1.09
could          1.07
might          1.06
wont           1.06
Activations Density 0.336%