Explanations
phrases indicating a sense of permanence or continuation in the present or future
frequently occurring adverbs and expressions indicating certainty or continuity
Negative Logits
cos      -0.79
stad     -0.74
tics     -0.74
gdala    -0.71
lab      -0.67
quer     -0.66
Qual     -0.66
Sing     -0.66
regn     -0.65
role     -0.64
Positive Logits
gonna     0.90
been      0.73
gotta     0.70
Corner    0.70
got       0.68
Plenty    0.67
Chevy     0.67
pretty    0.66
Wrap      0.65
Crazy     0.64
Activations Density 0.285%
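
The page does not define the density figure. One common reading is the fraction of sampled tokens on which the feature activates at all. The sketch below illustrates that reading only; the function name `activation_density`, the `threshold` parameter, and the sample values are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumption: "density" means the share of tokens on which the feature
    fires; the source page does not state its definition.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: a feature that fires on 5 of 2000 tokens.
sample = [0.0] * 2000
for i in (3, 250, 777, 1200, 1999):
    sample[i] = 1.5

print(f"{activation_density(sample):.3%}")
```

Under this reading, a density of 0.285% would mean the feature is active on roughly 3 of every 1,000 sampled tokens.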