INDEX
Explanations
expressions of anticipation or uncertainty
the phrase "I guess" and variations of it
New Auto-Interp
Negative Logits
perty
-0.72
foreseen
-0.72
iqueness
-0.70
byn
-0.70
esc
-0.68
natureconservancy
-0.68
claimed
-0.64
è¦ļéĨĴ
-0.64
elight
-0.62
atures
-0.62
POSITIVE LOGITS
nob
0.79
thats
0.76
unsurprisingly
0.73
whoever
0.70
you
0.70
cha
0.69
yeah
0.69
everybody
0.68
congr
0.67
why
0.67
Activations Density 0.047%