INDEX
Explanations
past tense verbs following "at first" or similar phrases
phrases emphasizing initial reactions or feelings
Negative Logits
ĺħ            -0.77
cellaneous    -0.74
ĵĺ            -0.72
etheless      -0.70
éĸ            -0.67
ļéĨĴ          -0.67
âĹ¼           -0.67
isine         -0.66
today         -0.66
anwhile       -0.66
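Several entries in the negative-logits list (for example `âĹ¼`) look like mojibake but are more likely the printable form of raw token bytes. The sketch below assumes, without confirmation from this page, that the underlying model uses a GPT-2-style byte-level BPE vocabulary, and inverts that byte-to-character mapping. `decode_token` is a hypothetical helper name, not part of any dashboard API. Entries that hold only a fragment of a multi-byte character will not decode cleanly on their own and come back with replacement characters.

```python
def bytes_to_unicode():
    """GPT-2 style table: every byte value 0..255 gets a printable character."""
    # Bytes that are already printable map to themselves.
    bs = (list(range(ord("!"), ord("~") + 1))
          + list(range(0xA1, 0xAC + 1))
          + list(range(0xAE, 0xFF + 1)))
    cs = bs[:]
    n = 0
    # Remaining bytes (control characters, space, etc.) are shifted to U+0100 and up.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, map(chr, cs)))


# Inverse table: printable character -> original byte value.
UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(token: str) -> str:
    """Map a displayed token string back to bytes, then decode as UTF-8.

    Assumption: the token was rendered with the GPT-2 byte-to-unicode table.
    Invalid or partial byte sequences become U+FFFD instead of raising.
    """
    raw = bytes(UNICODE_TO_BYTE[ch] for ch in token)
    return raw.decode("utf-8", errors="replace")


print(decode_token("âĹ¼"))  # ◼
```

Under the same assumption, a leading `Ġ` in a displayed token stands for a space, so `Ġtoday` would decode to ` today`.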
Positive Logits
glance        0.83
blush         0.80
innocuous     0.76
puzz          0.75
rudimentary   0.69
naive         0.68
reluctance    0.66
antagon       0.64
naïve         0.63
hesitant      0.62
Activations Density 0.300%
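The page does not define activation density. A common reading is the fraction of sampled tokens on which the feature has a nonzero activation, and the sketch below assumes that definition. The function name `activation_density`, the `threshold` parameter, and the sample values are all illustrative and not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose feature activation exceeds `threshold`.

    Assumed definition: density = (# tokens with activation > threshold) / (# tokens).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Made-up sample: the feature fires on 3 of 1000 tokens.
sample = [0.0] * 997 + [1.2, 0.4, 3.1]
print(f"{activation_density(sample):.3%}")  # 0.300%
```

With this reading, a density of 0.300% means the feature is active on roughly 3 tokens in every 1000 sampled.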