Explanations
phrases indicating anticipation or looking forward to future events
Negative Logits
preferring    -0.14
opting        -0.14
opts          -0.14
prefers       -0.14
willing       -0.13
bothered      -0.13
bens          -0.13
beden         -0.13
olare         -0.13
demanded      -0.13
Positive Logits
hearing          0.27
many             0.24
continued        0.23
future           0.22
seeing           0.21
opportunities    0.21
continue         0.20
welcoming        0.20
Hearing          0.20
hopefully        0.20
Activations Density: 0.056%