INDEX
Explanations
instances of anticipation or eagerness related to future events
New Auto-Interp
Negative Logits
lang
-0.16
ortex
-0.15
hl
-0.15
nj
-0.14
vert
-0.14
chez
-0.14
ffer
-0.14
Gibbs
-0.14
verts
-0.14
in
-0.13
POSITIVE LOGITS
wait
0.27
WAIT
0.24
wait
0.22
believe
0.22
imagine
0.20
/wait
0.19
remember
0.19
waits
0.19
help
0.19
Wait
0.18
Activations Density 0.031%