INDEX
Explanations
verbs related to beginnings or starting events
New Auto-Interp
Negative Logits
Continue
-0.17
Continue
-0.15
oret
-0.15
continuation
-0.15
continue
-0.15
continued
-0.14
still
-0.14
.boost
-0.14
wal
-0.14
still
-0.14
POSITIVE LOGITS
to
0.20
paying
0.17
down
0.16
857
0.15
asking
0.15
thinking
0.15
associ
0.15
preparations
0.15
taking
0.15
showing
0.15
Activations Density 0.060%