INDEX
Explanations
phrases indicating future actions or outcomes
expressions of future possibilities or predictions
Negative Logits
¶                -0.63
assin            -0.60
ilaterally       -0.59
sket             -0.58
culus            -0.56
Dude             -0.56
Advertisement    -0.56
specificity      -0.55
ELF              -0.54
disadvant        -0.52
Positive Logits
continue         1.50
continue         1.27
hereafter        1.18
continuing       1.08
continued        1.04
soon             1.04
resume           1.03
forever          0.99
continues        0.99
remain           0.97
Activations Density: 0.257%