INDEX
Explanations
phrases related to intentions or actions
phrases indicating future actions or ongoing situations
New Auto-Interp
Negative Logits
Reviewer
-0.70
#$#$
-0.67
jection
-0.64
ollow
-0.63
ð
-0.61
tsky
-0.61
OTOS
-0.60
ĸļ
-0.58
differs
-0.57
interven
-0.54
POSITIVE LOGITS
gonna
2.17
going
2.07
going
1.57
Going
1.24
destined
1.18
likely
1.17
chance
1.05
supposed
1.03
slated
1.01
Going
0.99
Activations Density 0.412%