INDEX
Explanations
phrases indicating a future action about to occur
phrases indicating future intentions or planned actions
New Auto-Interp
Negative Logits
PLIED
-0.78
Ain
-0.63
tein
-0.62
Takeru
-0.58
Tess
-0.58
olis
-0.56
ifle
-0.56
AMA
-0.54
hearts
-0.54
illin
-0.54
POSITIVE LOGITS
likely
0.78
atile
0.75
likely
0.72
Parenthood
0.71
untarily
0.71
poised
0.69
onna
0.68
heading
0.66
doomed
0.66
igible
0.65
Activations Density 0.118%