INDEX
Explanations
instructions or commands in a document
phrases related to actions or tasks being completed
New Auto-Interp
Negative Logits
rising
-0.68
used
-0.58
intend
-0.56
mare
-0.54
Prosecutor
-0.54
Chiefs
-0.53
rub
-0.53
Buk
-0.53
videos
-0.53
opened
-0.52
POSITIVE LOGITS
so
1.13
oming
0.89
likewise
0.86
ze
0.83
omsday
0.83
ozy
0.81
zing
0.78
away
0.78
ppel
0.78
vet
0.77
Activations Density 0.097%