INDEX
Explanations
commands or instructions that involve telling someone to do something
instances of the word "ordered" indicating commands or directives
Negative Logits
pmwiki    -0.76
cit       -0.76
doi       -0.74
bil       -0.74
td        -0.73
����      -0.71
ãĤ¨       -0.71
ÎĶ        -0.70
odor      -0.69
Ott       -0.69
Positive Logits
ordering  0.94
ordered   0.92
orders    0.88
etary     0.81
avorite   0.77
confir    0.77
eering    0.76
lies      0.73
rul       0.73
psychiat  0.72
Activations Density 0.015%