INDEX
Explanations
phrases that involve advising or instructing someone not to do something
phrases that include the word "to" followed by an action or directive
New Auto-Interp
Negative Logits
ãĤ¦ãĤ¹
-0.85
grounds
-0.82
examination
-0.81
usage
-0.71
accompan
-0.70
æ©
-0.70
¸
-0.69
raising
-0.69
DIT
-0.69
weekly
-0.68
POSITIVE LOGITS
bother
1.21
worry
1.04
mention
1.03
interfere
0.96
offend
0.93
bud
0.91
exceed
0.91
anymore
0.91
succumb
0.91
stray
0.89
Activations Density 0.101%