INDEX
Explanations
phrases related to stopping or preventing actions
instances of the word "stop" and related phrases emphasizing cessation or interruption
New Auto-Interp
Negative Logits
eton
-0.73
icol
-0.72
é¾įåĸļ士
-0.68
æĪ¦
-0.68
çīĪ
-0.66
igree
-0.64
soDeliveryDate
-0.64
ittal
-0.64
deserts
-0.61
lot
-0.61
POSITIVE LOGITS
bleeding
0.97
ãĤ®
0.79
movement
0.74
cheating
0.72
execution
0.71
smoking
0.71
lockout
0.70
angering
0.70
breathing
0.68
bothering
0.68
Activations Density 0.132%