INDEX
Explanations
the word "on" in contexts related to making plans or decisions
phrases that express intentions or plans
New Auto-Interp
Negative Logits
arta
-0.78
omsky
-0.77
ESA
-0.76
chin
-0.75
Pac
-0.74
MRI
-0.74
Mine
-0.73
externalActionCode
-0.71
displayText
-0.70
SUP
-0.70
POSITIVE LOGITS
keeping
1.13
staying
1.12
seeing
1.07
preserving
1.07
sticking
1.07
getting
1.04
delivering
1.03
acquiring
1.02
having
1.02
avoiding
1.02
Activations Density 0.096%