INDEX
Explanations
verbs related to taking action or making decisions
imperative phrases that express necessity or obligation
New Auto-Interp
Negative Logits
addictive
-0.66
Levant
-0.65
bone
-0.65
ELD
-0.63
chy
-0.59
Marino
-0.58
Dro
-0.57
Favorite
-0.57
epile
-0.57
cit
-0.57
POSITIVE LOGITS
ourselves
1.16
educate
0.83
conclude
0.82
discuss
0.82
celebrate
0.81
ruce
0.81
yt
0.76
collectively
0.76
presume
0.76
learn
0.76
Activations Density 0.216%