INDEX
Explanations
**phrases signaling permission, advice, or assistance.**
phrases emphasizing the idea of not needing to do something or feeling obligated
New Auto-Interp
Negative Logits
Cyp
-0.75
Motion
-0.72
interstitial
-0.66
Eva
-0.66
iple
-0.65
wikipedia
-0.64
cember
-0.63
cade
-0.60
Crate
-0.60
handled
-0.60
POSITIVE LOGITS
worry
1.40
bother
1.14
anymore
1.07
wait
0.97
rely
0.93
sacrifice
0.88
fret
0.87
stare
0.86
suffer
0.86
confront
0.84
Activations Density 0.067%