INDEX
Explanations
terms related to making a strong effort or exerting pressure
instances of the word "push" indicating an attempt to influence or promote action
New Auto-Interp
Negative Logits
abol
-0.78
redited
-0.71
omial
-0.67
BuyableInstoreAndOnline
-0.66
poss
-0.66
Chaff
-0.65
Recogn
-0.64
Surviv
-0.64
odied
-0.61
Dealer
-0.61
POSITIVE LOGITS
push
0.90
push
0.88
back
0.87
pushing
0.83
chairs
0.81
overs
0.81
pushes
0.80
boxes
0.78
Push
0.76
arte
0.76
Activations Density 0.017%