INDEX
Explanations
phrases related to persuading or convincing someone to take a specific action
phrases that indicate an effort to persuade or influence someone
New Auto-Interp
Negative Logits
advertising
-0.70
é¾įå¥ij士
-0.66
Judging
-0.66
Effects
-0.64
contrasted
-0.63
Compar
-0.63
inferred
-0.62
Reports
-0.61
Chall
-0.61
runtime
-0.60
POSITIVE LOGITS
cooperate
1.09
accept
0.99
obey
0.99
behave
0.97
agree
0.96
commit
0.96
comply
0.95
participate
0.94
soften
0.93
admit
0.93
Activations Density 0.107%