INDEX
    Explanations

    phrases related to persuading or convincing someone to take a specific action

    phrases that indicate an effort to persuade or influence someone

    New Auto-Interp
    Negative Logits
    advertising
    -0.70
    é¾įå¥ij士
    -0.66
     Judging
    -0.66
     Effects
    -0.64
     contrasted
    -0.63
    Compar
    -0.63
     inferred
    -0.62
     Reports
    -0.61
     Chall
    -0.61
    runtime
    -0.60
    POSITIVE LOGITS
     cooperate
    1.09
     accept
    0.99
     obey
    0.99
     behave
    0.97
     agree
    0.96
     commit
    0.96
     comply
    0.95
     participate
    0.94
     soften
    0.93
     admit
    0.93
    Act Density 0.107%

    No Known Activations