INDEX
    Explanations

    phrases related to giving instructions or demands

    action-oriented phrases that involve requesting or demanding actions from individuals

    New Auto-Interp
    Negative Logits
    obyl
    -0.73
    roup
    -0.64
     Flavoring
    -0.63
    yna
    -0.59
    zbollah
    -0.59
    otonin
    -0.58
    bard
    -0.55
    ugu
    -0.54
    roups
    -0.54
    umbers
    -0.54
    POSITIVE LOGITS
     him
    1.29
     he
    1.25
     his
    1.10
     hers
    1.09
    him
    1.01
     she
    0.98
    He
    0.95
     He
    0.94
    his
    0.93
     whom
    0.86
    Act Density 0.700%

    No Known Activations