INDEX
    Explanations

    verbs indicating actions or emotional reactions

    phrases indicating emotions and reactions to change

    New Auto-Interp
    Negative Logits
    selves
    -0.70
     unison
    -0.67
    hub
    -0.63
    they
    -0.57
    angular
    -0.56
    ocument
    -0.55
    VERTISEMENT
    -0.54
     respectively
    -0.54
    taboola
    -0.54
    ADVERTISEMENT
    -0.53
    POSITIVE LOGITS
     himself
    1.80
     Himself
    1.28
     his
    1.25
     herself
    1.11
     HIS
    0.94
     His
    0.92
    his
    0.90
    His
    0.88
     subordinates
    0.79
     wife
    0.70
    Act Density 2.161%

    No Known Activations