INDEX
    Explanations

    phrases related to actions or activities that the individual has been doing

    the phrase "I have been" in various contexts

    Negative Logits
    Token        Logit
    "lies"       -0.72
    "izable"     -0.69
    "rones"      -0.68
    "terday"     -0.68
    "Must"       -0.67
    "ives"       -0.66
    "iop"        -0.66
    "regate"     -0.65
    "Guy"        -0.64
    "idental"    -0.64
    Positive Logits
    Token            Logit
    " subjected"     1.03
    " able"          1.03
    " unable"        1.03
    " tasked"        0.95
    " accused"       0.93
    " forgiven"      0.91
    " criticized"    0.91
    " warned"        0.89
    " punished"      0.89
    " rewarded"      0.89
    Activation Density 0.126%

    No Known Activations