INDEX
    Explanations

    phrases expressing personal opinions

    New Auto-Interp
    Negative Logits
     Gothic
    -0.77
    aic
    -0.69
     consequential
    -0.66
     Window
    -0.65
     Conversation
    -0.63
     Meaning
    -0.63
     Azerbai
    -0.63
     Organizations
    -0.61
     Palest
    -0.59
     Warden
    -0.59
    POSITIVE LOGITS
     took
    1.23
     knew
    1.17
     gave
    1.17
     drove
    1.15
     went
    1.15
     underwent
    1.14
     blew
    1.13
     stole
    1.11
     became
    1.11
     chose
    1.10
    Act Density 1.036%

    No Known Activations