INDEX
    Explanations

    phrases that involve calling or labeling something

    phrases and expressions related to self-identification

    New Auto-Interp
    Negative Logits
    ersen
    -0.69
    iland
    -0.61
    ockets
    -0.59
    aten
    -0.59
     cough
    -0.58
    midt
    -0.57
     volcano
    -0.57
    ----------------
    -0.57
    edia
    -0.57
     Tribune
    -0.56
    POSITIVE LOGITS
     "#
    0.86
    selves
    0.70
     ''
    0.69
     '
    0.68
     ``
    0.68
    named
    0.67
    runners
    0.66
    geant
    0.63
    leaders
    0.63
     slogans
    0.63
    Act Density 0.194%

    No Known Activations