INDEX
    Explanations

    phrases related to being physically damaged or in a state of disrepair

    instances of the word "broken."

    New Auto-Interp
    Negative Logits
    atur
    -0.74
    gee
    -0.72
     appell
    -0.72
     Salman
    -0.71
     Heller
    -0.70
     designate
    -0.69
    ffer
    -0.67
    advertising
    -0.66
    respond
    -0.66
    iens
    -0.66
    POSITIVE LOGITS
     broken
    3.63
    broken
    2.28
     Broken
    2.10
     shattered
    2.05
     fractured
    1.95
     cracked
    1.75
     busted
    1.62
     breaking
    1.54
     smashed
    1.54
     broke
    1.51
    Act Density 0.013%

    No Known Activations