INDEX
    Explanations

    proper nouns or other specific words related to certain entities or events

    line breaks or breaks in text formatting

    New Auto-Interp
    Negative Logits
    WARE
    -0.72
    balls
    -0.67
    meal
    -0.66
    eers
    -0.64
    Sax
    -0.63
     catch
    -0.62
    tale
    -0.61
     Totem
    -0.59
     caught
    -0.58
    ãģŁ
    -0.57
    POSITIVE LOGITS
    ackets
    1.28
    acket
    1.14
    anches
    1.06
    anch
    1.04
    igham
    1.04
    unn
    1.01
    aces
    0.99
    OAD
    0.99
    ushed
    0.97
    acing
    0.95
    Act Density 0.017%

    No Known Activations