INDEX
    Explanations

    phrases indicating a directive or action

    repetitive phrases emphasizing the word "just."

    New Auto-Interp
    Negative Logits
    ixel
    -0.70
    lav
    -0.67
    untled
    -0.67
    ensical
    -0.66
     plaintiff
    -0.63
    endra
    -0.62
    glomer
    -0.62
     adversary
    -0.60
    pora
    -0.60
     PLUS
    -0.59
    POSITIVE LOGITS
    ifiable
    1.05
    ifications
    1.03
     kidding
    0.91
    if
    0.89
    ifi
    0.85
    ices
    0.85
     plain
    0.81
    icia
    0.77
    ific
    0.76
    itia
    0.74
    Act Density 0.098%

    No Known Activations