INDEX
    Explanations

    instances of the word "just."

    New Auto-Interp
    Negative Logits
    ccording
    -0.69
     Palestin
    -0.67
     adolesc
    -0.65
     Strategies
    -0.64
     subsequ
    -0.64
     Key
    -0.64
     discrep
    -0.63
     challeng
    -0.61
     Archdemon
    -0.59
     representations
    -0.59
    POSITIVE LOGITS
    ifiable
    1.15
    ifications
    1.07
     ignore
    1.03
    ify
    1.02
    if
    0.98
    itate
    0.92
    ificate
    0.89
    ification
    0.89
    ified
    0.87
    ifi
    0.85
    Act Density 0.040%

    No Known Activations