INDEX
    Explanations

    words that end in 'ed'

    occurrences of prefixes or root words that suggest a state or action

    New Auto-Interp
    Negative Logits
     kittens
    -0.67
    Reviewer
    -0.65
     Whats
    -0.63
    OHN
    -0.62
    accompan
    -0.62
    Ü
    -0.57
     operators
    -0.56
     davidjl
    -0.55
    whatever
    -0.55
    eps
    -0.55
    POSITIVE LOGITS
    ividual
    1.01
    urities
    0.83
    uable
    0.76
    asonic
    0.74
    inately
    0.73
    uments
    0.73
    itably
    0.73
    ible
    0.73
    iever
    0.72
    umerable
    0.70
    Act Density 0.088%

    No Known Activations