INDEX
    Explanations

    examples of situations or actions

    the phrase "For example" or variations of it

    New Auto-Interp
    Negative Logits
    iewicz
    -0.75
    alan
    -0.68
    izzle
    -0.65
    rious
    -0.65
    eat
    -0.64
    ownt
    -0.63
    Moh
    -0.63
    hya
    -0.63
    beat
    -0.61
    itably
    -0.60
    POSITIVE LOGITS
     example
    1.47
     instance
    1.31
     comparison
    1.12
     simplicity
    1.12
     purposes
    1.11
    cing
    1.09
     sake
    1.03
     Example
    1.03
    bidden
    1.01
    got
    0.99
    Act Density 0.176%

    No Known Activations