INDEX
    Explanations

    numeric identifiers within strings

    instances of specific keywords and identifiers

    Negative Logits
     Sally  -0.83
     Ler    -0.77
     Sz     -0.76
     Stew   -0.75
     Ens    -0.72
     Elise  -0.71
     Sigma  -0.70
     Sak    -0.70
     Isle   -0.69
     Ell    -0.69

    Positive Logits
    b       1.47
    bs      1.39
    bis     1.29
    ber     1.28
    bb      1.27
    bol     1.26
    ba      1.25
    bish    1.23
    bral    1.20
    bi      1.19

    Activation Density: 0.222%

    No Known Activations