INDEX
    Explanations

    numerical identifiers or codes

    Negative Logits
    teenth    -0.77
    orius     -0.72
    light     -0.71
    intosh    -0.69
    fall      -0.68
    olate     -0.66
    achu      -0.66
    shot      -0.66
    oby       -0.65
    zyme      -0.65
    Positive Logits
    nd        2.13
    ND        1.21
    ndra      0.77
    FW        0.73
     thirds   0.73
    nder      0.72
    50        0.71
    502       0.71
    nda       0.70
    121       0.69
    Activation Density: 0.106%

    No Known Activations