INDEX
    Explanations

    instances where the word "for" is followed by a mix of numbers and letters, such as "FOR9" or "FOR7CE1"

    New Auto-Interp
    Negative Logits
    inese
    -0.66
     jaws
    -0.66
    adelphia
    -0.62
     vom
    -0.57
    zona
    -0.57
     Ples
    -0.56
    omi
    -0.55
    Tube
    -0.55
    marine
    -0.54
     tailor
    -0.54
    POSITIVE LOGITS
    gotten
    1.64
    bidden
    1.53
     example
    1.12
    WARD
    1.10
     instance
    1.08
    gery
    1.04
    cing
    1.03
    ged
    0.98
    cible
    0.96
    ced
    0.95
    Act Density 8.189%

    No Known Activations