INDEX
    Explanations

    occurrences of the word "El" along with its variations

    Negative Logits
    "ynet"        -0.16
    "o"           -0.16
    "upertino"    -0.15
    "ole"         -0.15
    "l"           -0.15
    "ously"       -0.14
    "lum"         -0.14
    "yp"          -0.14
    "yper"        -0.14
    "dit"         -0.14
    Positive Logits
    "ora"             0.20
    "odie"            0.18
    "raith"           0.18
    "kins"            0.17
    "bow"             0.16
    "ipse"            0.16
    "ías"             0.16
    "placeholders"    0.15
    " Paso"           0.15
    "izabeth"         0.15
    Activation Density: 0.015%

    No Known Activations