INDEX
    Explanations

    occurrences of the word "first" and its variations

    New Auto-Interp
    Negative Logits
    ESIS
    -0.07
    ekyll
    -0.07
    llib
    -0.07
    Äįen
    -0.07
    <center
    -0.07
    jian
    -0.06
    okino
    -0.06
    klä
    -0.06
    IED
    -0.06
     jadx
    -0.06
    POSITIVE LOGITS
    óst
    0.07
    inkel
    0.07
    ghan
    0.06
     Craft
    0.06
    unei
    0.06
     proper
    0.06
    ikki
    0.06
     AFF
    0.06
    503
    0.06
    umper
    0.06
    Act Density 0.025%

    No Known Activations