INDEX
    Explanations

    occurrences of the letter 'w' and numerical digits

    New Auto-Interp
    Negative Logits
    kili
    -0.18
    anje
    -0.16
    fir
    -0.16
    emek
    -0.15
     Eag
    -0.15
    vider
    -0.15
    Ïģιά
    -0.15
    å͝
    -0.15
    eden
    -0.15
    жд
    -0.15
    POSITIVE LOGITS
     Pla
    0.15
    uhn
    0.15
     plateau
    0.15
    itter
    0.15
     Automatic
    0.14
    omentum
    0.14
     total
    0.14
    енз
    0.13
    nut
    0.13
     prepared
    0.13
    Act Density 0.000%

    No Known Activations