INDEX
    Explanations

    words that end with "ing."

    New Auto-Interp
    Negative Logits
    unny
    -0.17
    bron
    -0.16
    ifa
    -0.16
    eth
    -0.15
    nds
    -0.15
     skl
    -0.15
    ÑĭÑģ
    -0.14
    ogne
    -0.14
    reira
    -0.14
    uger
    -0.14
    POSITIVE LOGITS
    oment
    0.17
    urv
    0.15
    polator
    0.14
    رات
    0.14
    animations
    0.14
    лава
    0.14
    ạm
    0.14
    ustry
    0.14
    amaha
    0.14
    Ä±ÅŁÄ±k
    0.13
    Act Density 0.004%

    No Known Activations