INDEX
    Explanations

    gerunds and their usage in various contexts

    New Auto-Interp
    Negative Logits
    zan
    -0.17
    INGS
    -0.17
    ings
    -0.17
    /remove
    -0.15
    å·
    -0.15
    olan
    -0.15
       
    -0.14
    инг
    -0.14
    ยà¸ĩ
    -0.14
    oice
    -0.14
    POSITIVE LOGITS
    Ãłn
    0.15
    æŁ±
    0.15
    IRST
    0.15
    ADB
    0.14
    redient
    0.14
     oneself
    0.14
    redients
    0.14
     mere
    0.14
    dre
    0.14
    pip
    0.14
    Act Density 0.141%

    No Known Activations