INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ์ซ
    -0.07
     SAL
    -0.07
     sel
    -0.07
    _SQL
    -0.07
     Mais
    -0.07
     distinguish
    -0.07
     expl
    -0.07
     basal
    -0.06
    mızı
    -0.06
    signal
    -0.06
    POSITIVE LOGITS
     foot
    0.13
    foot
    0.12
     Foot
    0.11
    Foot
    0.09
     FOOT
    0.09
     feet
    0.09
     footing
    0.08
    -foot
    0.08
    ifter
    0.08
    ooter
    0.08
    Act Density 0.016%

    No Known Activations