INDEX
    Explanations

    sports practice

    New Auto-Interp
    Negative Logits
     signed
    -0.07
    '=
    -0.07
     воз
    -0.07
    ivec
    -0.06
     comfortably
    -0.06
    -expand
    -0.06
     overturn
    -0.06
     runaway
    -0.06
     Singleton
    -0.06
    BF
    -0.06
    POSITIVE LOGITS
     двох
    0.07
    зации
    0.07
    ướng
    0.06
     erotik
    0.06
    DownList
    0.06
    (il
    0.06
     identifiable
    0.06
    різ
    0.06
    (image
    0.06
    ão
    0.06
    Act Density 0.082%

    No Known Activations