INDEX
    Explanations

    b.a. degrees

    New Auto-Interp
    Negative Logits
    ки
    -0.08
     pro
    -0.08
    igual
    -0.07
    -worker
    -0.07
     мой
    -0.07
    legs
    -0.07
    leer
    -0.07
    heat
    -0.07
    Store
    -0.07
    chart
    -0.07
    POSITIVE LOGITS
    893
    0.09
     comedy
    0.09
     teatr
    0.09
     theatrical
    0.08
    Comedy
    0.08
     psz
    0.08
     psik
    0.08
    ה
    0.08
     hyv
    0.08
     comedian
    0.08
    Act Density 0.003%

    No Known Activations