INDEX
    Explanations
    Negative Logits
     borderline     -0.08
     ben            -0.08
     DAS            -0.08
     openly         -0.07
     sert           -0.07
    _v              -0.07
     vivo           -0.07
    anic            -0.07
    uma             -0.07
     Kum            -0.07
    Positive Logits
     الخط           0.08
                    0.08
    vq              0.08
     Horr           0.07
     филь           0.07
    Tel             0.07
     بحث            0.07
     monumental     0.07
    Gro             0.07
     pell           0.07
    Act Density 0.054%

    No Known Activations