INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    geist
    -0.09
     wore
    -0.08
    ملت
    -0.07
     à
    -0.07
    Pret
    -0.07
     Curriculum
    -0.07
     contexts
    -0.07
     parecer
    -0.07
    ventures
    -0.07
    (named
    -0.07
    POSITIVE LOGITS
    ివ
    0.08
    ikwa
    0.08
    ога
    0.08
     prostor
    0.08
    েছে
    0.08
    ిడ
    0.07
     Wohnung
    0.07
     dostup
    0.07
     массив
    0.07
     dealership
    0.07
    Act Density 0.017%

    No Known Activations