INDEX
    Explanations

    Russian word

    New Auto-Interp
    Negative Logits
     k
    -0.08
     fakta
    -0.08
     факт
    -0.07
    ിള
    -0.07
     Listing
    -0.07
     નોંધ
    -0.07
     znam
    -0.07
     cũng
    -0.07
    -0.07
     klicken
    -0.07
    POSITIVE LOGITS
     خپلو
    0.09
    صبح
    0.08
     వెళ్ల
    0.08
     ventures
    0.08
    ventures
    0.08
     ventured
    0.08
     مباشرة
    0.08
    تم
    0.08
     instead
    0.08
     обратно
    0.08
    Act Density 0.035%

    No Known Activations