INDEX
    Explanations
    Negative Logits
    " проблема"                -0.07
    ",p"                       -0.06
    " kní"                     -0.06
    " Mercy"                   -0.06
    " الدولة"                  -0.06
    " اح"                      -0.06
    " faithful"                -0.06
    " Cem"                     -0.06
    " اختلاف"                  -0.06
    "cola"                     -0.06
    Positive Logits
    " pointing"                0.07
    " notwithstanding"         0.07
    "тися"                     0.07
    "FLT"                      0.07
    "_SECOND"                  0.06
    "anging"                   0.06
    " waiting"                 0.06
    "venture"                  0.06
    (whitespace-only token)    0.06
    "TEXT"                     0.06
    Act Density 0.007%

    No Known Activations