INDEX
    Explanations
    Negative Logits
     제가         -0.07
    ahoma        -0.07
     assez       -0.07
    anvas        -0.06
     amac        -0.06
     yapmaya     -0.06
    ывать        -0.06
     رفته        -0.06
     همین        -0.06
     مشاهده      -0.06
    Positive Logits
    %s           0.08
    %d           0.07
    Media        0.07
    %p           0.07
    Great        0.07
    %x           0.06
    .'));↵       0.06
     consulted   0.06
    Term         0.06
    'field       0.06
    Activation Density: 0.005%

    No Known Activations