INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     kuhusu
    -0.51
    grà
    -0.48
    ✨:
    -0.48
     asupra
    -0.46
     riguardo
    -0.46
    liwości
    -0.44
     nemlig
    -0.44
     Recognizing
    -0.44
    Concerning
    -0.44
    Datuak
    -0.43
    POSITIVE LOGITS
     виправивши
    0.72
    stdafx
    0.60
    mybatisplus
    0.59
     صوتيه
    0.59
     well
    0.58
    saraba
    0.57
     initComponents
    0.57
     تعدى
    0.56
    TRIBUN
    0.56
    :+:
    0.55
    Act Density 0.015%

    No Known Activations