INDEX
    Explanations

    most important and relevant elements

    New Auto-Interp
    Negative Logits
     ವಿವಿಧ
    0.41
     ਇੱਕ
    0.41
     another
    0.40
    كى
    0.40
     женщина
    0.38
    министра
    0.37
     hermoso
    0.37
     një
    0.37
     ə
    0.37
     иной
    0.37
    POSITIVE LOGITS
    那些
    0.75
     those
    0.73
    those
    0.72
     areas
    0.70
     наиболее
    0.65
     selected
    0.64
    哪些
    0.63
     Those
    0.63
     aquellos
    0.63
     ceux
    0.61
    Act Density 0.385%

    No Known Activations