INDEX
    Explanations
    Negative Logits
    Token          Logit
     సంస్థ          -0.08
     elderly       -0.08
     धीरे           -0.07
    _poll          -0.07
     Жен           -0.07
     संगठन          -0.07
     beschouwd     -0.07
    (blank)        -0.07
     γυνα          -0.07
    consider       -0.07
    Positive Logits
    Token          Logit
     recieved      0.09
    ิโ             0.08
     ഓഫ്           0.08
    (blank)        0.08
    ਾਇ             0.08
    іда            0.08
     allot         0.08
     เซ            0.08
    Compiler       0.07
     നൽക           0.07
    Activation Density: 0.052%

    No Known Activations