INDEX
    Explanations
    Negative Logits
     des            -0.07
     üzer           -0.06
    .ci             -0.06
     ماشین          -0.06
     специалист     -0.06
                    -0.06
     pedestrian     -0.06
     phenomenon     -0.06
    DST             -0.06
     UIImageView    -0.06
    Positive Logits
     não            0.19
     Não            0.08
                    0.07
    ��              0.07
    Não             0.07
     disagreed      0.07
    CURRENT         0.06
    しまった        0.06
    ूं               0.06
     не             0.06
    Act Density 0.024%

    No Known Activations