INDEX
    Explanations

    leadership and management roles

    New Auto-Interp
    Negative Logits
     is
    0.52
    0.48
    д
    0.47
     మనం
    0.47
     ఉపయోగ
    0.47
     ము
    0.46
    0.46
     ذلك
    0.46
     నె
    0.45
     ఇప్పుడు
    0.45
    POSITIVE LOGITS
     for
    0.65
    0.52
    ство
    0.52
    کار
    0.49
    0.49
    for
    0.48
    کل
    0.48
    B
    0.48
    0.46
    کن
    0.45
    Act Density 0.207%

    No Known Activations