INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    на
    1.04
    ری
    1.04
    কে
    0.91
    0.90
    いた
    0.90
     sviluppo
    0.88
    จะ
    0.84
    نے
    0.83
    هم
    0.82
     akan
    0.79
    POSITIVE LOGITS
    on
    1.04
    '
    0.89
    -
    0.68
    onar
    0.58
    hi
    0.58
    us
    0.57
    A
    0.57
    N
    0.57
    L
    0.56
    V
    0.55
    Act Density 5.589%

    No Known Activations