INDEX
    Explanations
    No Explanations Found
    Negative Logits
    .                   1.51
    (no visible token)  1.50
    (no visible token)  1.33
    .:                  1.30
    +.                  1.29
    .|                  1.28
    .).                 1.28
    ۔                   1.27
    .)                  1.25
    ().                 1.25
    Positive Logits
    непосредственно     0.85
    DIRECT              0.75
    تمام                0.71
    bezpośred           0.71
    চিত্র               0.70
    விலான               0.68
    trực                0.67
    (no visible token)  0.67
    đối                 0.67
    대상으로            0.66
    Act Density 1.290%

    No Known Activations