INDEX
    Explanations
    Negative Logits
    inqu    -0.07
     FF    -0.06
    Info    -0.06
    (errors    -0.06
     semaphore    -0.06
    sv    -0.06
    _Format    -0.06
    Ly    -0.06
    Bang    -0.06
     гид    -0.06
    Positive Logits
     действия    0.07
     غرب    0.07
    ุร    0.07
    _dynamic    0.06
     спост    0.06
     Upper    0.06
    irling    0.06
    jc    0.06
     연결    0.06
     VIP    0.06
    Act Density 0.052%

    No Known Activations