INDEX
    Negative Logits
    Token            Logit
    "Narrated"       -0.07
    [token not shown] -0.07
    "DIRECT"         -0.06
    "ala"            -0.06
    " Guil"          -0.06
    "Classes"        -0.06
    "ALA"            -0.06
    " Shin"          -0.06
    ".delegate"      -0.06
    " очі"           -0.06
    Positive Logits
    Token            Logit
    " Univ"          0.08
    " επ"            0.07
    "?>>↵"           0.06
    " 때문"          0.06
    "ucceed"         0.06
    "CodeGen"        0.06
    "tere"           0.06
    " values"        0.06
    "(elem"          0.06
    " nec"           0.06
    Act Density 0.088%

    No Known Activations