INDEX
    Explanations

    discrimination

    Negative Logits
     dup          -0.06
     penalties    -0.06
     Duty         -0.06
    _Post         -0.06
     />;↵         -0.06
     brink        -0.06
     trov         -0.06
    -horizontal   -0.06
    HH            -0.06
    Borders       -0.06
    Positive Logits
    .Result       0.06
    VMLINUX       0.06
    oops          0.06
    народ         0.06
    (style        0.06
     &=           0.06
    ublik         0.06
    nímu          0.06
    .un           0.06
    /gr           0.06
    Act Density 0.007%

    No Known Activations