INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _gender
    -0.07
     :-
    -0.07
     chars
    -0.06
    adir
    -0.06
    conte
    -0.06
     totalitarian
    -0.06
     [:
    -0.06
    });↵↵↵↵
    -0.06
    _mult
    -0.06
     Kir
    -0.06
    POSITIVE LOGITS
     Toolbox
    0.07
    (new
    0.07
    ">×</
    0.06
    /application
    0.06
    _atts
    0.06
    'image
    0.06
    atabase
    0.06
    ůr
    0.06
    'value
    0.06
    .HorizontalAlignment
    0.06
    Act Density 0.189%

    No Known Activations