    Explanations

    phrases that represent a negative connotation

    caution/avoidance

    Negative Logits
    <bos>             -0.63
     Femen            -0.56
    ьаж               -0.52
     Wikiquote        -0.52
     ThemeData        -0.51
    istani            -0.49
     Himself          -0.48
    WriteTagHelper    -0.48
     aught            -0.48
    timation          -0.47
    Positive Logits
    ReusableCell      0.68
    stdc              0.64
    CppMethod         0.56
     ComVisible       0.52
    ArgsConstructor   0.49
    tał               0.49
     autorytatywna    0.48
     mergeFrom        0.48
    glLoadIdentity    0.47
    fromnode          0.47
    Act Density 0.550%

    No Known Activations