INDEX
    Explanations

    negations or expressions of denial

    Preceding "not" negations in different languages

    New Auto-Interp
    Negative Logits
    still
    -0.59
     efectivamente
    -0.55
     ***!
    -0.54
    situ
    -0.52
    saturated
    -0.52
     همچ
    -0.51
    χι
    -0.51
    atase
    -0.51
    acious
    -0.51
    LikeLike
    -0.51
    POSITIVE LOGITS
    сение
    0.61
     rospy
    0.60
    '},
    
    0.56
     AppDelegate
    0.55
     Hodgkin
    0.53
    etcode
    0.53
    pidou
    0.52
    0.52
     Bede
    0.52
    SuppressLint
    0.51
    Act Density 0.057%

    No Known Activations