    Explanations

    negatives that emphasize the unfavorable quality or nature of something

    Negative Logits
    "ConstraintMaker"    -0.55
    " NSCoder"           -0.55
    " ivelany"           -0.55
    " duquel"            -0.53
    "absolutely"         -0.51
    "しまいます"         -0.50
    "しまう"             -0.49
    "finally"            -0.49
    "Boring"             -0.49
    "ってしまう"         -0.48
    Positive Logits
    " pleasant"          0.92
    " good"              0.81
    " conducive"         0.72
    " bueno"             0.71
    " welcome"           0.71
    " desirable"         0.71
    " nice"              0.69
    " ideal"             0.68
    " favorable"         0.67
    " bode"              0.66
    Activation Density: 0.255%

    No Known Activations