INDEX
    Explanations

    positive encouragement and support

    New Auto-Interp
    Negative Logits
     erotic
    0.94
     ubiqu
    0.92
     obliquely
    0.92
     orthogon
    0.90
    Fuck
    0.90
     popularized
    0.86
     seductive
    0.85
     hematopoietic
    0.85
     involuntarily
    0.84
     immutable
    0.84
    POSITIVE LOGITS
     fantastic
    1.07
     teamwork
    0.97
     positive
    0.94
    fantastic
    0.94
     wonderful
    0.92
     профессиона
    0.91
     everyone
    0.90
     professionalism
    0.90
     positivity
    0.89
    大変
    0.87
    Act Density 0.485%

    No Known Activations