    Explanations

    expressions of collective sentiment and personal connection

    Positive sentiment/emotion words

    express positive feelings

    Negative Logits
    ]--;              -0.60
     transfieras      -0.57
    featureID         -0.55
    ();)              -0.55
     дописавши        -0.54
    uxxxx             -0.54
    WriteTagHelper    -0.53
     newOwner         -0.50
     leaſt            -0.50
     newBuilder       -0.50
    Positive Logits
     proud            1.09
     glad             1.07
     pleased          1.01
     thankful         0.93
     happy            0.92
     delighted        0.92
     extremely        0.90
     grateful         0.90
     honored          0.89
     very             0.85
    Activation Density: 0.120%

    No Known Activations