INDEX
Explanations
expressions of collective sentiment and personal connection
Positive sentiment/emotion words
express positive feelings
Negative Logits
Token            Logit
]--;             -0.60
transfieras      -0.57
featureID        -0.55
();)             -0.55
дописавши        -0.54
uxxxx            -0.54
WriteTagHelper   -0.53
newOwner         -0.50
leaſt            -0.50
newBuilder       -0.50
Positive Logits
Token            Logit
proud            1.09
glad             1.07
pleased          1.01
thankful         0.93
happy            0.92
delighted        0.92
extremely        0.90
grateful         0.90
honored          0.89
very             0.85
Activations Density 0.120%
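
A density figure like the one above is usually read as the fraction of token positions on which the feature fires. The page does not state its exact definition, so the sketch below is only one plausible reading: the function name `activation_density`, the zero threshold, and the sample data are all hypothetical, chosen purely for illustration.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" means the share of positions where the feature is
    active; the dashboard itself does not spell out its definition.
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 token positions, 12 of which activate the feature.
sample = [0.0] * 9988 + [1.5] * 12
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.120%
```

Under this reading, a density of 0.120% would mean the feature is active on roughly 12 of every 10,000 tokens.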