INDEX
Explanations
positive words or sentiments
expressions of positive sentiment
New Auto-Interp
Negative Logits
spo
-0.65
Brilliant
-0.63
sidel
-0.57
stride
-0.56
Recall
-0.56
crest
-0.55
Lucia
-0.54
ways
-0.54
Chancellor
-0.54
Boko
-0.53
POSITIVE LOGITS
itional
1.75
itions
1.66
itivity
1.50
itionally
1.41
itives
1.37
icion
1.37
itiveness
1.34
ession
1.28
itive
1.28
essed
1.28
Activations Density 0.055%