INDEX
Explanations
words related to positive feelings or attitudes
terms related to positivity and positive sentiment
New Auto-Interp
Negative Logits
spo
-0.69
opsy
-0.69
Brilliant
-0.67
Lucia
-0.67
Hearts
-0.66
loo
-0.64
Clarkson
-0.63
Wiggins
-0.60
Emerson
-0.59
STEP
-0.59
POSITIVE LOGITS
itional
1.59
itions
1.48
itivity
1.40
icion
1.25
idon
1.25
itives
1.22
itionally
1.18
itiveness
1.18
ited
1.16
itive
1.14
Activations Density 0.023%