INDEX
Explanations
words and phrases expressing positive emotions or excitement
Negative Logits
rame           -0.77
landfill       -0.70
arij           -0.69
flawed         -0.68
faulty         -0.67
lder           -0.67
detrimental    -0.64
restrictive    -0.64
issors         -0.63
restraining    -0.62
Positive Logits
iously         0.99
delight        0.93
ecstatic       0.90
delighted      0.87
anticipation   0.86
exclaim        0.85
grin           0.85
gle            0.83
eagerly        0.81
praise         0.80
Activations Density: 0.049%