Explanations
statements expressing feelings or opinions, often with words like "happy", "disappointing", and "pleased"
expressions of emotions and opinions
Negative Logits
Token         Logit
ワン          -0.77
apeshifter    -0.74
definition    -0.73
spec          -0.71
武            -0.66
ixt           -0.66
esm           -0.66
arsen         -0.65
defined       -0.64
illac         -0.64
Positive Logits
Token         Logit
glad          1.44
thankful      1.39
thanking      1.31
applaud       1.29
congratulate  1.27
pity          1.27
hope          1.25
rejoice       1.24
grateful      1.24
hopeful       1.22
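
The page does not say how these logit lists are produced. A common approach, assumed here rather than confirmed by the source, is to project the feature's output direction onto each token's unembedding vector and then list the most negative and most positive results. The sketch below illustrates that idea with toy data; `logit_effects`, `top_logits`, and all vectors and tokens in the example are hypothetical.

```python
def logit_effects(direction, unembed):
    """Dot product of a feature direction with each token's unembedding vector.

    Assumption: `direction` is the feature's output direction and `unembed`
    maps each token string to its unembedding vector. Neither name comes
    from the source page.
    """
    return {
        token: sum(d * w for d, w in zip(direction, vector))
        for token, vector in unembed.items()
    }


def top_logits(effects, k=10):
    """Return (k most negative, k most positive) as lists of (token, value)."""
    ranked = sorted(effects.items(), key=lambda item: item[1])
    negative = ranked[:k]          # ascending, so most negative first
    positive = ranked[::-1][:k]    # descending, so most positive first
    return negative, positive


# Toy 2-dimensional example (hypothetical values, not taken from the page).
toy_direction = [1.0, -0.5]
toy_unembed = {
    "glad": [1.0, 0.0],
    "spec": [0.0, 1.0],
    "the": [0.5, 1.0],
}
toy_effects = logit_effects(toy_direction, toy_unembed)
toy_negative, toy_positive = top_logits(toy_effects, k=1)
```

With real model weights the same two steps would run over the whole vocabulary, and `k=10` would match the ten entries shown in each list.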
Activations Density 0.720%
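
Activation density is usually read as the fraction of sampled tokens on which the feature fires. That reading is an assumption here, since the page does not define the term. Under it, 0.720% corresponds to roughly 72 active tokens per 10,000. The helper below is a minimal sketch; `activation_density` and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as "active" when its activation is strictly
    greater than the threshold (0.0 by default).
    """
    if not activations:
        raise ValueError("activations must not be empty")
    active = sum(1 for value in activations if value > threshold)
    return active / len(activations)


# Hypothetical sample: 72 active tokens out of 10,000.
sample = [1.3] * 72 + [0.0] * 9928
density = activation_density(sample)
density_label = f"{density:.3%}"
```

A density this low suggests the feature is sparse: it stays silent on the large majority of tokens and fires mainly on the emotional and evaluative wording described in the explanations.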