INDEX
Explanations
expressions and sentiments related to happiness and positivity
New Auto-Interp
Negative Logits
676
-0.16
StartPosition
-0.15
favourable
-0.14
werk
-0.14
","","
-0.13
Xã
-0.13
umpt
-0.13
lisi
-0.13
ãĥ³ãĥĨ
-0.13
hence
-0.13
POSITIVE LOGITS
-go
0.29
happy
0.27
Happy
0.24
happy
0.24
Happy
0.22
endings
0.20
Ending
0.19
HAPP
0.19
/content
0.19
happier
0.18
Activations Density 0.031%