INDEX
Explanations
words and phrases associated with excitement or announcements about music and art events
New Auto-Interp
Negative Logits
?");↵
-0.17
?↵↵
-0.17
?"↵↵↵↵
-0.17
:↵↵↵↵
-0.16
:↵↵
-0.16
..↵↵
-0.15
ï¼Ł”
-0.15
,↵↵
-0.14
íĨłíĨł
-0.14
".↵↵↵↵
-0.14
POSITIVE LOGITS
!
0.77
!↵
0.58
!:
0.51
!↵↵
0.51
!I
0.50
!č↵
0.50
!*
0.49
!).
0.48
!!
0.48
!.
0.48
Activations Density 3.085%