INDEX
Explanations
exclamatory expressions or phrases denoting high emotion or urgency
exclamatory statements and strong emotional reactions
Negative Logits
Token      Logit
atories    -0.78
lining     -0.68
path       -0.67
roup       -0.67
unker      -0.67
ativity    -0.67
ouf        -0.66
outline    -0.65
lings      -0.64
athering   -0.64
Positive Logits
Token   Logit
!!!!!   1.17
!!!     1.13
!!      1.10
:-)     1.00
:)      0.91
9999    0.90
!/      0.90
?!      0.89
!!!!    0.87
@#      0.85
Activations Density: 0.017%