INDEX
Explanations
exclamatory expressions indicating strong emotions or emphasis
exclamatory punctuation, indicating strong emotions or excitement
New Auto-Interp
Negative Logits
atories
-0.81
atory
-0.71
reen
-0.70
unker
-0.70
aldi
-0.70
athering
-0.69
ativity
-0.68
asser
-0.67
brance
-0.66
ator
-0.65
POSITIVE LOGITS
!!!!!
1.18
!!!
1.12
!!
1.10
?!
1.03
@#
1.00
??
0.97
!!!!
0.93
@#&
0.89
?:
0.87
:-)
0.86
Activations Density 0.019%