INDEX
Explanations
exclamation marks and expressions of excitement or emphasis
Exclamations
end of emphatic clauses
New Auto-Interp
Negative Logits
ede
-0.86
gdx
-0.82
Rump
-0.79
ation
-0.79
iNdEx
-0.79
[`
-0.72
Wetter
-0.71
Rés
-0.71
aure
-0.70
}`).
-0.69
POSITIVE LOGITS
?!?
1.42
?!?!
1.25
!!!!!!!
1.09
%!
1.07
!!!!!!
1.07
!
1.07
!!!!!!!!!!
1.03
~!
0.96
!
0.95
!!!!!!!!
0.94
Activations Density 0.076%