INDEX
Explanations
imperatives or directives
exclamatory statements or punctuation indicating strong emotion or surprise
New Auto-Interp
Negative Logits
matically
-0.75
oria
-0.74
ching
-0.72
eling
-0.72
der
-0.71
ning
-0.69
med
-0.67
bor
-0.67
washing
-0.67
otto
-0.67
POSITIVE LOGITS
CLASSIFIED
0.89
?!
0.86
#$
0.85
Anyway
0.83
;)
0.82
Cohn
0.81
ðŁĻĤ
0.76
:-)
0.74
!)
0.73
Marino
0.71
Activations Density 0.016%