INDEX
Explanations
occurrences of strong emotional expressions or emphatic phrases
NEGATIVE LOGITS
tremend   -0.74
Downs     -0.71
assemb    -0.70
whistle   -0.69
decomp    -0.67
dispers   -0.65
dirt      -0.65
bearer    -0.64
promul    -0.64
straw     -0.64
POSITIVE LOGITS
į    1.07
Ķ    1.02
ı    1.01
¤    1.00
ł    1.00
¬    0.99
º    0.97
ĸ    0.96
ľ    0.95
Ĥ    0.94
Activations Density 0.045%
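
The density figure above can be read as the share of token positions on which the feature fires. The sketch below illustrates that reading only. It assumes density means "fraction of sampled tokens with activation above zero", which this page does not state. The function name `activation_density`, the `threshold` parameter, and the sample data are hypothetical and are not taken from the dashboard.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of token positions whose activation exceeds threshold.

    Assumption: "density" is the share of sampled tokens on which the
    feature is active; the dashboard itself does not define the term.
    """
    if not activations:
        raise ValueError("activations must be non-empty")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 9 active positions out of 20,000 tokens.
sample = [0.0] * 19_991 + [1.3] * 9
density = activation_density(sample)
print(f"{density:.3%}")
```

Under this reading, a density of 0.045% means the feature is active on roughly 9 of every 20,000 tokens.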