Explanations
phrases indicating strong emotional reactions, especially with negative connotations
occurrences of ellipses in the text
Negative Logits
Newsletter              -0.80
[soft hyphen, U+00AD]   -0.78
\(\                     -0.63
antip                   -0.59
fluorescent             -0.53
staffers                -0.52
living                  -0.49
hypothetical            -0.49
icio                    -0.49
Gleaming                -0.49
Positive Logits
..          3.38
..          2.55
......      2.31
....        2.23
.......     2.22
.........   2.18
.....       2.17
........    2.01
…..         1.97
...         1.90
Activations Density 0.008%
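
As a rough illustration of what a density figure like this might mean, the sketch below treats activation density as the fraction of token positions on which the feature's activation is above zero. That definition, the function name `activation_density`, the `threshold` parameter, and the sample data are all assumptions made for illustration; the page itself does not state how the value is computed.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of positions whose activation exceeds `threshold`.

    Assumed definition: density = (active positions) / (all positions).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical data: the feature fires on 8 of 100,000 token positions.
sample = [0.0] * 99_992 + [1.5] * 8
density = activation_density(sample)
print(f"{density:.3%}")  # prints 0.008%
```

Under this reading, a density of 0.008% would correspond to a very sparse feature that fires on roughly 8 tokens per 100,000.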