INDEX
Explanations
quotations, particularly those emphasizing strong opinions or beliefs
quotation marks and dialogue
Negative Logits
Armenian        -0.80
Closing         -0.72
Ange            -0.71
tabloid         -0.71
ERY             -0.71
daylight        -0.70
consequential   -0.69
Daylight        -0.69
Anthropology    -0.69
constructive    -0.68
Positive Logits
could    1.61
would    1.56
should   1.51
had      1.49
didn     1.48
doesn    1.43
did      1.42
saw      1.40
must     1.38
might    1.38
Activations Density 0.136%