INDEX
Explanations
phrases related to opinions or statements, potentially with emphatic language like "no more" or "hope"
notations indicating the absence of conflict or moderation
NEGATIVE LOGITS
igent         -0.87
Enlarge       -0.85
ashington     -0.83
estine        -0.80
typically     -0.78
obook         -0.78
occup         -0.77
associated    -0.77
Supported     -0.76
edIn          -0.75
POSITIVE LOGITS
surprises     1.16
booze         1.11
explosions    1.10
hugs          1.10
fireworks     1.10
roses         1.08
spoilers      1.08
applause      1.07
goodies       1.05
excuses       1.03
Activations Density: 0.563%