INDEX
Explanations
phrases related to negative emotions, interpersonal conflict, and reactions
expressions of strong emotions or feelings, particularly negative ones
New Auto-Interp
Negative Logits
adoes
-0.61
treaties
-0.60
billions
-0.59
schedule
-0.58
Thumbnail
-0.58
Presidents
-0.55
basics
-0.55
soDeliveryDate
-0.54
monarchy
-0.54
pioneered
-0.54
POSITIVE LOGITS
Another
1.14
Another
1.07
another
1.02
another
0.96
him
0.91
his
0.86
He
0.84
Someone
0.81
he
0.80
Someone
0.76
Activations Density 0.480%