Explanations
mentions of announcements, pledges, intentions, projects, resignations, or endorsements
phrases indicating announcements or declarations
Negative Logits
anon            -0.71
interrogated    -0.71
PLIED           -0.70
alone           -0.69
countered       -0.67
rhet            -0.64
screamed        -0.62
argued          -0.62
argues          -0.62
abused          -0.61

Positive Logits
impending        1.17
intention        1.15
plans            1.10
intentions       1.10
imminent         1.03
arrival          1.00
retirement       0.98
existence        0.96
formation        0.94
demise           0.94
Activations Density: 0.165%