INDEX
Explanations
announcements or statements of commitment
phrases that indicate official announcements or declarations
New Auto-Interp
Negative Logits
ecd
-0.82
Explan
-0.74
perception
-0.72
understandable
-0.69
contrasts
-0.68
inexper
-0.67
adject
-0.66
brightness
-0.64
eloqu
-0.64
sophistication
-0.64
POSITIVE LOGITS
indeed
0.84
officially
0.78
partnered
0.70
iatus
0.70
embark
0.70
soon
0.69
finally
0.69
postpone
0.68
retiring
0.67
hiatus
0.65
Activations Density 0.441%