INDEX
Explanations
verbs related to actions or decisions, particularly involving confirmations or expressions
references to political decisions and controversies
New Auto-Interp
Negative Logits
unlucky
-0.73
sucks
-0.71
Veh
-0.66
BET
-0.65
lousy
-0.64
Tes
-0.63
soDeliveryDate
-0.63
haun
-0.63
Experiment
-0.62
Failed
-0.61
POSITIVE LOGITS
publicly
1.22
formally
1.16
specifics
1.09
explicitly
1.09
divul
1.03
commented
1.03
officially
1.00
comment
1.00
osponsors
0.98
disclosed
0.96
Activations Density 0.245%