INDEX
Explanations
instances where agreements or commitments are made
phrases indicating agreements or commitments
Negative Logits
ults        -0.81
clips       -0.72
grain       -0.71
laughter    -0.68
bars        -0.65
////////////////////////////////  -0.65
echo        -0.62
geared      -0.62
hots        -0.62
icularly    -0.62
Positive Logits
waive        1.09
abide        1.03
accept       1.03
cooperate    1.03
obey         1.01
submit       0.97
participate  0.97
settle       0.96
surrender    0.95
comply       0.94
Activations Density 0.851%