INDEX
Explanations
statements about efforts, actions, or initiatives undertaken to address certain situations or achieve specific goals
Negative Logits
Proud      -0.79
atoon      -0.76
prints     -0.74
ceived     -0.71
Recorded   -0.71
listed     -0.71
hot        -0.70
Hate       -0.69
famous     -0.68
laughter   -0.68
Positive Logits
curb         1.41
avert        1.40
bolster      1.34
stabilize    1.33
stimulate    1.30
alleviate    1.29
mitigate     1.29
counteract   1.26
reduce       1.26
tackle       1.24
Activations Density: 0.273%