INDEX
Explanations
phrases related to conflicts or disagreements
references to likelihoods or probabilities
New Auto-Interp
Negative Logits
activate
-0.63
LC
-0.63
Gas
-0.62
Produ
-0.61
san
-0.61
unit
-0.60
activated
-0.60
orate
-0.59
cle
-0.59
gment
-0.59
POSITIVE LOGITS
odds
4.37
chances
2.16
probabilities
1.71
Odd
1.63
likelihood
1.58
probability
1.49
dds
1.43
bets
1.33
chance
1.13
risks
1.11
Activations Density 0.006%