INDEX
Explanations
phrases related to instructions or procedures
affirmative statements about people's actions or state of being
New Auto-Interp
Negative Logits
igraph
-0.65
icy
-0.64
ESE
-0.62
Discuss
-0.62
ilts
-0.60
airs
-0.60
)]
-0.59
ACP
-0.59
originate
-0.58
uca
-0.57
POSITIVE LOGITS
yourself
0.90
yourselves
0.89
probably
0.85
Tube
0.84
doubtless
0.82
Yourself
0.80
guessed
0.80
probably
0.80
mileage
0.79
forgiven
0.77
Activations Density 0.237%