INDEX
Explanations
instances involving a comparison of actions or behaviors against certain standards
phrases indicating uncertainty or ambivalence
New Auto-Interp
Negative Logits
_.
-0.66
à¼
-0.61
%.
-0.58
.$
-0.57
$.
-0.55
âĢķ
-0.53
aca
-0.52
,.
-0.52
!,
-0.52
().
-0.51
POSITIVE LOGITS
favourable
0.55
ested
0.55
adjusting
0.54
healed
0.54
recovering
0.51
parity
0.51
uned
0.51
recover
0.51
acquainted
0.50
accountable
0.49
Activations Density 1.164%