INDEX
Explanations
adjectives indicating intensity or severity
New Auto-Interp
Negative Logits
ften
-0.87
ologue
-0.86
undrum
-0.85
ittee
-0.83
hander
-0.82
washer
-0.80
hyde
-0.78
orah
-0.77
agonist
-0.76
ressor
-0.75
POSITIVE LOGITS
amounts
1.45
quantities
1.36
doses
1.23
versions
1.20
levels
1.19
situations
1.16
relationships
1.12
intervals
1.11
considerations
1.11
proportions
1.10
Activations Density 1.849%