INDEX
Explanations
comparative phrases indicating increasing levels or degrees of a quality or action
comparative phrases highlighting increasing or decreasing complexity or severity
New Auto-Interp
Negative Logits
hof
-0.72
apesh
-0.69
pb
-0.68
inside
-0.67
bart
-0.66
Haw
-0.66
utch
-0.66
ums
-0.65
âĢİ
-0.65
osures
-0.64
POSITIVE LOGITS
chances
0.75
likely
0.69
likelihood
0.69
chance
0.67
amount
0.64
probability
0.62
Dame
0.62
natureconservancy
0.62
temptation
0.62
surely
0.59
Activations Density 0.093%