INDEX
Explanations
specific references that start with the letter 'W' and are followed by a number
occurrences of a specific character or letter
New Auto-Interp
Negative Logits
gratification
-0.68
succeeding
-0.68
enclosed
-0.67
afore
-0.67
Prelude
-0.66
arial
-0.65
headache
-0.65
bottleneck
-0.65
Malfoy
-0.63
feudal
-0.62
POSITIVE LOGITS
ITNESS
1.30
INGS
1.24
ITCH
1.22
OW
1.19
OOD
1.17
edge
1.16
ipe
1.16
ILD
1.14
ORD
1.13
ALK
1.13
Activations Density 0.044%