INDEX
Explanations
references to positions or rankings on lists or charts
references to rankings or lists
Negative Logits
nces       -0.87
nce        -0.68
noises     -0.67
uesday     -0.65
maid       -0.62
gery       -0.61
aeus       -0.59
matter     -0.59
girl       -0.59
inous      -0.58
Positive Logits
list       1.34
lists      1.17
hierarchy  1.05
charts     1.04
checklist  1.03
radar      1.03
Lists      1.01
LIST       0.99
blacklist  0.99
rankings   0.98
Activations Density 0.477%