INDEX
Explanations
mentions of lists or items being on a list
references related to lists and rankings
New Auto-Interp
Negative Logits
prevailed
-0.70
wright
-0.69
apprehend
-0.67
nce
-0.63
situated
-0.62
Allows
-0.62
idian
-0.61
say
-0.60
wrest
-0.59
icz
-0.59
POSITIVE LOGITS
list
1.92
lists
1.47
List
1.46
list
1.38
blacklist
1.31
checklist
1.29
Lists
1.29
LIST
1.25
LIST
1.24
List
1.23
Activations Density 0.263%