INDEX
    Explanations

    mentions of lists or items being on a list

    references related to lists and rankings

    New Auto-Interp
    Negative Logits
     prevailed
    -0.70
    wright
    -0.69
     apprehend
    -0.67
    nce
    -0.63
     situated
    -0.62
    Allows
    -0.62
    idian
    -0.61
    say
    -0.60
     wrest
    -0.59
    icz
    -0.59
    POSITIVE LOGITS
     list
    1.92
     lists
    1.47
     List
    1.46
    list
    1.38
     blacklist
    1.31
     checklist
    1.29
     Lists
    1.29
     LIST
    1.25
    LIST
    1.24
    List
    1.23
    Act Density 0.263%

    No Known Activations