INDEX
    Explanations

    the term "list"

    instances of the word "list" and its variations

    New Auto-Interp
    Negative Logits
     Aber
    -0.62
     life
    -0.62
     lat
    -0.62
     Gore
    -0.60
     Metallic
    -0.59
     parchment
    -0.59
     Wildcats
    -0.59
     DH
    -0.58
     farm
    -0.57
     newfound
    -0.57
    POSITIVE LOGITS
    list
    1.17
    ening
    0.93
    witz
    0.93
    eners
    0.91
    lists
    0.88
    ener
    0.87
    ing
    0.86
    LIST
    0.86
    abet
    0.85
    erv
    0.85
    Act Density 0.006%

    No Known Activations