INDEX
    Explanations

    phrases portraying a sense of totality or completion

    references to the concept of "all" or totality

    New Auto-Interp
    Negative Logits
    yip
    -0.75
    hov
    -0.68
    raid
    -0.63
     Remastered
    -0.63
    Led
    -0.62
    gypt
    -0.62
    nell
    -0.62
     nowhere
    -0.61
    deck
    -0.61
    lav
    -0.60
    POSITIVE LOGITS
     traces
    1.07
    usions
    1.06
     sorts
    1.02
     semblance
    1.01
    ude
    1.00
    uding
    0.98
     kinds
    0.97
    ocating
    0.93
    udes
    0.92
     else
    0.91
    Act Density 0.074%

    No Known Activations