INDEX
    Explanations

    items listed or mentioned sequentially

    items or concepts related to lists, categories, and organizational structures

    New Auto-Interp
    Negative Logits
     arresting
    -0.63
    ãĥ©ãĥ³
    -0.62
    akin
    -0.61
     inflicting
    -0.58
    ÃŃn
    -0.57
     imposing
    -0.57
    iaz
    -0.57
    angering
    -0.57
     foreign
    -0.56
     fearing
    -0.54
    POSITIVE LOGITS
     varies
    1.18
     revolves
    1.14
     consists
    1.05
     boils
    1.03
     depends
    1.01
     tends
    0.98
     differs
    0.96
     reminds
    0.96
     sucks
    0.94
     :)
    0.93
    Act Density 0.460%

    No Known Activations