INDEX
    Explanations

    phrases indicating an action or impact

    phrases indicating actions or states of being

    New Auto-Interp
    Negative Logits
     Columb
    -0.64
    rones
    -0.63
    eteenth
    -0.63
    igor
    -0.62
    comings
    -0.61
    Cub
    -0.60
    ston
    -0.59
    estern
    -0.58
    CLUD
    -0.58
    locked
    -0.57
    POSITIVE LOGITS
     create
    1.22
     educate
    1.16
     revise
    1.15
     remove
    1.14
     shorten
    1.12
     reduce
    1.12
     introduce
    1.11
     elevate
    1.10
     minimize
    1.10
     simplify
    1.10
    Act Density 0.163%

    No Known Activations