INDEX
    Explanations

    words related to transformation or evolution

    variations of the word "become."

    Negative Logits
    Token        Logit
    "mble"       -0.71
    " rug"       -0.71
    "tsky"       -0.66
    " uphill"    -0.64
    "DER"        -0.63
    " TN"        -0.58
    " antic"     -0.58
    " belt"      -0.58
    " Mankind"   -0.58
    " Gong"      -0.58
    Positive Logits
    Token        Logit
    "oming"      1.09
    "leans"      1.07
    "bec"        1.03
    "uity"       0.99
    "zek"        0.95
    "isons"      0.93
    "racuse"     0.90
    "imil"       0.87
    "clair"      0.87
    "uous"       0.85
    Activation Density: 0.004%

    No Known Activations