    Explanations

    expressions related to change or transformation

    Negative Logits
    Token           Logit
    "ety"           -0.71
    "tumblr"        -0.68
    " anecd"        -0.68
    "ital"          -0.68
    "via"           -0.66
    " PLUS"         -0.65
    "ivals"         -0.64
    "xual"          -0.63
    "iga"           -0.62
    "outine"        -0.62
    Positive Logits
    Token           Logit
    " swung"        0.91
    " swinging"     0.83
    " tipping"      0.81
    "inning"        0.78
    "hatt"          0.76
    " peeled"       0.75
    " bowed"        0.75
    "ayers"         0.73
    " tightening"   0.73
    "urned"         0.73
    Activation Density: 0.227%

    No Known Activations