    Explanations

    verbs related to making modifications or adjustments

    instances of the word "change."

    Negative Logits
    Token           Logit
    "ç«"            -0.84
    "stra"          -0.72
    "mination"      -0.70
    "alty"          -0.67
    "amina"         -0.66
    "SourceFile"    -0.64
    " AFB"          -0.63
    " DRAGON"       -0.63
    "Zip"           -0.62
    "sie"           -0.62
    Positive Logits
    Token             Logit
    " hands"          0.75
    " tack"           0.72
    " perceptions"    0.71
    "imedia"          0.70
    " drastically"    0.69
    " radically"      0.69
    " alliances"      0.69
    " dramatically"   0.69
    "esty"            0.68
    " lighting"       0.67
    Activation Density: 0.044%

    No Known Activations