    Explanations

    words related to changes or differences

    mentions of change, particularly in a variety of contexts

    NEGATIVE LOGITS
    soever    -0.75
    EEEE      -0.66
    rs        -0.64
    linger    -0.64
    lynn      -0.63
    WER       -0.60
    Nat       -0.59
    Wire      -0.59
    OHN       -0.58
    IDE       -0.58
    POSITIVE LOGITS
    effic       1.13
    ordinate    1.12
     relation   1.11
    efficiency  1.10
     regards    1.06
    humane      1.03
    between     1.02
     favor      1.02
    animate     0.99
    clusions    0.99
    Act Density 0.168%

    No Known Activations