INDEX
    Explanations

    words related to progress or coming together

    New Auto-Interp
    Negative Logits
    ttes
    -0.72
     takeaway
    -0.67
     aver
    -0.67
     concession
    -0.66
     kinderg
    -0.64
     stakes
    -0.64
     eru
    -0.62
     wart
    -0.61
    haul
    -0.61
     Achievement
    -0.61
    POSITIVE LOGITS
    endment
    1.32
    sterdam
    1.27
    ethyst
    1.24
    nesty
    1.22
    bitious
    1.14
    essage
    1.11
    ajor
    1.07
    ateur
    1.07
    ilies
    1.05
    ulet
    1.02
    Act Density 0.680%

    No Known Activations