INDEX
    Explanations

    phrases related to joining a trend or movement

    phrases emphasizing the word "the."

    New Auto-Interp
    Negative Logits
    irection
    -0.67
    bear
    -0.66
    atically
    -0.66
    afia
    -0.64
    isin
    -0.64
    upon
    -0.63
    cade
    -0.63
     suppose
    -0.63
    thood
    -0.62
    usa
    -0.61
    POSITIVE LOGITS
     heels
    1.14
     bandwagon
    1.10
     shoulders
    1.02
     treadmill
    0.97
     doorstep
    0.95
     ledge
    0.93
     porch
    0.88
     pedest
    0.88
     sidelines
    0.84
     balcony
    0.83
    Act Density 0.144%

    No Known Activations