INDEX
    Explanations

    phrases indicating adoption, popularity, or acceptance

    phrases indicating activation or engagement with something

    New Auto-Interp
    Negative Logits
    Ĥª
    -0.72
     msec
    -0.61
    Ķ
    -0.60
    BUT
    -0.60
    aurus
    -0.59
    714
    -0.58
    ©¶æ¥µ
    -0.57
    Ĥİ
    -0.54
    cpp
    -0.54
    ¿½
    -0.53
    POSITIVE LOGITS
     behalf
    1.29
    erous
    1.07
    shore
    1.07
    etime
    0.97
    screen
    0.91
    top
    0.86
    eday
    0.83
     board
    0.82
    yx
    0.80
    demand
    0.80
    Act Density 0.083%

    No Known Activations