INDEX
    Explanations

    thumbs-related phrases like "thumbs up" and "thumbs down"

    references to gestures of approval or disapproval, specifically "thumbs up" and "thumbs down."

    Negative Logits
    Token            Logit
    " Invasion"      -0.70
    " Tale"          -0.69
    "£ı"             -0.68
    "enario"         -0.68
    "lain"           -0.65
    "anny"           -0.65
    "arian"          -0.63
    " sheltered"     -0.63
    " tracing"       -0.63
    "olate"          -0.63
    Positive Logits
    Token              Logit
    " thumbs"          3.92
    "umbs"             1.61
    " cheers"          1.36
    " majorities"      0.94
    " disabilities"    0.92
    " enthusiastic"    0.89
    " smiles"          0.89
    "votes"            0.86
    " nods"            0.85
    " congr"           0.83
    Act Density 0.034%

    No Known Activations