INDEX
    Explanations

    expressions of agreement or affirmation

    Affirmation or agreement

    New Auto-Interp
    Negative Logits
    MLLoader
    -0.66
    __()
    -0.62
    )*/
    -0.61
     '\\;'
    -0.61
    Datuak
    -0.58
    μως
    -0.58
    }-${
    -0.57
     Alcott
    -0.56
    larak
    -0.56
    клопе
    -0.56
    POSITIVE LOGITS
     yeah
    1.09
    Yeah
    0.99
     Yeah
    0.98
    yeah
    0.91
     sure
    0.89
     YEAH
    0.79
    hhh
    0.77
     Yea
    0.74
    hh
    0.73
    Sure
    0.72
    Act Density 0.045%

    No Known Activations