INDEX
    Explanations

    expressions of excitement or enthusiasm related to performances and events

    New Auto-Interp
    Negative Logits
    **********/
    -0.75
    Demikian
    -0.71
     }?>
    -0.71
     terrific
    -0.70
     ")");
    -0.69
     Ανακτήθηκε
    -0.69
    Herzliche
    -0.67
    прочем
    -0.64
     daß
    -0.64
    AFX
    -0.63
    POSITIVE LOGITS
     like
    1.18
     kind
    1.02
     kinda
    0.96
     yeah
    0.84
     Like
    0.81
     sort
    0.80
    0.80
    Like
    0.78
     sorta
    0.78
    kind
    0.76
    Act Density 0.222%

    No Known Activations