INDEX
    Explanations

    phrases introducing a statement or emphasizing an upcoming point

    expressions indicating clarification or emphasis

    New Auto-Interp
    Negative Logits
    olving
    -0.69
    sbm
    -0.68
    hler
    -0.61
    cemic
    -0.60
    Loading
    -0.55
    nih
    -0.55
     resembled
    -0.54
     manuals
    -0.54
     Unsure
    -0.54
    eller
    -0.54
    POSITIVE LOGITS
     unequivocally
    0.99
     emphatically
    0.94
     tonight
    0.92
     congratulations
    0.91
     briefly
    0.90
     thank
    0.90
    thank
    0.89
     myself
    0.88
     upfront
    0.85
     something
    0.85
    Act Density 0.149%

    No Known Activations