INDEX
    Explanations

    sentences that express health issues and physical discomfort

    Negative Logits
    Token           Logit
    "bigoplus"      -0.54
    " inasmuch"     -0.53
    " deoarece"     -0.48
    "Moreover"      -0.48
    "bigsqcup"      -0.46
    "odotus"        -0.45
    "ต่อไป"          -0.42
    " poiché"       -0.41
    " notably"      -0.41
    "tably"         -0.41
    Positive Logits
    Token           Logit
    " gotta"        1.10
    " got"          1.08
    " Been"         1.06
    " gonna"        1.05
    " Need"         1.01
    " Got"          1.01
    "Been"          0.98
    "Gonna"         0.98
    " need"         0.95
    " Took"         0.95
    Act Density 0.234%

    No Known Activations