INDEX
    Explanations

    thoughts, feelings, and emotions related to personal experiences and interactions

    expressions of personal feelings and experiences related to challenges and resilience

    New Auto-Interp
    Negative Logits
     Regions
    -0.68
    ussen
    -0.67
    Indeed
    -0.67
    isites
    -0.66
     respectively
    -0.66
    ¥ŀ
    -0.65
     Cosponsors
    -0.62
     Advis
    -0.62
    catentry
    -0.61
     unsurprisingly
    -0.61
    POSITIVE LOGITS
     myself
    1.69
     anymore
    1.23
     my
    1.13
     somebody
    0.95
     someday
    0.94
     fuckin
    0.90
     anybody
    0.89
     forever
    0.84
    puter
    0.83
     naked
    0.82
    Act Density 0.428%

    No Known Activations