INDEX
    Explanations

    references to caregiving and familial responsibilities

    Negative Logits
    Token          Logit
    "opoulos"      -0.15
    "_escape"      -0.14
    "chalk"        -0.14
    " vyj"         -0.13
    "ABCDEFGHI"    -0.13
    "éĻĦ"          -0.13
    "Ø·ÙĦ"         -0.13
    "798"          -0.13
    "à¤¾à¤Ł"       -0.13
    "æ³ķ人"        -0.13
    Positive Logits
    Token          Logit
    " care"        0.67
    " caring"      0.58
    " cared"       0.54
    " cares"       0.50
    " Care"        0.50
    "care"         0.49
    "-care"        0.48
    " cuid"        0.47
    "Care"         0.47
    " caret"       0.40
    Activation Density: 0.298%

    No Known Activations