INDEX
    Explanations

    words related to caring or concern

    expressions of concern or indifference

    Negative Logits
    Logit   Token
    -0.73   "ッド"
    -0.73   "oute"
    -0.68   "覚醒"
    -0.67   "ンジ"
    -0.65   "GV"
    -0.65   "UES"
    -0.65   "jam"
    -0.62   " sclerosis"
    -0.61   " resume"
    -0.61   "adr"
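
    Byte-level BPE vocabularies store non-ASCII tokens as byte-mapped strings; for example, the katakana token ッド is stored as ãĥĥãĥī. The sketch below converts the stored form back to readable text. It assumes a GPT-2 style byte-to-character table, which is an assumption because this page does not name the tokenizer. The names `bytes_to_unicode` and `decode_token` are illustrative, not part of any dashboard API.

```python
def bytes_to_unicode():
    """Build a GPT-2 style byte -> printable character table (assumed convention)."""
    # Bytes that already print cleanly map to themselves.
    bs = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(0xA1, 0xAD))
        + list(range(0xAE, 0x100))
    )
    cs = bs[:]
    n = 0
    # Every remaining byte is shifted into the range starting at U+0100.
    for b in range(256):
        if b not in bs:
            bs.append(b)
            cs.append(256 + n)
            n += 1
    return dict(zip(bs, (chr(c) for c in cs)))


# Inverse table: display character -> original byte value.
CHAR_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def decode_token(display: str) -> str:
    """Turn a byte-mapped token string back into readable text."""
    raw = bytearray()
    for ch in display:
        if ch in CHAR_TO_BYTE:
            raw.append(CHAR_TO_BYTE[ch])
        else:
            # Character is not in the table (e.g. a literal space): pass it through.
            raw.extend(ch.encode("utf-8"))
    # Partial multi-byte tokens are possible, so replace rather than raise.
    return raw.decode("utf-8", errors="replace")


if __name__ == "__main__":
    for token in ["ãĥĥãĥī", "è¦ļéĨĴ", "ãĥ³ãĤ¸", "Ġcared", " sclerosis"]:
        print(repr(token), "->", repr(decode_token(token)))
```

    Under this assumption a leading `Ġ` in a stored token corresponds to a leading space, which is why some tokens in these lists begin with a space.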
    Positive Logits
    Logit   Token
    1.15    "lessly"
    1.15    "taker"
    1.03    " passionately"
    0.99    " cared"
    0.90    "fully"
    0.86    "giving"
    0.81    "bear"
    0.80    "tta"
    0.79    "lessness"
    0.79    "der"
    Activation Density: 0.015%

    No Known Activations