INDEX
    Explanations

    phrases related to caring, concern, and interest

    the concept of care and concern in various contexts

    New Auto-Interp
    Negative Logits
    ross
    -0.78
    hiba
    -0.77
    BuyableInstoreAndOnline
    -0.72
    Lay
    -0.72
    aurus
    -0.71
    MX
    -0.70
    cession
    -0.69
    Paper
    -0.68
    zynski
    -0.67
    SPONSORED
    -0.66
    POSITIVE LOGITS
     preserving
    0.99
     improving
    0.88
     aesthetics
    0.85
     fairness
    0.83
     maximizing
    0.81
     respecting
    0.80
     integrity
    0.78
    ĺħ
    0.78
     protecting
    0.77
     politics
    0.77
    Act Density 0.053%

    No Known Activations