INDEX
    Explanations

    words related to compassion and empathy

    New Auto-Interp
    Negative Logits
    еÑı
    -0.16
     spur
    -0.15
    773
    -0.14
    pais
    -0.14
    ha
    -0.14
     curb
    -0.14
    tte
    -0.14
     Von
    -0.14
     visual
    -0.14
    irsch
    -0.14
    POSITIVE LOGITS
    AGMENT
    0.15
    èªł
    0.14
    ữ
    0.14
     Passage
    0.14
    åIJ¾
    0.14
    -License
    0.14
    ixa
    0.14
     Ø·ÙĦا
    0.14
    _REUSE
    0.14
    ipple
    0.13
    Act Density 0.005%

    No Known Activations