INDEX
    Explanations

    conversations involving empathy and understanding of feelings

    New Auto-Interp
    Negative Logits
    bono
    -0.60
    مصادر
    -0.57
    copg
    -0.56
     control
    -0.55
    FontStyle
    -0.54
    rrggbb
    -0.53
    Diwedd
    -0.53
    ashier
    -0.52
    Literals
    -0.52
    nissen
    -0.52
    POSITIVE LOGITS
    ArgsConstructor
    0.64
     compréhen
    0.60
     understandable
    0.57
    难怪
    0.57
     légitime
    0.56
     understandably
    0.55
     normaux
    0.54
    Kjelder
    0.53
     hjemme
    0.52
     why
    0.50
    Act Density 0.290%

    No Known Activations