INDEX
    Explanations

    concepts related to self-awareness and personal growth

    expressions of self-awareness and self-identity.

    New Auto-Interp
    Negative Logits
    }));
    
    -0.78
    ()]);
    -0.76
    ]));
    
    -0.74
    })]
    -0.72
     })}
    -0.70
    ']);
    
    -0.70
    ")));
    
    -0.70
    $​
    -0.69
    '));
    
    -0.69
    ))]
    -0.68
    POSITIVE LOGITS
     self
    1.26
    self
    1.18
     Self
    1.18
    Self
    1.17
     SELF
    1.07
    SELF
    1.06
     selves
    0.91
     själv
    0.73
     Selbst
    0.73
    selbst
    0.73
    Act Density 0.217%

    No Known Activations