INDEX
    Explanations

    references to the self or personal interactions

    the pronoun "me" in various contexts, indicating a focus on personal identity and self-perception

    New Auto-Interp
    Negative Logits
    ulton
    -0.69
    atlantic
    -0.68
    emic
    -0.68
    naissance
    -0.65
    etheus
    -0.64
    iens
    -0.63
    ories
    -0.63
    Atlantic
    -0.63
    -)
    -0.63
    icion
    -0.63
    POSITIVE LOGITS
    adows
    0.86
     personally
    0.83
    imei
    0.81
    selves
    0.76
    adow
    0.75
     verbally
    0.73
    atic
    0.73
    self
    0.72
     uncond
    0.72
    zzo
    0.72
    Act Density 0.183%

    No Known Activations