INDEX
    Explanations

    instances of self-reflection or introspection

    New Auto-Interp
    Negative Logits
    eyer
    -0.15
    zel
    -0.15
    ÑĮ
    -0.14
    imet
    -0.14
     док
    -0.13
    ableView
    -0.13
    RL
    -0.13
    indi
    -0.13
    kart
    -0.13
    iment
    -0.12
    POSITIVE LOGITS
    adesh
    0.15
     Pis
    0.15
     Homo
    0.14
    anter
    0.14
     زÛĮ
    0.14
    omic
    0.14
    DMIN
    0.14
     Cosmic
    0.13
     Fuller
    0.13
    msgid
    0.13
    Act Density 0.000%

    No Known Activations