INDEX
    Explanations

    expressions related to self-awareness and introspection

    New Auto-Interp
    Negative Logits
     META
    -0.06
    _EVAL
    -0.06
    ENSE
    -0.06
     ********************************************************
    -0.06
    rades
    -0.06
    iais
    -0.06
    atta
    -0.06
     destinationViewController
    -0.06
    Utc
    -0.06
    insky
    -0.06
    POSITIVE LOGITS
     already
    0.07
    ucks
    0.07
    ivan
    0.06
    umas
    0.06
    orno
    0.06
    fec
    0.06
    eyond
    0.06
     ÙĦÙħ
    0.06
    ož
    0.06
    rottle
    0.06
    Act Density 0.021%

    No Known Activations