INDEX
    Explanations

    concepts related to reality checks and self-awareness in various contexts

    Negative Logits
    Token             Logit
    "ใจ"              -0.45
    " ģ"              -0.45
    "dington"         -0.43
    " referenties"    -0.42
    "Jeografia"       -0.42
    "pitch"           -0.41
    "telli"           -0.41
    " Fein"           -0.41
    "expandindo"      -0.41
    " banderas"       -0.41
    Positive Logits
    Token             Logit
    " reality"        1.45
    "Reality"         1.30
    " Reality"        1.24
    "reality"         1.23
    " realism"        1.18
    " realities"      1.13
    " realista"       1.07
    " realist"        1.06
    " realistic"      1.06
    " grounded"       0.99
    Activation Density: 0.163%

    No Known Activations