INDEX
    Explanations

    concepts related to reality versus perception

    New Auto-Interp
    Negative Logits
    idopsis
    -0.77
    Alban
    -0.72
    rąg
    -0.67
    chiato
    -0.63
    PON
    -0.62
     electrónico
    -0.58
    gebracht
    -0.57
    mallows
    -0.57
     Morde
    -0.56
    bench
    -0.56
    POSITIVE LOGITS
     Reality
    1.38
     reality
    1.35
    Reality
    1.33
     realities
    1.29
    reality
    1.24
    ]';
    1.12
     الواقع
    1.08
     realidade
    0.98
    )";
    
    0.96
     realty
    0.95
    Act Density 0.079%

    No Known Activations