INDEX
    Explanations

    programming and software

    New Auto-Interp
    Negative Logits
    ocop
    -0.07
    Show
    -0.07
    -0.07
    _verify
    -0.06
    _buffers
    -0.06
    YPD
    -0.06
    گار
    -0.06
    
    -0.06
    ्ञ
    -0.06
    ีโ
    -0.06
    POSITIVE LOGITS
    !!
    0.07
    olución
    0.07
     Kostenlose
    0.06
    istencia
    0.06
     exclaimed
    0.06
     robert
    0.06
     Kanunu
    0.06
    .FAIL
    0.06
    \",↵
    0.06
    0.06
    Act Density 0.040%

    No Known Activations