INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    _special
    -0.07
    -0.06
    _shapes
    -0.06
    -0.06
    305
    -0.06
    burgh
    -0.06
    ните
    -0.06
    help
    -0.06
     tema
    -0.05
    _fragment
    -0.05
    POSITIVE LOGITS
    _experience
    0.07
    0.07
    0.07
     TL
    0.07
     coraz
    0.07
     ikt
    0.06
     resident
    0.06
    اوت
    0.06
     centrif
    0.06
     kancel
    0.06
    Act Density 0.035%

    No Known Activations