    Negative Logits
    " valida"      -0.07
    "\tID"         -0.07
    " Swords"      -0.06
    "Dialog"       -0.06
    " August"      -0.06
    " Spell"       -0.06
    " NES"         -0.06
    " Readonly"    -0.06
    "_NETWORK"     -0.06
    "InSeconds"    -0.06
    Positive Logits
    "calar"        0.07
    " bát"         0.07
    " مشکل"        0.07
    " DSM"         0.07
    " standart"    0.06
                   0.06
    " pragma"      0.06
    " práv"        0.06
    " ENV"         0.06
    "\thr"         0.06
    Act Density 0.011%

    No Known Activations