INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     acess
    -0.07
    _FILTER
    -0.06
     decals
    -0.06
     :-)
    -0.06
    BYTES
    -0.06
     όμως
    -0.06
     Somerset
    -0.06
     BITTE
    -0.06
     ethos
    -0.06
     =======
    -0.06
    POSITIVE LOGITS
     lists
    0.07
    0.07
     навчання
    0.07
    	DB
    0.06
     hack
    0.06
    xlsx
    0.06
     banks
    0.06
     činnost
    0.06
    checks
    0.06
    (loop
    0.06
    Act Density 0.002%

    No Known Activations