INDEX
    Explanations
    Negative Logits
    Token               Logit
    "	top"              -0.07
    "ipp"               -0.07
    "fight"             -0.07
    "azzi"              -0.06
    "terror"            -0.06
    " abol"             -0.06
    "чини"              -0.06
    "emploi"            -0.06
    " città"            -0.06
    "loit"              -0.06
    Positive Logits
    Token               Logit
    "ln"                0.09
    ".redis"            0.07
    " bắc"              0.07
    ".createStatement"  0.07
    " refining"         0.07
    " accountant"       0.07
    "(ln"               0.06
    "rooms"             0.06
    ".minutes"          0.06
    " assaulting"       0.06
    Act Density: 0.002%

    No Known Activations