    Negative Logits
    " PRODUCTS"        -0.07
    " favor"           -0.07
    "іння"             -0.06
    "aisy"             -0.06
    " patron"          -0.06
    "frames"           -0.06
    " companies"       -0.06
    "USAGE"            -0.06
    "-around"          -0.06
                       -0.06
    Positive Logits
    "\tstat"           0.07
    "ังม"               0.07
    " prosecuted"      0.07
    " genu"            0.07
    " masturbation"    0.07
    "plugins"          0.06
    " cafes"           0.06
    " وقت"             0.06
    " configparser"    0.06
    " Took"            0.06
    Activation Density: 0.002%

    No Known Activations