INDEX
    Explanations
    New Auto-Interp
    Negative Logits
    ượu
    -0.06
     perror
    -0.06
    	perror
    -0.06
    bulan
    -0.06
    $fields
    -0.06
    ۱۳
    -0.06
     occured
    -0.06
     рядом
    -0.06
    ();++
    -0.06
    				    
    -0.06
    POSITIVE LOGITS
     Breitbart
    0.07
     Razor
    0.06
     miserable
    0.06
    ait
    0.06
    ewitness
    0.06
     printing
    0.06
    Recipes
    0.06
     surrounds
    0.06
    られて
    0.06
    Yahoo
    0.06
    Act Density 0.003%

    No Known Activations