    NEGATIVE LOGITS
    " IBase"        -0.07
    "IBA"           -0.07
    " વખતે"         -0.07
    " Noch"         -0.07
    " BAS"          -0.07
    " yup"          -0.07
    " mezelf"       -0.07
    " mandatory"    -0.07
    " Mayo"         -0.07
    " jie"          -0.07
    POSITIVE LOGITS
    "Winner"        0.08
    " triv"         0.08
    " minima"       0.08
    "-winning"      0.08
    " ced"          0.08
    " csal"         0.07
    " tempered"     0.07
    " Michael"      0.07
    "restrial"      0.07
    " winner"       0.07
    Activation Density: 0.002%

    No Known Activations