INDEX
    Explanations
    Negative Logits
        Token                  Logit
        " tattoos"             -0.09
        " triumph"             -0.09
        "Congrats"             -0.08
        " casas"               -0.08
        "ndrome"               -0.08
        " Tattoos"             -0.08
        "Congratulations"      -0.08
        "Поз"                  -0.08
        " Congratulations"     -0.08
        " विजय"                -0.08
    Positive Logits
        Token                  Logit
        " farinha"             0.09
        " Leopold"             0.08
        " Barat"               0.08
        " pellet"              0.08
        " Vater"               0.08
        "fos"                  0.08
        " خرد"                 0.08
        " Leica"               0.08
        " pellets"             0.08
        (token missing)        0.07
    Activation Density: 0.012%

    No Known Activations