INDEX
    Explanations
    NEGATIVE LOGITS
    " refunded"           -0.08
    " levy"               -0.07
    "abilir"              -0.07
    " passie"             -0.07
    " berkembang"         -0.07
    " devolución"         -0.07
    " priorities"         -0.07
    " drifting"           -0.07
    " broaden"            -0.07
    " aspectos"           -0.07
    POSITIVE LOGITS
    "ца"                   0.10
    "ccc"                  0.08
    "corn"                 0.08
    "সি"                   0.08
    " infamous"            0.08
    (no visible token)     0.08
    " notorious"           0.08
    "цо"                   0.08
    (no visible token)     0.08
    "aq"                   0.08
    Activation Density: 0.002%

    No Known Activations