INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     fundament
    -0.07
     rewards
    -0.07
     Prom
    -0.07
     Mil
    -0.07
    ,['
    -0.07
     milestone
    -0.07
     motivo
    -0.07
     open
    -0.07
     ['
    -0.07
     Catal
    -0.07
    POSITIVE LOGITS
    awah
    0.10
     تدو
    0.09
     deadlines
    0.09
    hamu
    0.09
    jajo
    0.08
     қамтамасыз
    0.08
     scén
    0.08
     पीछे
    0.08
    ennials
    0.08
    rolley
    0.08
    Act Density 0.018%

    No Known Activations