INDEX
    Explanations
    New Auto-Interp
    Negative Logits
     hela
    -0.08
     Compet
    -0.08
     competencia
    -0.08
     Starting
    -0.07
     Lost
    -0.07
    Compet
    -0.07
     querido
    -0.07
     Rus
    -0.07
    Sim
    -0.07
    ину
    -0.07
    POSITIVE LOGITS
     qualifies
    0.09
     philosoph
    0.08
    waż
    0.08
    ற்ப
    0.08
    _candidates
    0.08
    (!(
    0.08
    xp
    0.08
     brave
    0.07
    услов
    0.07
    Trash
    0.07
    Act Density 0.002%

    No Known Activations