INDEX
    Explanations

    punctuation followed by Spanish phrases

    New Auto-Interp
    Negative Logits
     egregious
    1.02
     proactive
    0.99
     limited
    0.96
     overarching
    0.96
     pesky
    0.95
     needing
    0.95
     tailored
    0.93
     decent
    0.92
     ethical
    0.92
     intelligent
    0.92
    POSITIVE LOGITS
    Pentru
    1.10
    În
    1.01
    mostrar
    1.01
    Untuk
    1.01
    если
    1.01
    Esta
    0.99
     aparición
    0.98
    desde
    0.97
    Dengan
    0.95
     μπορεί
    0.95
    Act Density 0.660%

    No Known Activations