INDEX
    Explanations

    Action verbs, particularly forms of "do" (do, doing, done).

    Negative Logits
    " houſe"        -0.89
    " Ise"          -0.83
    "Portail"       -0.82
    " ſtate"        -0.78
    " ſche"         -0.75
    " perſon"       -0.75
    " rodríguez"    -0.74
    " scattata"     -0.74
    " fernández"    -0.74
    " sánchez"      -0.74

    Positive Logits
    " done"         1.72
    "Doing"         1.46
    " doing"        1.44
    " do"           1.40
    "doing"         1.40
    " Doing"        1.36
    " DOING"        1.34
    "done"          1.30
    " DONE"         1.29
    " doin"         1.28
    Act Density 0.110%

    No Known Activations