INDEX
    Explanations

    carefully executed actions or plans

    NEGATIVE LOGITS
    Token             Logit
    "anon"            -0.80
    "olor"            -0.74
    "Stars"           -0.70
    " Cosponsors"     -0.68
    "bucks"           -0.68
    "XP"              -0.64
    "hey"             -0.64
    "asta"            -0.63
    "Native"          -0.63
    " Aid"            -0.63

    POSITIVE LOGITS
    Token             Logit
    " crafted"        1.25
    " calibrated"     1.24
    " scrutin"        1.09
    " calibr"         1.09
    " chore"          1.05
    " vetted"         1.04
    " curated"        1.01
    " deliber"        0.99
    " cultivated"     0.96
    " tailored"       0.94

    Activation Density: 0.047%

    No Known Activations