INDEX
    Explanations

    mentions of the word "man."

    mentions of the word "man."

    New Auto-Interp
    Negative Logits
    Assembly
    -0.71
    duration
    -0.71
    efficients
    -0.70
    ":["
    -0.70
    Fuel
    -0.70
    veyard
    -0.68
    FW
    -0.68
    Regular
    -0.67
     Cosponsors
    -0.67
    Priv
    -0.67
    POSITIVE LOGITS
    uscript
    1.12
    hunt
    1.05
    nered
    0.97
    hood
    0.94
    agers
    0.90
    fred
    0.87
    ifest
    0.85
    osphere
    0.82
     man
    0.81
    WithNo
    0.80
    Act Density 0.043%

    No Known Activations