INDEX
    Explanations

    mentions of working on various projects

    instances of the word "worked."

    New Auto-Interp
    Negative Logits
    ylon
    -0.67
    arta
    -0.66
    gran
    -0.64
     Bol
    -0.63
    idium
    -0.63
    ustomed
    -0.63
     Mae
    -0.62
    thur
    -0.62
     Tian
    -0.62
     venerable
    -0.61
    POSITIVE LOGITS
    bench
    0.94
    hops
    0.93
     worked
    0.88
    hirt
    0.88
     ethic
    0.86
     collabor
    0.85
    heet
    0.79
     overtime
    0.76
     arrang
    0.75
    baugh
    0.74
    Act Density 0.021%

    No Known Activations