INDEX
    Explanations

    phrases related to effort and responsibility in completing tasks

    Negative Logits
    achi            -0.20
    asher           -0.16
    ACHI            -0.15
    packageName     -0.15
     Rao            -0.15
    hiro            -0.15
    ubbo            -0.14
    алів            -0.14
    olf             -0.14
    .isSuccessful   -0.14
    Positive Logits
     heavy          0.36
    heavy           0.34
     Heavy          0.31
     dirty          0.30
    dirty           0.30
     leg            0.30
     work           0.28
     grunt          0.28
    Heavy           0.27
    grunt           0.27
    Activation Density 0.075%

    No Known Activations