INDEX
    Explanations

    phrases related to working hard or putting effort into tasks

    references to hard work or effort

    Negative Logits
    Token           Logit
    " Tot"          -0.67
    " Nest"         -0.66
    "DATA"          -0.65
    " Quarter"      -0.63
    " Kut"          -0.63
    " Sv"           -0.63
    " Salvation"    -0.62
    " Sut"          -0.61
    " Expend"       -0.60
    " Quin"         -0.60
    Positive Logits
    Token           Logit
    "esley"         0.80
    "itud"          0.79
    " enough"       0.79
    "ened"          0.79
    "balls"         0.77
    " hard"         0.73
    "entimes"       0.73
    "ball"          0.72
    "wired"         0.72
    " harder"       0.71
    Act Density 0.027%

    No Known Activations