INDEX
    Explanations

    words related to attempts or efforts

    New Auto-Interp
    Negative Logits
    ahu
    -0.58
    amaz
    -0.55
    inth
    -0.55
    cised
    -0.54
    hea
    -0.54
    requisite
    -0.53
    scar
    -0.52
    icka
    -0.51
    marks
    -0.51
    dylib
    -0.50
    POSITIVE LOGITS
     desperately
    1.10
     unsuccessfully
    1.02
     to
    0.96
     harder
    0.93
     hard
    0.87
     valiant
    0.87
     vain
    0.78
     frantically
    0.76
     toget
    0.74
    hard
    0.73
    Act Density 0.054%

    No Known Activations