INDEX
    Explanations

    phrases related to doing one's best or making an effort

    phrases expressing capability or potential actions

    Negative Logits
    Token          Logit
    " Lis"         -0.65
    " Politics"    -0.63
    " Cance"       -0.63
    " Mour"        -0.63
    " Passage"     -0.63
    " Falk"        -0.63
    " Ez"          -0.62
    " Prin"        -0.62
    " Rising"      -0.61
    " Falling"     -0.61
    Positive Logits
    Token            Logit
    "berra"          1.05
    "'t"             1.04
    " muster"        1.04
    " feas"          0.93
    "adian"          0.86
    " reasonably"    0.84
    "nesota"         0.82
    " afford"        0.81
    "iary"           0.79
    "NOT"            0.79
    Activation Density: 0.094%

    No Known Activations