    Explanations

    phrases related to readiness or willingness to undertake particular actions

    Negative Logits
    " inder"        -0.99
    " emphat"       -0.94
    " zyn"          -0.92
    " effe"         -0.88
    " pessi"        -0.88
    " fundament"    -0.87
    " abnorm"       -0.87
    " ert"          -0.87
    " kram"         -0.86
    " aen"          -0.85
    Positive Logits
    " willing"        1.22
    "willing"         1.18
    " willingness"    1.05
    " Willing"        1.04
    "Willing"         0.96
    " unwilling"      0.78
    " skimage"        0.62
    " willingly"      0.62
    " reluctant"      0.58
    " bereit"         0.55
    Act Density 0.048%

    No Known Activations