INDEX
    Explanations

    terms related to commitment or adherence to a particular course of action

    the repeated use of the word "sticking."

    New Auto-Interp
    Negative Logits
    unes
    -0.81
    RT
    -0.73
    une
    -0.71
    BER
    -0.67
    uned
    -0.64
    uf
    -0.64
    OTO
    -0.63
    ept
    -0.63
    ogram
    -0.62
    eded
    -0.62
    POSITIVE LOGITS
     sticking
    1.13
     plaster
    0.99
     stick
    0.91
     caut
    0.84
     suspic
    0.83
     sticks
    0.82
     slic
    0.81
     proble
    0.78
     burner
    0.77
    pole
    0.76
    Act Density 0.007%

    No Known Activations