INDEX
    Explanations

    the word "why" and its variations, indicating a focus on questions or explanations

    Negative Logits
    Token        Logit
    " Roller"    -0.78
    "ymph"       -0.72
    "trop"       -0.66
    "rop"        -0.66
    "amps"       -0.66
    "robe"       -0.63
    "hern"       -0.62
    "lator"      -0.62
    " puck"      -0.61
    "Zone"       -0.61
    Positive Logits
    Token           Logit
    " why"          1.04
    "soever"        1.03
    " WHY"          0.96
    "why"           0.95
    "iterranean"    0.80
    " exactly"      0.79
    "Why"           0.79
    "ihad"          0.75
    "icago"         0.72
    "ricanes"       0.69
    Activation Density: 0.034%

    No Known Activations