INDEX
    Explanations

    questions starting with "Why"

    instances of the word "Why" that express questioning or curiosity

    New Auto-Interp
    Negative Logits
     Roller
    -0.71
    ages
    -0.65
    rop
    -0.63
    lator
    -0.62
     puck
    -0.62
    ymph
    -0.62
     polymorph
    -0.62
     tuber
    -0.62
     medic
    -0.62
    pocket
    -0.61
    POSITIVE LOGITS
    soever
    1.12
     why
    0.94
    why
    0.91
     WHY
    0.90
    Why
    0.86
    iterranean
    0.75
     Why
    0.75
    tical
    0.71
    beit
    0.71
    ertodd
    0.71
    Act Density 0.037%

    No Known Activations