    Explanations

    questions that begin with the word "Why."

    Negative Logits
    Token               Logit
    ' presented'        -0.58
    'Hauptartikel'      -0.56
    ' tend'             -0.56
    'erl'               -0.55
    ' ")['              -0.54
    'XXXXXXXX'          -0.50
    ' Barker'           -0.50
    ' relative'         -0.50
    'Depend'            -0.50
    ' depend'           -0.49
    Positive Logits
    Token               Logit
    ' why'              3.50
    'why'               3.30
    'Why'               3.10
    ' Why'              3.07
    'WHY'               2.96
    ' WHY'              2.94
    ' pourquoi'         2.62
    ' Waarom'           2.56
    ' Warum'            2.48
    ' waarom'           2.40
    Activation Density: 0.059%

    No Known Activations