INDEX
    Explanations

    questions that begin with "How" or "What"

    New Auto-Interp
    Negative Logits
     Италијани
    -0.62
    これも
    -0.60
    нгред
    -0.60
    wußt
    -0.60
     **/
    
    -0.58
    ſicht
    -0.58
    })*/
    -0.58
    tvguidetime
    -0.57
     Succ
    -0.57
    WithIOException
    -0.56
    POSITIVE LOGITS
    How
    1.41
     How
    1.29
    What
    1.20
     What
    1.11
    Why
    0.93
     Why
    0.85
    Cómo
    0.81
    HOW
    0.79
    Hogyan
    0.77
     HOW
    0.75
    Act Density 0.182%

    No Known Activations