INDEX
    Explanations

    questions or phrases expressing curiosity or inquiry

    Negative Logits
    "/**"              -0.64
    "/*"               -0.60
    "twimg"            -0.60
    " endregion"       -0.60
                       -0.59
    "JspWriter"        -0.58
    "цезда"            -0.57
    "omiast"           -0.56
    "parsedMessage"    -0.54
    "//"               -0.53
    Positive Logits
    " Why"         1.33
    "Why"          1.31
    " why"         1.05
    "why"          0.98
    " Warum"       0.94
    "WHY"          0.94
    " WHY"         0.91
    " Pourquoi"    0.90
    " Waarom"      0.90
    "Pourquoi"     0.85
    Act Density 0.010%

    No Known Activations