INDEX
    Explanations

    questions related to reasoning and seeking explanations

    "Does" followed by a question

    forms questions with "does"

    New Auto-Interp
    Negative Logits
     Мексичка
    -0.80
     المعيارى
    -0.77
     незавершена
    -0.77
     betweenstory
    -0.76
    DoubleQuotes
    -0.76
    bewerken
    -0.73
     Himo
    -0.73
    ThroughAttribute
    -0.72
    MLLoader
    -0.71
    SharedCtor
    -0.71
    POSITIVE LOGITS
     the
    0.87
     it
    0.73
     you
    0.71
     this
    0.62
     a
    0.61
     our
    0.60
    essing
    0.58
     anyone
    0.57
    udo
    0.55
     Tal
    0.53
    Act Density 0.117%

    No Known Activations