    Explanations

    interrogative phrases and questions

    "Are" followed by a pronoun

    Negative Logits
    <bos>               -0.65
     Jefus              -0.63
     itſelf             -0.57
    CodedInputStream    -0.56
     AppComponent       -0.54
     Chrift             -0.53
     Himself            -0.52
     Anybody            -0.52
     Réponses           -0.52
     himſelf            -0.51
    Positive Logits
     we                 1.09
     you                0.94
     they               0.84
     AssemblyCulture    0.75
     wir                0.67
     мы                 0.62
     wij                0.61
    you                 0.60
    ']],                0.60
     فريبيس             0.59
    Activation Density: 0.100%

    No Known Activations