INDEX
    Explanations

    interrogative sentences or questions

    New Auto-Interp
    Negative Logits
    orgetown
    -0.17
     Cald
    -0.15
    atron
    -0.15
    šak
    -0.14
     Yelp
    -0.14
     Cure
    -0.14
    ÙĨÙĩ
    -0.14
    anova
    -0.14
    exc
    -0.13
    igen
    -0.13
    POSITIVE LOGITS
     answered
    0.17
     answer
    0.16
    asant
    0.16
     Answer
    0.16
     obt
    0.15
    apter
    0.15
    Asked
    0.14
     Lê
    0.14
    rawer
    0.14
    Ŀ
    0.14
    Act Density 0.024%

    No Known Activations