INDEX
    Explanations

    interrogative sentences starting with "What" or "I don't know what" that express uncertainty or confusion

    New Auto-Interp
    Negative Logits
    ulic
    -0.78
    inence
    -0.74
    eln
    -0.74
    heter
    -0.74
    erate
    -0.73
    hari
    -0.71
    robe
    -0.71
    gur
    -0.69
    emp
    -0.69
    enberg
    -0.67
    POSITIVE LOGITS
     happens
    1.29
     happened
    1.28
     transpired
    1.15
     kinds
    1.12
     else
    1.11
     constitutes
    1.11
     happ
    1.08
     exactly
    1.03
    soever
    0.99
     kind
    0.97
    Act Density 0.352%

    No Known Activations