INDEX
    Explanations

    expressions of confusion or discontent regarding expectations and reality

    New Auto-Interp
    Negative Logits
    avra
    -0.15
    otal
    -0.15
    _EXISTS
    -0.14
    á»ĵi
    -0.14
    presso
    -0.14
    kil
    -0.14
     пÑĢидеÑĤÑģÑı
    -0.14
    аж
    -0.13
    tility
    -0.13
    ront
    -0.13
    POSITIVE LOGITS
     supposed
    1.13
     suppose
    0.88
     meant
    0.71
     intended
    0.52
     supposedly
    0.51
     Suppose
    0.47
     alleged
    0.46
     purported
    0.46
     SUP
    0.46
    sup
    0.42
    Act Density 0.362%

    No Known Activations