INDEX
    Explanations

    phrases indicating a sense of expectation or obligation involving an apology or accountability

    New Auto-Interp
    Negative Logits
    ÙĤب
    -0.15
    á»ijng
    -0.15
     Injection
    -0.15
    ÑĮв
    -0.15
    è¶³
    -0.15
    ucher
    -0.15
    Injection
    -0.14
    ery
    -0.14
    argest
    -0.14
    olics
    -0.13
    POSITIVE LOGITS
     Hast
    0.16
     motor
    0.15
    alg
    0.15
    æĭ¬
    0.15
    (Component
    0.15
    ognito
    0.14
    zer
    0.14
    ipse
    0.14
    orate
    0.14
     aster
    0.14
    Act Density 0.014%

    No Known Activations