INDEX
    Explanations

    expressions of shock or surprise in various contexts

    New Auto-Interp
    Negative Logits
    .scalablytyped
    -0.19
    ettle
    -0.16
    -ci
    -0.16
    ëł¹
    -0.14
    estre
    -0.14
    rib
    -0.14
    ADB
    -0.14
    ราย
    -0.14
    cke
    -0.14
    ongyang
    -0.14
    POSITIVE LOGITS
    ingly
    0.33
    aper
    0.16
    ington
    0.15
    enga
    0.15
    habi
    0.15
    çĦ¶
    0.15
    ively
    0.15
     upon
    0.14
    871
    0.14
     Prize
    0.13
    Act Density 0.108%

    No Known Activations