INDEX
    Explanations

    expressions of expectation or surprise related to personal achievements or experiences

    Negative Logits
    Token               Logit
    "chten"             -0.16
    "buie"              -0.15
    "elligent"          -0.15
    "_pas"              -0.14
    "ulary"             -0.14
    ".scalablytyped"    -0.14
    "çĦ¶"               -0.14
    "راÙĨÙĩ"            -0.14
    "è¡Ĩ"               -0.14
    "cmc"               -0.14
    Positive Logits
    Token               Logit
    " Little"           0.24
    "æĸĻ"               0.21
    "Little"            0.21
    " Expect"           0.20
    " little"           0.19
    " expectations"     0.19
    " wil"              0.19
    " expected"         0.18
    "expect"            0.18
    "Expect"            0.17
    Activation Density: 0.110%

    No Known Activations