INDEX
    Explanations

    contextual expressions of disbelief and surprise

    disbelief or surprise

    New Auto-Interp
    Negative Logits
     estekak
    -0.58
    ſelves
    -0.57
     tartalomajánló
    -0.55
    Personendaten
    -0.52
    audiovisuel
    -0.50
    IVEREF
    -0.50
     InputDecoration
    -0.50
     autorytatywna
    -0.49
    wiſe
    -0.47
     Photocase
    -0.45
    POSITIVE LOGITS
     disbelief
    0.65
     unbelievable
    0.56
     surprising
    0.55
     astonishing
    0.55
     incred
    0.54
     amazement
    0.54
     sorprend
    0.53
     astonishment
    0.52
     shocking
    0.52
     astonished
    0.51
    Act Density 0.331%

    No Known Activations