INDEX
    Explanations

    negative phrases or expressions indicating doubt or lack of certainty

    NEGATIVE LOGITS
    reen
    -0.15
     아니라
    -0.14
     horribly
    -0.14
    并不
    -0.14
    avin
    -0.14
    olini
    -0.13
    ouv
    -0.13
     nonzero
    -0.13
    inia
    -0.13
     somehow
    -0.13
    POSITIVE LOGITS
     any
    0.25
     anymore
    0.24
     spared
    0.19
     much
    0.19
     Any
    0.18
    Any
    0.18
     less
    0.18
     anyhow
    0.18
    any
    0.18
     anybody
    0.17
    Act Density 0.221%

    No Known Activations
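
The activation density figure above can be read as the share of sampled tokens on which the feature fires. The sketch below is a minimal illustration of that reading, not the dashboard's own code: the function name `activation_density`, the `threshold` parameter, and the sample data are all hypothetical, and it assumes "active" means an activation strictly greater than zero.

```python
def activation_density(activations, threshold=0.0):
    """Return the fraction of tokens whose activation exceeds `threshold`.

    Assumption: a token counts as active when its activation is strictly
    greater than the threshold (0.0 by default).
    """
    if not activations:
        return 0.0
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 221 active tokens out of 100,000 sampled tokens.
sample = [1.0] * 221 + [0.0] * 99_779
print(f"{activation_density(sample):.3%}")  # prints "0.221%"
```

A density this low means the feature is silent on the vast majority of tokens, which fits a feature tied to a narrow class of expressions.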