INDEX
    Explanations

    phrases expressing disbelief or strong negative emotions

    expressions of disbelief or incredulity

    Negative Logits
    "aukee"      -0.82
    " exting"    -0.76
    "iHUD"       -0.74
    "incial"     -0.72
    "orage"      -0.70
    "abase"      -0.67
    "tails"      -0.66
    "kamp"       -0.66
    "ãĥīãĥ©"     -0.66
    "´"          -0.65
    Positive Logits
    " someone"    1.17
    " somebody"   1.06
    " anyone"     0.96
    " nobody"     0.90
    "someone"     0.85
    " anybody"    0.83
    " they"       0.81
    " we"         0.79
    " people"     0.77
    " somehow"    0.76
    Activation Density: 0.115%

    No Known Activations