INDEX
    Explanations

    sentences where the speaker expresses uncertainty or seeks information

    expressions of uncertainty and mixed emotions

    New Auto-Interp
    Negative Logits
    surprisingly
    -0.67
     unsurprisingly
    -0.62
     predictably
    -0.59
    iannopoulos
    -0.55
    ãĥĥãĥĪ
    -0.54
     Cosponsors
    -0.53
    ortium
    -0.53
     ®
    -0.53
     similarly
    -0.52
    æ©Ł
    -0.52
    POSITIVE LOGITS
    ..."
    1.15
     â̦"
    1.11
    )."
    1.09
     ..."
    1.08
    â̦"
    1.07
    .")
    1.07
    .'"
    0.99
     fuckin
    0.98
    â̦."
    0.97
    !'"
    0.93
    Act Density 1.484%

    No Known Activations