INDEX
    Explanations

    negations and expressions of doubt or uncertainty

    New Auto-Interp
    Negative Logits
    ç«
    -0.15
     Shapiro
    -0.15
    ruz
    -0.14
    ','=',$
    -0.14
     Buchanan
    -0.14
     Initialise
    -0.14
    512
    -0.13
    кÑĥлÑĮ
    -0.13
    ept
    -0.13
    laz
    -0.13
    POSITIVE LOGITS
    cope
    0.18
     mate
    0.17
     altogether
    0.17
    Ïĥί
    0.16
    oen
    0.16
     cop
    0.16
     ar
    0.16
    aire
    0.16
     strictly
    0.16
    antes
    0.15
    Act Density 0.154%

    No Known Activations