INDEX
    Explanations

    expressions of uncertainty or doubt regarding individual actions and agency

    New Auto-Interp
    Negative Logits
     киÑĪ
    -0.17
    ulace
    -0.17
    )application
    -0.15
    å´İ
    -0.15
    Occurred
    -0.15
    rogram
    -0.15
    å¨ľ
    -0.15
    lant
    -0.15
     nicht
    -0.14
    æk
    -0.14
    POSITIVE LOGITS
     else
    0.27
     except
    0.22
    except
    0.21
     ever
    0.21
    	else
    0.18
    else
    0.17
     ELSE
    0.17
     EVER
    0.17
     Except
    0.17
    Except
    0.16
    Act Density 0.031%

    No Known Activations