INDEX
    Explanations

    negations or expressions of uncertainty

    New Auto-Interp
    Negative Logits
    trand
    -0.15
     Trophy
    -0.14
     Strict
    -0.14
    609
    -0.14
     Rena
    -0.14
    thren
    -0.14
    677
    -0.14
    669
    -0.14
    Ŀ
    -0.14
    го
    -0.14
    POSITIVE LOGITS
    engin
    0.17
    MBER
    0.17
    asher
    0.16
     Quint
    0.16
    dash
    0.15
    igkeit
    0.14
    icina
    0.14
    žel
    0.14
    ahlen
    0.14
    æ©
    0.14
    Act Density 0.184%

    No Known Activations