INDEX
    Explanations

    phrases indicating inclusivity or additional information

    New Auto-Interp
    Negative Logits
    zelf
    -0.17
    reur
    -0.16
    äd
    -0.14
    141
    -0.14
    onse
    -0.14
    tk
    -0.14
     sẵn
    -0.13
    uesday
    -0.13
    iper
    -0.13
    ipl
    -0.13
    POSITIVE LOGITS
     ones
    0.14
    ément
    0.14
     Hint
    0.14
    redicate
    0.14
     Affero
    0.14
    .googlecode
    0.13
    ipur
    0.13
    taÅŁ
    0.13
     Tort
    0.13
    ĤŃ
    0.13
    Act Density 0.015%

    No Known Activations