INDEX
    Explanations

    instances of negation language

    New Auto-Interp
    Negative Logits
    ing
    -0.73
     be
    -0.71
     able
    -0.66
    🏻
    -0.65
    NotBe
    -0.64
    Swift
    -0.63
     pince
    -0.62
    ientôt
    -0.61
    fromCharCode
    -0.61
    věř
    -0.61
    POSITIVE LOGITS
    Données
    0.87
     Moulton
    0.84
     SafeMath
    0.79
    Addo
    0.78
    KommentareTeilen
    0.76
     Völ
    0.76
    orses
    0.75
    Portail
    0.73
    dhury
    0.73
    UpInside
    0.73
    Act Density 0.121%

    No Known Activations