INDEX
    Explanations

    expressions related to contradictions and critiques of societal norms

    negative statements or conclusions

    Negative Logits
    MigrationBuilder  -0.50
     stället  -0.43
    WithIOException  -0.43
     juridiques  -0.42
     griega  -0.42
    BufferException  -0.42
     inalámbrica  -0.41
     žem  -0.41
     peixe  -0.41
     heißen  -0.41
    Positive Logits
    0.50
    UrlResolution
    0.48
     unfair
    0.44
    DECREF
    0.43
    Literatuur
    0.41
    ietal
    0.41
     analogy
    0.41
     welfare
    0.40
    vele
    0.40
    dians
    0.40
    Act Density 0.220%

    No Known Activations