INDEX
    Explanations

    discussions around theories and beliefs, especially when evaluating their validity or acceptance

    Sentences ending with punctuation

    contrasting or incorrect statements

    New Auto-Interp
    Negative Logits
    XmlAccessType
    -0.76
    DockStyle
    -0.64
    MemoryWarning
    -0.60
    Mediabestanden
    -0.59
    WriteBarrier
    -0.56
    ApiOperation
    -0.56
     PyObject
    -0.54
    resizingMask
    -0.53
    Hauptartikel
    -0.52
    addPreferredGap
    -0.51
    POSITIVE LOGITS
     Pourtant
    0.77
     Résultat
    0.60
     WRONG
    0.60
     जबकि
    0.59
    0.57
     Wrong
    0.57
     forgetting
    0.57
     wrong
    0.56
     falschen
    0.56
     оригіналу
    0.52
    Act Density 0.345%

    No Known Activations