INDEX
    Explanations

    negations and exceptions in sentences

    Negative Logits
     المعيارى
    -0.52
    AndEndTag
    -0.48
     pleaſure
    -0.47
    EndInit
    -0.46
    enderror
    -0.46
    TagHelper
    -0.44
    ctest
    -0.44
    #+#
    -0.42
     LoginPage
    -0.41
    出版年
    -0.41
    Positive Logits
     ujednoznacz
    0.64
     ilman
    0.60
     non
    0.56
     Non
    0.56
    脚注の使い方
    0.53
     zonder
    0.51
    Non
    0.50
     uden
    0.50
    Ohne
    0.50
     عدم
    0.49
    Activation Density: 0.621%

    No Known Activations