INDEX
    Explanations

    phrases that indicate exceptions or deviations from general rules or norms

    "exception" or "exceptions"

    "exception" or "exceptions"

    New Auto-Interp
    Negative Logits
     besk
    -0.40
     kí
    -0.39
     piú
    -0.37
    quetas
    -0.37
     reprend
    -0.37
    LabelTagHelper
    -0.37
    oya
    -0.37
    seamnă
    -0.36
    -0.35
    ActionCreators
    -0.35
    POSITIVE LOGITS
     exceptions
    3.16
     exception
    3.04
     Exceptions
    2.61
     excep
    2.28
    exception
    2.27
     Exception
    2.27
    Exceptions
    2.26
    exceptions
    2.23
     excepción
    2.10
     exemption
    2.09
    Act Density 0.534%

    No Known Activations