    Explanations

    HTML tags indicating text formatting

    Negative Logits
    Token               Logit
    AndEndTag           -0.89
    AddTagHelper        -0.79
    parsedMessage       -0.71
    tagHelperRunner     -0.66
     برانيه             -0.66
    twimg               -0.63
     ब्रेकडाउन           -0.62
     Theſe              -0.62
     InputDecoration    -0.61
     gyhoeddwyd         -0.61
    Positive Logits
    Token               Logit
    '                   0.53
                        0.51
     the                0.51
    .                   0.50
    (                   0.50
                        0.50
    /                   0.48
     (                  0.48
     and                0.48
    p                   0.47
    Activation Density: 0.206%

    No Known Activations