INDEX
    Explanations

    punctuation marks and sentence delimiters

    New Auto-Interp
    Negative Logits
    Skocz
    -0.68
    AddHtmlAttribute
    -0.63
    存于互联网档案馆
    -0.63
    Див
    -0.62
    ldk
    -0.61
    ResponseWriter
    -0.59
    hyrchwyd
    -0.58
    μως
    -0.58
    centralwidget
    -0.57
    Immobili
    -0.57
    POSITIVE LOGITS
    criptive
    0.55
    LabelTagHelper
    0.53
     betweenstory
    0.52
    oa̍t
    0.52
    ArrowToggle
    0.52
    omotor
    0.50
     nonUne
    0.49
    0.49
    كويكب
    0.48
    ilever
    0.47
    Act Density 0.536%

    No Known Activations