INDEX
    Explanations

    punctuation and formatting elements in the text

    Negative Logits
     Utama            -0.46
    N                 -0.45
     potentially      -0.45
    0                 -0.44
     Pener            -0.44
    1                 -0.42
                      -0.41
    J                 -0.41
    <em>              -0.41
    ducir             -0.40
    Positive Logits
    tagHelperRunner   0.89
    featureID         0.85
    adpleegd          0.79
    帖最后由          0.78
    IsMutable         0.78
    awtextra          0.77
    EndInit           0.75
    GTCX              0.75
    ConstraintMaker   0.73
    AndEndTag         0.73
    Activation Density 0.064%

    No Known Activations