INDEX
    Explanations

    repeated punctuation marks or periods

    New Auto-Interp
    Negative Logits
    UnusedPrivate
    -1.03
    TagMode
    -1.01
     חיצוניים
    -0.97
     Efq
    -0.94
    awtextra
    -0.94
    LayoutStyle
    -0.90
     snippetHide
    -0.88
     CreateTagHelper
    -0.87
    Tikang
    -0.84
    WriteBarrier
    -0.83
    POSITIVE LOGITS
    apache
    0.56
    .
    0.54
    import
    0.53
     /\.
    0.53
     trọng
    0.49
     Wh
    0.48
     the
    0.47
    gge
    0.46
    oso
    0.46
     Donald
    0.46
    Act Density 0.034%

    No Known Activations