INDEX
    Explanations

    punctuation, specifically parentheses and certain symbols

    New Auto-Interp
    Negative Logits
    ValueStyle
    -1.07
     بيها
    -1.06
     ویکی‌پدیا
    -1.00
    SourceChecksum
    -1.00
    findpost
    -0.96
    )\}$
    -0.92
     poffe
    -0.87
    animity
    -0.86
    impianto
    -0.85
    RenderAtEndOf
    -0.85
    POSITIVE LOGITS
     (
    0.80
    )(
    0.76
    外部連結
    0.73
     McGovern
    0.70
     Schröder
    0.70
    》(
    0.69
     Gruber
    0.68
    0.67
     elems
    0.66
    0.66
    Act Density 0.042%

    No Known Activations