INDEX
    Explanations

    punctuation marks and certain conjunctions or prepositions

    new paragraph or clause markers

    Negative Logits
     ویکی‌پدی          -0.47
    parsedMessage     -0.47
    complexContent    -0.45
    };*/              -0.45
     Italijanski      -0.45
     relâche          -0.43
    rungsseite        -0.43
    LikeLiked         -0.41
    +#+#              -0.41
    Suivez            -0.41
    Positive Logits
    ViewFeatures        0.48
    InjectAttribute     0.46
    migrationBuilder    0.44
     graag              0.40
                        0.39
    ValueStyle          0.38
    RTEX                0.38
    antMatchers         0.37
     dress              0.36
     दर                 0.36
    Act Density 0.015%

    No Known Activations