INDEX
    Explanations

    special characters used in formatting or coding, particularly those surrounding text

    sequences of special characters and specific formatting patterns

    Negative Logits
     Canaver       -0.53
    ¶              -0.48
     spoilers      -0.47
     quotes        -0.45
     Patreon       -0.45
     disclaimer    -0.44
     ðŁij          -0.43
     Wiki          -0.43
     trolling      -0.43
     interviews    -0.43
    Positive Logits
    )."             0.64
    )).             0.56
    ).[             0.54
    .).             0.49
    ").             0.49
    ));             0.49
    ");             0.46
    %).             0.45
    destruct        0.45
    uchi            0.44
    Act Density 1.788%

    No Known Activations