INDEX
    Explanations

    special characters or symbols within text

    instances of a specific character or symbol in text

    Negative Logits
    Token         Logit
    "ŃĶ"          -0.74
    "icult"       -0.71
    "ysis"        -0.67
    "ocument"     -0.66
    "ijn"         -0.66
    " Blu"        -0.63
    " frag"       -0.63
    "uers"        -0.63
    "ici"         -0.62
    "#$#$"        -0.62
    Positive Logits
    Token              Logit
    "––"               1.04
    "âĪĴ"              0.87
    "cases"            0.83
    "advertisement"    0.82
    "-+"               0.82
    (blank in source)  0.79
    "issues"           0.79
    "micro"            0.78
    "style"            0.77
    "mediated"         0.76
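Tokens such as "âĪĴ" and "ŃĶ" look like raw byte-level BPE vocabulary strings rather than readable text. The sketch below assumes the model uses the GPT-2 style byte-to-character table (an assumption; the source does not name the tokenizer) and inverts it to recover the underlying bytes. The helper names `token_to_bytes` and `decode_token` are hypothetical, chosen for illustration.

```python
def bytes_to_unicode():
    """Byte -> printable character table, as used by GPT-2 style byte-level BPE."""
    printable = (
        list(range(ord("!"), ord("~") + 1))
        + list(range(ord("¡"), ord("¬") + 1))
        + list(range(ord("®"), ord("ÿ") + 1))
    )
    byte_values = printable[:]
    code_points = printable[:]
    shift = 0
    for b in range(256):
        if b not in printable:
            # Non-printable bytes are remapped to code points 256, 257, ...
            byte_values.append(b)
            code_points.append(256 + shift)
            shift += 1
    return dict(zip(byte_values, map(chr, code_points)))


UNICODE_TO_BYTE = {ch: b for b, ch in bytes_to_unicode().items()}


def token_to_bytes(token: str) -> bytes:
    """Map a vocabulary string back to the raw bytes it stands for."""
    out = bytearray()
    for ch in token:
        if ch in UNICODE_TO_BYTE:
            out.append(UNICODE_TO_BYTE[ch])
        else:
            # Assumption: characters outside the table (a plain space, an
            # en dash) were already decoded by the dashboard; keep them as-is.
            out.extend(ch.encode("utf-8"))
    return bytes(out)


def decode_token(token: str) -> str:
    """Decode a vocabulary string; undecodable bytes become U+FFFD."""
    return token_to_bytes(token).decode("utf-8", errors="replace")


print(repr(decode_token("âĪĴ")))   # '−' (U+2212 MINUS SIGN)
print(token_to_bytes("ŃĶ").hex())  # ad94
```

Under that assumption, "âĪĴ" is the Unicode minus sign, which fits the other dash-like tokens in the positive list, while "ŃĶ" maps to the bytes AD 94, a fragment of a multi-byte UTF-8 sequence that is not a complete character on its own.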
    Activation Density: 0.019%
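For scale, a density of 0.019% corresponds to about 19 active tokens per 100,000. The sketch below shows one way such a figure could be computed, assuming density means the fraction of sampled tokens on which the feature's activation exceeds zero (an assumption; the source does not define the metric). The function name and the sample data are hypothetical.

```python
def activation_density(activations, threshold=0.0):
    """Fraction of tokens whose activation exceeds `threshold`."""
    if not activations:
        raise ValueError("need at least one activation value")
    active = sum(1 for a in activations if a > threshold)
    return active / len(activations)


# Hypothetical sample: 10,000 tokens, the feature fires on 2 of them.
sample = [0.0] * 9998 + [1.3, 0.4]
print(f"{activation_density(sample):.3%}")  # 0.020%
```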

    No Known Activations