INDEX
    Explanations

    numbers and special characters that might be associated with sensitive information or data

    special characters or symbols

    New Auto-Interp
    Negative Logits
     Charlottesville
    -0.73
     mole
    -0.69
     dividing
    -0.67
    cko
    -0.66
     Gutenberg
    -0.65
     clipboard
    -0.64
     Mueller
    -0.64
     licensee
    -0.64
     racists
    -0.62
     suprem
    -0.62
    POSITIVE LOGITS
    Ĥ
    1.37
    ¹
    1.03
    ¨
    0.97
    £
    0.97
    çͰ
    0.95
    اÙĦ
    0.92
    atar
    0.91
    ading
    0.91
    agara
    0.90
    ب
    0.90
    Act Density 0.006%

    No Known Activations