INDEX
    Explanations

    terms related to the Ku Klux Klan or racist ideologies

    references to specific names and terms associated with notable historical or cultural contexts

    New Auto-Interp
    Negative Logits
    pection
    -0.84
     hars
    -0.68
     NEC
    -0.68
     Finder
    -0.67
     datas
    -0.65
    fires
    -0.63
    claw
    -0.59
     holiest
    -0.59
    FG
    -0.58
     Kraken
    -0.58
    POSITIVE LOGITS
    sburgh
    0.93
    nih
    0.86
    osate
    0.72
    roleum
    0.72
    åĬ
    0.69
    affer
    0.69
    irus
    0.69
    \":
    0.69
    sylvania
    0.69
    assium
    0.69
    Act Density 0.037%

    No Known Activations