INDEX
    Explanations

    terms related to a specific religious or cultural belief system

    words related to various forms of "arian," implying contexts involving specific ideologies or identities

    New Auto-Interp
    Negative Logits
    entry
    -0.68
    err
    -0.67
    redd
    -0.66
    pty
    -0.66
    same
    -0.66
    berman
    -0.65
    ura
    -0.65
    pless
    -0.64
    bench
    -0.64
    arton
    -0.63
    POSITIVE LOGITS
    arian
    1.09
    ism
    0.92
    cies
    0.92
    ity
    0.84
    arians
    0.80
    amental
    0.79
    omics
    0.77
    naire
    0.77
    ial
    0.77
    itarian
    0.76
    Act Density 0.015%

    No Known Activations