INDEX
    Explanations

    references to Christianity and related terms

    New Auto-Interp
    Negative Logits
    china
    -0.19
    reet
    -0.17
    chart
    -0.16
    ular
    -0.16
     Christianity
    -0.15
    gaard
    -0.15
    sert
    -0.15
    ustr
    -0.15
    sole
    -0.15
    IES
    -0.15
    POSITIVE LOGITS
    ized
    0.25
    ity
    0.21
    izing
    0.20
    -Muslim
    0.20
    å¾Ĵ
    0.19
    like
    0.18
    etz
    0.17
    ization
    0.17
    ize
    0.17
     Broadcasting
    0.16
    Act Density 0.013%

    No Known Activations