INDEX
    Explanations

    references to atheism and atheists

    Negative Logits
    AdapterFactory    -0.16
    æĿ                -0.15
    à¸ĵ               -0.15
     Madison          -0.14
    ango              -0.14
    wor               -0.14
    ahl               -0.14
    æ¥ļ               -0.14
    ssi               -0.14
     Riley            -0.14
    Positive Logits
    /ay               0.14
    221               0.14
     Bakan            0.14
    _permalink        0.14
     Cliff            0.13
    linkplain         0.13
    .pool             0.13
    /non              0.13
    494               0.13
    iram              0.13
    Activation Density: 0.009%

    No Known Activations