INDEX
    Explanations

    names of notable personalities, such as "DeVos" and "Dawkins."

    mentions of specific individuals, particularly Betsy DeVos and Richard Dawkins

    Negative Logits
    " Polic"        -0.79
    " Fant"         -0.78
    " Leth"         -0.71
    " Pike"         -0.70
    " pneum"        -0.68
    " Lith"         -0.66
    " buoy"         -0.65
    " Antar"        -0.64
    " fishermen"    -0.64
    "ppo"           -0.62
    Positive Logits
    "liga"          0.90
    "æł"            0.86
    "etics"         0.84
    "à¼"            0.84
    "heimer"        0.83
    "chool"         0.81
    "ãĥ¼ãĥĨãĤ£"     0.80
    "ulously"       0.79
    "verse"         0.76
    "hyde"          0.75
    Activation Density: 0.021%

    No Known Activations