INDEX
    Explanations

    mentions of the name "Bon" with varying activations

    references to a specific individual named "Bon."

    New Auto-Interp
    Negative Logits
    dfx
    -0.83
     Ethics
    -0.77
    gamer
    -0.68
    æĸ¹
    -0.68
     Marijuana
    -0.68
    INAL
    -0.67
    ELD
    -0.65
     Editorial
    -0.65
     Regulatory
    -0.64
     Nicotine
    -0.64
    POSITIVE LOGITS
    anza
    1.11
    iton
    1.08
    uses
    1.04
    Bon
    0.99
    gey
    0.98
    itors
    0.97
    illas
    0.95
    etooth
    0.93
    isson
    0.92
    vill
    0.91
    Act Density 0.009%

    No Known Activations