INDEX
    Explanations

    names containing the substring "Bon"

    New Auto-Interp
    Negative Logits
    dfx
    -0.93
     Ethics
    -0.79
    UGE
    -0.78
    ELD
    -0.77
    gamer
    -0.76
    INAL
    -0.73
     Editorial
    -0.72
    ODE
    -0.70
     Marijuana
    -0.69
    REAM
    -0.69
    POSITIVE LOGITS
    anza
    1.23
    iton
    1.21
    uses
    1.10
    gey
    1.08
    itors
    1.06
    neau
    1.06
    isson
    1.02
    etooth
    1.02
    Bon
    1.00
    nie
    0.99
    Act Density 8.902%

    No Known Activations