INDEX
    Explanations

    specific titles and formal qualifications

    Negative Logits
    "iggins"    -0.16
    " Bris"     -0.15
    "гал"       -0.14
    "_fatal"    -0.14
    "dsn"       -0.14
    "elpers"    -0.13
    "è§"        -0.13
    "awy"       -0.13
    "poser"     -0.13
    "ãĢ"        -0.13
    Positive Logits
    " Har"      0.20
    " har"      0.19
    " Bil"      0.18
    "nic"       0.18
    " nic"      0.17
    "hiro"      0.16
    " conf"     0.15
    " niche"    0.15
    " Dig"      0.15
    " HAR"      0.15
    Act Density 0.008%

    No Known Activations