INDEX
    Explanations

    names of people prominent in various contexts

    New Auto-Interp
    Negative Logits
     himself
    -0.17
     Woo
    -0.15
    uss
    -0.14
     Erf
    -0.14
    873
    -0.14
     son
    -0.14
     Lev
    -0.14
     Baxter
    -0.14
    ÑĥÑģ
    -0.14
     Levin
    -0.14
    POSITIVE LOGITS
     herself
    0.20
    ová
    0.16
    ]=="
    0.15
    jeme
    0.15
     lesbian
    0.15
    ovna
    0.15
     Lesbian
    0.15
    fone
    0.15
    ÙĬدة
    0.15
    },'
    0.15
    Act Density 0.080%

    No Known Activations