INDEX
    Explanations

    articles and descriptors related to people and their attributes

    New Auto-Interp
    Negative Logits
    swer
    -0.16
    άνÏĦα
    -0.15
    omu
    -0.15
    elere
    -0.14
    ctrine
    -0.14
    astle
    -0.14
    crire
    -0.14
    ãĥ³ãĥĢ
    -0.14
    wang
    -0.13
    .crm
    -0.13
    POSITIVE LOGITS
     man
    0.14
     Platt
    0.14
    ระ
    0.14
    UNET
    0.14
    odore
    0.13
    usch
    0.13
    evi
    0.13
    opleft
    0.13
    quila
    0.13
    Hol
    0.13
    Act Density 0.105%

    No Known Activations