INDEX
    Explanations

    pronouns and their variations

    New Auto-Interp
    Negative Logits
    ikt
    -0.17
    leck
    -0.16
    td
    -0.15
    s
    -0.15
    amp
    -0.14
    eyes
    -0.13
    ibase
    -0.13
    sian
    -0.13
    acent
    -0.13
    eldorf
    -0.13
    POSITIVE LOGITS
    647
    0.16
     Cust
    0.15
     Powell
    0.15
    bsites
    0.15
    hausen
    0.14
     cust
    0.14
     powder
    0.14
    ève
    0.14
    Latch
    0.14
    eward
    0.14
    Act Density 0.030%

    No Known Activations