    Explanations

    variations of the word "participate" and its derivatives

    Negative Logits
    Token     Logit
    inged     -0.18
    zeň       -0.15
    ingers    -0.15
    tě        -0.14
    halt      -0.14
    bug       -0.14
    änger     -0.14
    liž       -0.14
    isé       -0.14
    eat       -0.14
    Positive Logits
    Token     Logit
    ipation   0.25
    les       0.23
    atory     0.22
    LES       0.19
    ip        0.18
    abra      0.17
    antes     0.17
    ipp       0.16
    ipe       0.16
    iple      0.16
    Act Density 0.006%
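
    To put the density figure in concrete terms, here is a minimal sketch of the arithmetic. It assumes that "Act Density" means the fraction of sampled tokens on which the feature has a nonzero activation; the page itself does not define the term, and the function name `density_from_percent` is hypothetical.

```python
from fractions import Fraction

def density_from_percent(percent: str) -> Fraction:
    """Convert a percentage string such as "0.006" into an exact fraction."""
    # Assumption: the dashboard value is a percentage of sampled tokens.
    return Fraction(percent) / 100

density = density_from_percent("0.006")

# Average number of tokens between activations, under the assumption above.
tokens_per_activation = 1 / density

print(float(density))                # 6e-05
print(round(tokens_per_activation))  # 16667
```

    Under that reading, the feature fires on roughly 6 of every 100,000 tokens, or about once per 16,667 tokens.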

    No Known Activations