INDEX
    Explanations

    words related to person pronouns

    Followed by a preposition

    pronouns followed by nouns/verbs

    New Auto-Interp
    Negative Logits
     circ
    -0.47
    awtextra
    -0.47
    recep
    -0.47
     recep
    -0.47
     мәкал
    -0.47
    ôles
    -0.47
    quelize
    -0.46
     Prov
    -0.45
    toprule
    -0.45
     kaynağından
    -0.45
    POSITIVE LOGITS
     berdua
    0.58
    zelf
    0.44
     们
    0.43
     kautta
    0.42
     springfox
    0.40
     aikana
    0.40
     hindurch
    0.40
    langkah
    0.36
     lipat
    0.36
     aikaa
    0.36
    Act Density 0.244%

    No Known Activations