INDEX
    Explanations

    instances of the word "des" or its variations in different contexts

    Negative Logits
    Token         Logit
    "sie"         -0.16
    ".gdx"        -0.16
    " sie"        -0.15
    "ä¸Ī"         -0.14
    "citation"    -0.14
    "cba"         -0.14
    "czy"         -0.14
    " Sie"        -0.14
    "tps"         -0.14
    "reira"       -0.14
    Positive Logits
    Token         Logit
    "afi"         0.20
    "van"         0.19
    "emb"         0.19
    "ple"         0.18
    "vi"          0.18
    "engan"       0.18
    "vinc"        0.18
    "mere"        0.18
    "prend"       0.17
    "mem"         0.17
    Activation Density: 0.004%

    No Known Activations