INDEX
    Explanations

    terms related to software development and technical processes

    New Auto-Interp
    Negative Logits
    sel
    -0.28
    sa
    -0.26
    son
    -0.26
    ìĿĦ
    -0.25
    re
    -0.25
    sh
    -0.24
    sha
    -0.24
    rer
    -0.23
    ship
    -0.22
    ses
    -0.22
    POSITIVE LOGITS
    Ìģ
    0.26
    iros
    0.25
    urope
    0.21
    yes
    0.20
    eer
    0.19
    chts
    0.19
    iras
    0.19
    arning
    0.19
    iro
    0.19
    ptides
    0.19
    Act Density 0.227%

    No Known Activations