INDEX
    Explanations

    references to digital technology

    New Auto-Interp
    Negative Logits
    avel
    -0.16
    ance
    -0.16
     bowed
    -0.15
    ela
    -0.15
    abil
    -0.15
    elle
    -0.15
    nice
    -0.15
    -strokes
    -0.15
    rr
    -0.14
    uch
    -0.14
    POSITIVE LOGITS
    ized
    0.28
    ization
    0.24
    ãĤ¿ãĥ«
    0.23
    izing
    0.20
    isiert
    0.20
    izador
    0.20
    IZED
    0.18
    izado
    0.18
    lsi
    0.17
    izes
    0.17
    Act Density 0.025%

    No Known Activations