INDEX
    Explanations

    words related to processes or actions in descriptions

    Negative Logits
    "_singleton"       -0.16
    "ema"              -0.16
    "cete"             -0.15
    "upy"              -0.15
    " Lug"             -0.15
    "Dem"              -0.14
    "á»ı"              -0.14
    "allback"          -0.14
    " shaved"          -0.14
    " Microsystems"    -0.14
    Positive Logits
    "ENCIL"            0.15
    " Denn"            0.15
    "assi"             0.14
    "iders"            0.14
    " Craft"           0.14
    "orang"            0.14
    "owns"             0.13
    "ropdown"          0.13
    " Went"            0.13
    "atos"             0.13
    Activation Density: 0.004%

    No Known Activations