INDEX
    Explanations

    references to the concept of creation in various contexts

    New Auto-Interp
    Negative Logits
     val
    -0.16
    ug
    -0.16
    weg
    -0.15
    ogo
    -0.15
    itr
    -0.15
    boo
    -0.14
    irsch
    -0.14
    /Area
    -0.14
     meaning
    -0.14
    untu
    -0.14
    POSITIVE LOGITS
    RIX
    0.17
    nish
    0.17
    zier
    0.16
     Mur
    0.16
    ighbor
    0.15
    anje
    0.15
    QA
    0.15
     Barrier
    0.15
    anky
    0.15
    rix
    0.14
    Act Density 0.013%

    No Known Activations