INDEX
    Explanations

    phrases and actions related to transformative experiences and emotional expressions

    New Auto-Interp
    Negative Logits
    uite
    -0.17
    rente
    -0.15
    rame
    -0.15
    IGIN
    -0.15
    elpers
    -0.15
    vrd
    -0.15
    INVAL
    -0.15
    ober
    -0.15
    ÅĻeb
    -0.15
    quivo
    -0.14
    POSITIVE LOGITS
     
    0.17
    !
    0.15
     =
    0.15
     .
    0.15
     totiž
    0.15
     sooner
    0.15
     nä
    0.14
     ifs
    0.14
     namely
    0.14
     Roy
    0.14
    Act Density 0.217%

    No Known Activations