INDEX
    Explanations

    elements related to personal experience and introspection

    sentences expressing enthusiastic or supportive statements about actions or endeavors.

    New Auto-Interp
    Negative Logits
    })}
    -0.29
    OPSIS
    -0.28
     Leighton
    -0.28
     superstar
    -0.26
    \\
    -0.26
    gmx
    -0.26
     Waters
    -0.25
    нти
    -0.25
    })}\
    -0.25
     core
    -0.24
    POSITIVE LOGITS
     насељу
    0.72
    OGND
    0.72
     صوتيه
    0.69
    <unused43>
    0.69
    <unused58>
    0.69
    <unused8>
    0.69
    <pad>
    0.69
    <unused20>
    0.68
    <unused19>
    0.68
    <unused16>
    0.68
    Act Density 0.245%

    No Known Activations