INDEX
    Explanations

    emotional expressions related to life experiences

    New Auto-Interp
    Negative Logits
    ustr
    -0.16
     Cul
    -0.16
     Jeh
    -0.15
    776
    -0.15
    alian
    -0.14
     رÙĬ
    -0.14
    chner
    -0.14
    rav
    -0.14
     unpopular
    -0.14
    EFA
    -0.14
    POSITIVE LOGITS
    ãĤĩ
    0.18
    abbit
    0.14
    egasus
    0.14
    erne
    0.14
     üzere
    0.14
    nodoc
    0.14
    elic
    0.14
     Wald
    0.14
    arra
    0.14
    imest
    0.13
    Act Density 0.007%

    No Known Activations