INDEX
    Explanations

    concepts related to awareness and taking charge in various situations

    New Auto-Interp
    Negative Logits
    arus
    -0.19
    ocuk
    -0.15
    olik
    -0.14
    alez
    -0.14
    öff
    -0.14
    sto
    -0.13
    ernel
    -0.13
    ÑĦоÑĢма
    -0.13
    rame
    -0.13
    elden
    -0.13
    POSITIVE LOGITS
     of
    0.64
     cá»§a
    0.42
    of
    0.35
    _of
    0.34
     Of
    0.33
    	of
    0.33
    à¸Ĥà¸Ńà¸ĩ
    0.32
     ofs
    0.31
     thereof
    0.30
    -of
    0.29
    Act Density 0.140%

    No Known Activations