INDEX
    Explanations

    terms related to empowerment and capability enhancement

    New Auto-Interp
    Negative Logits
    ngo
    -0.16
    oenix
    -0.16
    خاÙĨÙĩ
    -0.15
    мага
    -0.15
    ãĥ¼ãĥĭ
    -0.14
    iggins
    -0.14
    ÏģιÏĥ
    -0.14
    $MESS
    -0.14
    arella
    -0.14
    ellen
    -0.14
    POSITIVE LOGITS
    /disable
    0.30
    ment
    0.18
    731
    0.16
     us
    0.16
    242
    0.15
    /dis
    0.15
    247
    0.15
    547
    0.15
    735
    0.15
    472
    0.15
    Act Density 0.025%

    No Known Activations