INDEX
    Explanations

    terms related to empowerment and enabling actions or opportunities

    New Auto-Interp
    Negative Logits
    wer
    -0.17
    à¹ģรà¸ĩ
    -0.15
    edn
    -0.14
    agate
    -0.14
    cular
    -0.14
    boy
    -0.13
     Horton
    -0.13
    ÐĬ
    -0.13
    urgeon
    -0.13
    lok
    -0.13
    POSITIVE LOGITS
    /disable
    0.24
    ipar
    0.15
    ÑģÑĤÑĮ
    0.15
    ERA
    0.15
    -disable
    0.15
    znám
    0.14
    735
    0.14
    наÑĩе
    0.14
    ì¦Ī
    0.14
    anced
    0.14
    Act Density 0.015%

    No Known Activations