INDEX
    Explanations

    elements related to policy decisions and their impact on society

    New Auto-Interp
    Negative Logits
    once
    -0.13
    ειο
    -0.13
    ãĥ
    -0.12
    GORITH
    -0.12
    ÚĺÙĩ
    -0.12
    аÑĪа
    -0.11
     geil
    -0.11
    emoc
    -0.10
    Å¡tÃŃ
    -0.10
    ÑĸллÑı
    -0.10
    POSITIVE LOGITS
     on
    1.19
     trên
    0.77
     на
    0.77
     auf
    0.72
     عÙĦÙī
    0.71
     på
    0.66
    	on
    0.63
     pada
    0.59
    à¸ļà¸Ļ
    0.54
    _on
    0.53
    Act Density 3.613%

    No Known Activations