INDEX
    Explanations

    phrases focusing on accountability and responsibility in governance and societal issues

    Negative Logits
     rather
    -0.20
     none
    -0.17
     instead
    -0.16
     both
    -0.16
     more
    -0.16
    anders
    -0.16
     (
    -0.15
    ary
    -0.15
    ima
    -0.15
    181
    -0.14
    Positive Logits
     بلکه
    0.20
     anymore
    0.19
    แค
    0.18
    plusplus
    0.18
    仅
    0.17
    Affected
    0.17
     LIMITED
    0.17
    affected
    0.17
     limited
    0.17
     sondern
    0.16
    Activation Density 0.039%

    No Known Activations