INDEX
    Explanations

    references to laws and principles regarding morality and governance

    New Auto-Interp
    Negative Logits
    uien
    -0.16
    dux
    -0.16
    inent
    -0.15
    fone
    -0.15
    ERA
    -0.14
    .spin
    -0.14
     Sikh
    -0.14
    INED
    -0.14
    agon
    -0.14
    mdb
    -0.14
    POSITIVE LOGITS
    951
    0.16
     Burl
    0.16
    avra
    0.16
    aver
    0.14
    ayar
    0.14
    ipi
    0.14
    'gc
    0.14
    arest
    0.14
    autorelease
    0.14
     warning
    0.14
    Act Density 0.205%

    No Known Activations