INDEX
    Explanations

    terms related to personal responsibility and action

    terms related to accountability and regulation

    New Auto-Interp
    Negative Logits
     faintly
    -0.74
     distinctly
    -0.73
     definitely
    -0.73
     curiously
    -0.73
    usually
    -0.71
     generally
    -0.71
     equally
    -0.70
     cautiously
    -0.69
     specifically
    -0.69
    psey
    -0.68
    POSITIVE LOGITS
    ãĥīãĥ©
    0.81
    ourced
    0.74
    ifiable
    0.69
    Initialized
    0.69
    arded
    0.66
     fault
    0.66
    ãĥ£
    0.64
    rez
    0.64
    yth
    0.64
    uga
    0.63
    Act Density 0.715%

    No Known Activations