INDEX
    Explanations

    phrases related to policies, regulations, and financial transactions

    topics related to social justice and discrimination issues

    New Auto-Interp
    Negative Logits
     largeDownload
    -0.67
    DK
    -0.63
    _>
    -0.63
    Dialogue
    -0.62
    CONCLUS
    -0.61
    canon
    -0.60
    APTER
    -0.59
    ilogy
    -0.59
    ovie
    -0.58
    ruary
    -0.58
    POSITIVE LOGITS
     themselves
    0.87
     their
    0.78
     illegally
    0.71
     theirs
    0.71
     harmful
    0.70
    entimes
    0.69
    their
    0.69
     unwanted
    0.68
     lifestyles
    0.67
     costly
    0.67
    Act Density 2.100%

    No Known Activations