INDEX
    Explanations

    words related to authority, power dynamics, and legal proceedings

    important actions or consequences related to events, particularly in political or social contexts

    New Auto-Interp
    Negative Logits
     Revision
    -0.64
     postwar
    -0.63
     Kush
    -0.62
     1948
    -0.61
     1906
    -0.59
     notwithstanding
    -0.59
     Crunch
    -0.59
     Alley
    -0.58
     Huff
    -0.58
     Ballard
    -0.58
    POSITIVE LOGITS
    tnc
    0.84
    DCS
    0.81
    ]);
    0.78
    >]
    0.76
    aeus
    0.75
    Reviewer
    0.75
    taboola
    0.75
    erd
    0.72
    ');
    0.72
    >>>>
    0.69
    Act Density 0.569%

    No Known Activations