INDEX
    Explanations

    phrases related to societal issues and controversies, particularly related to justice, politics, and ideology

    phrases related to societal issues and governmental systems

    New Auto-Interp
    Negative Logits
    76561
    -0.67
    ces
    -0.67
    didn
    -0.63
    was
    -0.63
    poses
    -0.62
    doesn
    -0.62
    laun
    -0.61
    formed
    -0.60
    does
    -0.58
    wrote
    -0.57
    POSITIVE LOGITS
     be
    1.23
     succeed
    1.21
     survive
    1.20
     prove
    1.20
     convince
    1.10
     afford
    1.10
     suffice
    1.09
     justify
    1.08
     fail
    1.08
     decide
    1.07
    Act Density 0.157%

    No Known Activations