INDEX
    Explanations

    phrases related to societal issues and responses to them, particularly pertaining to political, economic, and social aspects

    New Auto-Interp
    Negative Logits
     Invention
    -0.71
    foundland
    -0.65
    :,
    -0.64
    ;;;;
    -0.61
     sqor
    -0.59
     noting
    -0.57
    .<
    -0.57
    margin
    -0.57
    rising
    -0.54
    asma
    -0.54
    POSITIVE LOGITS
     could
    1.16
     might
    1.15
     exists
    1.13
     cannot
    1.13
     would
    1.13
     should
    1.07
     existed
    1.07
     couldn
    1.00
     succeeds
    1.00
     went
    1.00
    Act Density 4.286%

    No Known Activations