INDEX
    Explanations

    references to political consequences and societal expectations

    New Auto-Interp
    Negative Logits
    ialis
    -0.16
     ke
    -0.15
     war
    -0.14
     Packing
    -0.14
    otion
    -0.14
     Kb
    -0.14
     sed
    -0.14
     tw
    -0.14
     m
    -0.14
    ervals
    -0.13
    POSITIVE LOGITS
     migrationBuilder
    0.17
    hang
    0.15
    нок
    0.15
    è±Ĩ
    0.14
    ìĭ¬
    0.14
    acen
    0.14
    shelf
    0.14
    ject
    0.14
    à¸ĩà¸Ĭ
    0.14
    touch
    0.14
    Act Density 0.299%

    No Known Activations