INDEX
    Explanations

    phrases related to law and order, specifically in the context of authority and societal control

    New Auto-Interp
    Negative Logits
    WWW
    -0.16
    rega
    -0.15
     neoliberal
    -0.14
    æ§ĺ
    -0.14
     Deng
    -0.14
     politic
    -0.14
    regor
    -0.14
     Din
    -0.13
    snapshot
    -0.13
    zte
    -0.13
    POSITIVE LOGITS
    RICS
    0.14
    ãĤ¨ãĥ«
    0.13
    =Math
    0.13
    .Generated
    0.13
     semiclass
    0.13
    /frontend
    0.13
    ov
    0.13
    assin
    0.13
    unsupported
    0.13
     Hemisphere
    0.13
    Act Density 0.017%

    No Known Activations