INDEX
    Explanations

    references to institutions and organizations in a political context

    New Auto-Interp
    Negative Logits
     sqor
    -0.68
    foundland
    -0.65
    MpServer
    -0.62
    ranging
    -0.60
    iannopoulos
    -0.59
    range
    -0.59
    eworthy
    -0.56
    erville
    -0.56
    ãĤ¨ãĥ«
    -0.55
    rising
    -0.55
    POSITIVE LOGITS
     intervened
    1.08
     deems
    1.03
     decides
    0.96
     hadn
    0.92
     interfered
    0.92
     couldn
    0.88
     refuses
    0.86
     approves
    0.85
     considers
    0.85
     interven
    0.84
    Act Density 0.337%

    No Known Activations