INDEX
    Explanations

    terms related to international relations and political policies

    concepts related to governance and regulation

    New Auto-Interp
    Negative Logits
    iren
    -0.65
    pired
    -0.56
     Yesterday
    -0.56
     Secrets
    -0.54
    htaking
    -0.54
     Congratulations
    -0.53
    hest
    -0.53
     Janeiro
    -0.52
     Michele
    -0.52
    javascript
    -0.51
    POSITIVE LOGITS
     altogether
    1.09
     instead
    1.04
     somewhere
    0.94
     someday
    0.92
    instead
    0.89
     elsewhere
    0.88
     sooner
    0.88
     depending
    0.87
     alternatively
    0.86
     whichever
    0.85
    Act Density 0.849%

    No Known Activations