INDEX
    Explanations

    references to political dictators

    references to dictators and dictatorial regimes

    Negative Logits
    older        -0.83
    awks         -0.83
    alle         -0.80
    ilk          -0.80
    ttp          -0.80
    LAN          -0.78
    atha         -0.76
    Recommend    -0.76
    IGH          -0.75
    rb           -0.75
    Positive Logits
     dictator        1.34
     dictatorship    1.16
     dictators       1.00
     nomine          0.89
     regime          0.85
     overth          0.85
     regimes         0.84
     tyrant          0.82
     tyranny         0.82
     overthrow       0.81
    Activation Density 0.009%

    No Known Activations