INDEX
    Explanations

    phrases related to political themes, especially focusing on specific individuals and events

    variations of the word "independent."

    New Auto-Interp
    Negative Logits
     sshd
    -0.93
    ongyang
    -0.88
    ĵĺ
    -0.71
    wagen
    -0.69
    idon
    -0.67
    interstitial
    -0.64
     AVG
    -0.64
    =-=-=-=-=-=-=-=-
    -0.63
    taboola
    -0.62
    =-=-=-=-
    -0.61
    POSITIVE LOGITS
    azeera
    0.87
    aucuses
    0.80
    ixture
    0.71
    oland
    0.71
    apolis
    0.70
    asia
    0.69
    ctr
    0.69
    cest
    0.68
    ogo
    0.68
    ja
    0.67
    Act Density 0.077%

    No Known Activations