INDEX
    Explanations

    phrases related to politics and government

    New Auto-Interp
    Negative Logits
     Maced
    -0.76
     buoy
    -0.75
     Valhalla
    -0.74
     Elys
    -0.69
     orn
    -0.67
     clut
    -0.67
     flex
    -0.66
     rounded
    -0.66
     brim
    -0.64
     seiz
    -0.64
    POSITIVE LOGITS
    sic
    1.67
    insert
    1.33
    REDACTED
    1.24
    laughs
    1.23
    being
    1.20
    blank
    1.13
    his
    1.12
    emphasis
    1.12
    their
    1.10
    the
    1.08
    Act Density 0.581%

    No Known Activations