INDEX
    Explanations

    phrases related to political discussions and policies

    New Auto-Interp
    Negative Logits
    irez
    -0.68
     Presence
    -0.65
     Prix
    -0.64
     redesign
    -0.63
    umption
    -0.61
    racuse
    -0.60
    ulas
    -0.60
    ipation
    -0.58
    pedia
    -0.58
     irresist
    -0.58
    POSITIVE LOGITS
     whatsoever
    0.89
    THING
    0.72
    worthiness
    0.70
     sudden
    0.70
     us
    0.68
    those
    0.66
    affles
    0.65
    course
    0.65
    these
    0.64
    llor
    0.64
    Act Density 0.043%

    No Known Activations