INDEX
    Explanations

    words related to political or government actions

    phrases indicating levels or scales of action and accountability

    New Auto-Interp
    Negative Logits
    Interested
    -0.70
    hin
    -0.68
    photo
    -0.64
    NESS
    -0.61
    Reply
    -0.60
    fill
    -0.60
    gins
    -0.60
    idon
    -0.59
    seeing
    -0.59
     [+
    -0.58
    POSITIVE LOGITS
     least
    1.12
    onement
    1.11
     stake
    1.05
     conferences
    0.93
     universities
    0.91
     airports
    0.89
    abase
    0.89
     home
    0.88
    omics
    0.86
     logger
    0.82
    Act Density 0.125%

    No Known Activations