INDEX
    Explanations

    references to political representatives and their affiliations

    New Auto-Interp
    Negative Logits
     Stake
    -0.14
    ationally
    -0.14
    Compat
    -0.14
    bulk
    -0.14
    iferay
    -0.14
     Woodward
    -0.14
     st
    -0.14
     Samp
    -0.13
    vana
    -0.13
    ssi
    -0.13
    POSITIVE LOGITS
    iol
    0.17
    oru
    0.16
    kenin
    0.15
    αν
    0.15
    orget
    0.15
    pert
    0.15
    hint
    0.14
    InstanceState
    0.14
     *</
    0.14
    ansa
    0.13
    Act Density 0.008%

    No Known Activations