INDEX
    Explanations

    political affiliations and positions

    Follows a letter, especially "D" or "R"

    New Auto-Interp
    Negative Logits
    らかに
    -0.46
    hat
    -0.45
     Etats
    -0.45
    PhysRevLett
    -0.44
     Feuerwehr
    -0.44
     gebruik
    -0.44
    rotnie
    -0.44
    ظيم
    -0.43
    alpin
    -0.43
    jandra
    -0.43
    POSITIVE LOGITS
     незавершена
    0.66
     licks
    0.63
     pinulongan
    0.61
    nsito
    0.61
     مشين
    0.61
     تضيفلها
    0.61
    httphttps
    0.60
    0.59
    BeginContext
    0.59
    EndContext
    0.59
    Act Density 0.038%

    No Known Activations