INDEX
    Explanations

    references to governmental or organizational policies

    references to government policies

    New Auto-Interp
    Negative Logits
    ITNESS
    -0.85
    issan
    -0.85
    athan
    -0.75
    ãĤ¨ãĥ«
    -0.73
    Vel
    -0.70
    Rocket
    -0.69
     Brotherhood
    -0.69
     Sabha
    -0.69
    avez
    -0.69
     Flavoring
    -0.67
    POSITIVE LOGITS
     policies
    1.11
     prescriptions
    0.91
     Policies
    0.90
     policy
    0.90
     preferences
    0.84
     stances
    0.83
    olicy
    0.82
    policy
    0.79
     governing
    0.79
    aroo
    0.77
    Act Density 0.012%

    No Known Activations