INDEX
    Explanations

    mentions of official policy, rules, regulations, or laws, especially in legal or governmental contexts

    New Auto-Interp
    Negative Logits
     policy
    -2.66
    Policy
    -2.47
     Policy
    -2.44
    policy
    -2.41
     POLICY
    -2.20
     policies
    -2.03
    POLICY
    -2.00
     Policies
    -1.84
    Policies
    -1.71
    policies
    -1.69
    POSITIVE LOGITS
    rungsseite
    0.56
    kesha
    0.51
     inconnu
    0.49
    ]--;
    0.47
    GEBURTS
    0.47
    CppMethod
    0.47
    athione
    0.46
    Nervous
    0.45
    delwed
    0.45
     brancas
    0.45
    Act Density 1.640%

    No Known Activations