INDEX
    Explanations

    references to policies and guidelines

    New Auto-Interp
    Negative Logits
     mystérie
    -0.44
    withCredentials
    -0.44
     aveug
    -0.40
    saraba
    -0.40
    Find
    -0.38
     kapturem
    -0.38
     אית
    -0.38
     dedans
    -0.38
    Egyptian
    -0.37
    The
    -0.36
    POSITIVE LOGITS
     POLICY
    1.16
     policy
    1.10
     Policy
    1.09
    policy
    1.07
    Policy
    1.02
    POLICY
    1.02
     Policies
    0.89
     POLICIES
    0.89
     policies
    0.87
    Policies
    0.84
    Act Density 0.149%

    No Known Activations