INDEX
    Explanations

    phrases related to rules, policies, and procedures

    words related to rules, policies, and structures of authority

    Negative Logits
    pload       -0.78
    nen         -0.71
     Citiz      -0.71
     glim       -0.69
     BUS        -0.69
    ÃŃn         -0.68
    plet        -0.67
    ãĥij        -0.66
    star        -0.66
     Defin      -0.65
    Positive Logits
    ropy        0.73
     XIV        0.65
    -----       0.63
    lain        0.63
     relating   0.62
    ":["        0.62
    utra        0.61
     XIII       0.58
     ---        0.58
    ulhu        0.58
    Activation Density: 0.255%

    No Known Activations