INDEX
    Explanations

    words related to rules, regulations, and enforcement

    references to rules and laws

    New Auto-Interp
    Negative Logits
    ité
    -0.90
    itate
    -0.88
    acters
    -0.78
     Pradesh
    -0.75
    issance
    -0.75
     Hots
    -0.73
    itant
    -0.72
    velength
    -0.72
    assador
    -0.72
    ity
    -0.71
    POSITIVE LOGITS
    book
    1.21
    books
    1.12
    making
    0.99
    breaker
    0.92
    breakers
    0.91
    maker
    0.89
    makers
    0.84
    witz
    0.81
    lessness
    0.80
     rule
    0.77
    Act Density 0.021%

    No Known Activations