INDEX
    Explanations

    stricter rules and regulations

    New Auto-Interp
    Negative Logits
     otr
    -0.08
    -thread
    -0.08
    금을
    -0.08
    -0.08
    .skill
    -0.08
    ението
    -0.08
     salvation
    -0.07
    prung
    -0.07
    aviour
    -0.07
    -bike
    -0.07
    POSITIVE LOGITS
    力度
    0.13
     stringent
    0.12
     stric
    0.11
     strict
    0.11
    严格
    0.11
     नियम
    0.10
     regels
    0.10
    监管
    0.10
    措施
    0.10
     правила
    0.09
    Act Density 0.028%

    No Known Activations